2024-03-19

Title: VISREAS: Complex Visual Reasoning with Unanswerable Questions

Title: Semi-Supervised Learning for Anomaly Traffic Detection via Bidirectional Normalizing Flows

Title: Generative Models and Connected and Automated Vehicles: A Survey in Exploring the Intersection of Transportation and AI

Title: Cooling-Guide Diffusion Model for Battery Cell Arrangement

Title: MoPE: Parameter-Efficient and Scalable Multimodal Fusion via Mixture of Prompt Experts

Title: Symbiotic Game and Foundation Models for Cyber Deception Operations in Strategic Cyber Warfare

Title: Neural Erosion: Emulating Controlled Neurodegeneration and Aging in AI Systems

Title: LightIt: Illumination Modeling and Control for Diffusion Models

Title: IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

Title: Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation

Title: StableGarment: Garment-Centric Generation via Stable Diffusion

Title: Time Series Representation Learning with Supervised Contrastive Temporal Transformer

Title: Efficient Pruning of Large Language Model with Adaptive Estimation Fusion

Title: Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples

Title: Anomaly Detection Based on Isolation Mechanisms: A Survey

Title: Active Label Correction for Semantic Segmentation with Foundation Models

Title: VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis

Title: DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation

Title: Just Say the Name: Online Continual Learning with Category Names Only via Data Generation

Title: A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Title: Zero-shot Generative Linguistic Steganography

Title: A Watermark-Conditioned Diffusion Model for IP Protection

Title: DTOR: Decision Tree Outlier Regressor to explain anomalies

Title: Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

Title: Interpretable Machine Learning for TabPFN

Title: ScanTalk: 3D Talking Heads from Unregistered Scans

Title: Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription

Title: Energy-Based Models with Applications to Speech and Language Processing

Title: Exploiting Topological Prior for Boosting Point Cloud Generation

Title: Task-Aware Low-Rank Adaptation of Segment Anything Model

Title: OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Title: Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Title: Neuro-Symbolic Video Search

Title: Reward Guided Latent Consistency Distillation

Title: Endora: Video Generation Models as Endoscopy Simulators

Title: Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention

Title: Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

Title: RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Title: Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning

Title: Incorporating Higher-order Structural Information for Graph Clustering

Title: Hierarchical Generative Network for Face Morphing Attacks

Title: Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Title: Self-Supervised Quantization-Aware Knowledge Distillation

Title: Self-supervised co-salient object detection via feature correspondence at multiple scales

Title: 3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models

Title: A Versatile Framework for Multi-scene Person Re-identification

Title: Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications

Title: Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

Title: CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion

Title: Artifact Feature Purification for Cross-domain Detection of AI-generated Images

Title: Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

Title: usfAD Based Effective Unknown Attack Detection Focused IDS Framework

Title: Self-Supervised Video Desmoking for Laparoscopic Surgery

Title: MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation

Title: MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Title: THOR: Text to Human-Object Interaction Diffusion via Relation Intervention

Title: Understanding Diffusion Models by Feynman's Path Integral

Title: Stylized Face Sketch Extraction via Generative Prior with Limited Data

Title: Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation

Title: BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis

Title: Fast Personalized Text-to-Image Syntheses With Attention Injection

Title: SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

Title: Reasoning in Transformers - Mitigating Spurious Correlations and Reasoning Shortcuts

Title: Few-Shot VQA with Frozen LLMs: A Tale of Two Approaches

Title: GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering

Title: Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

Title: DynamicGlue: Epipolar and Time-Informed Data Association in Dynamic Environments using Graph Neural Networks

Title: Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction

Title: Investigating the Benefits of Projection Head for Representation Learning

Title: Automated data processing and feature engineering for deep learning and big data applications: a survey

Title: DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Title: VmambaIR: Visual State Space Model for Image Restoration

Title: BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors

Title: StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

Title: CasSR: Activating Image Power for Real-World Image Super-Resolution

Title: Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V

Title: Generative Motion Stylization within Canonical Motion Space

Title: Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs

Title: VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Title: SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction

Title: CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization

Title: Do CLIPs Always Generalize Better than ImageNet Models?

Title: Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

Title: DEE: Dual-stage Explainable Evaluation Method for Text Generation

Title: EchoReel: Enhancing Action Generation of Existing Video Diffusion Models

Title: Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

Title: EffiVED:Efficient Video Editing via Text-instruction Diffusion Models

Title: CRS-Diff: Controllable Generative Remote Sensing Foundation Model

Title: LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Title: Arc2Face: A Foundation Model of Human Faces

Title: Diffusion-Based Environment-Aware Trajectory Prediction

Title: Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection

Title: TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models

Title: Urban Scene Diffusion through Semantic Occupancy Map

Title: PITA: Physics-Informed Trajectory Autoencoder

Title: S-JEPA: towards seamless cross-dataset transfer through dynamic spatial attention

Title: Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm

Title: Is It Really You Who Forgot the Password? When Account Recovery Meets Risk-Based Authentication

Title: HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation

Title: Evaluating Text to Image Synthesis: Survey and Taxonomy of Image Quality Metrics

Title: Towards Understanding the Relationship between In-context Learning and Compositional Generalization

Title: GPT-4 as Evaluator: Evaluating Large Language Models on Pest Management in Agriculture

Title: IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images

Title: CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite

Title: InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting

Title: ReGenNet: Towards Human Action-Reaction Synthesis

Title: SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules

Title: CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification

Title: LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Title: Subjective-Aligned Dateset and Metric for Text-to-Video Quality Assessment

Title: Transfer Learning Beyond Bounded Density Ratios

Title: Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

Title: Diffusion Denoising as a Certified Defense against Clean-label Poisoning

Title: Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching

Title: GetMesh: A Controllable Model for High-quality Mesh Generation and Manipulation

Title: Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph Reasoning

Title: Learning Useful Representations of Recurrent Neural Network Weight Matrices

Title: DreamMotion: Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing

Title: GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Title: SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

Title: Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks

Title: VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

Title: HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data

Title: GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Title: Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Title: LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

Title: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

Title: Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Title: VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Title: One-Step Image Translation with Text-to-Image Models

Title: MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Title: Zero-Shot Image Feature Consensus with Deep Functional Maps

Title: Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation