2024-05-31

Title: ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users

Title: Video Anomaly Detection in 10 Years: A Survey and Outlook

Title: Using Contrastive Learning with Generative Similarity to Learn Spaces that Capture Human Inductive Biases

Title: Evaluating Vision-Language Models on Bistable Images

Title: Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies

Title: MemControl: Mitigating Memorization in Medical Diffusion Models via Automated Parameter Selection

Title: A Full-duplex Speech Dialogue Scheme Based On Large Language Models

Title: Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data

Title: Contrasting Multiple Representations with the Multi-Marginal Matching Gap

Title: Blind Image Restoration via Fast Diffusion Inversion

Title: SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

Title: Why Larger Language Models Do In-context Learning Differently?

Title: Do spectral cues matter in contrast-based graph self-supervised learning?

Title: Learning Robust Correlation with Foundation Model for Weakly-Supervised Few-Shot Segmentation

Title: Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

Title: View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields

Title: Diffusion Policies creating a Trust Region for Offline Reinforcement Learning

Title: Text Guided Image Editing with Automatic Concept Locating and Forgetting

Title: Streaming Video Diffusion: Online Video Editing with Diffusion Models

Title: HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization

Title: Mitigating annotation shift in cancer classification using single image generative models

Title: Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering

Title: Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding

Title: Recurrent Deep Kernel Learning of Dynamical Systems

Title: Performance Examination of Symbolic Aggregate Approximation in IoT Applications

Title: Joint Selective State Space Model and Detrending for Robust Time Series Anomaly Detection

Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?

Title: Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models

Title: From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

Title: MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion

Title: Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks

Title: PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting

Title: Collective Variable Free Transition Path Sampling with Generative Flow Network

Title: DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World

Title: DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild

Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads

Title: Rapid Wildfire Hotspot Detection Using Self-Supervised Learning on Temporal Remote Sensing Data

Title: FMARS: Annotating Remote Sensing Images for Disaster Management using Foundation Models

Title: RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection

Title: MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models

Title: Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback

Title: MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model

Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Title: CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Title: SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

Title: Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

Title: Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation

Title: ANAH: Analytical Annotation of Hallucinations in Large Language Models

Title: Improving the Training of Rectified Flows

Title: $\textit{S}^3$Gaussian: Self-Supervised Street Gaussians for Autonomous Driving

Title: Don't drop your samples! Coherence-aware training benefits Conditional diffusion

Title: MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion

Title: GECO: Generative Image-to-3D within a SECOnd

Title: CoSy: Evaluating Textual Explanations of Neurons

Title: VividDream: Generating 3D Scene with Ambient Dynamics

Title: OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

Title: From Zero to Hero: Cold-Start Anomaly Detection

Title: Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image