2025-04-17

Title: LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation

Title: Possibility for Proactive Anomaly Detection

Title: Improving Instruct Models for Free: A Study on Partial Adaptation

Title: Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics

Title: H$^3$GNNs: Harmonizing Heterophily and Homophily in GNNs via Joint Structural Node Encoding and Self-Supervised Learning

Title: Learning What NOT to Count

Title: Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset

Title: Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching

Title: EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos

Title: The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Title: GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision

Title: PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility

Title: ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model

Title: Real-World Depth Recovery via Structure Uncertainty Modeling and Inaccurate GT Depth Fitting

Title: Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation

Title: Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels

Title: ACE: Attentional Concept Erasure in Diffusion Models

Title: Search is All You Need for Few-shot Anomaly Detection

Title: AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection

Title: SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models

Title: Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning

Title: VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning

Title: R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors

Title: Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions

Title: Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems

Title: A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning

Title: Balancing Graph Embedding Smoothness in Self-Supervised Learning via Information-Theoretic Decomposition

Title: Understanding Attention Mechanism in Video Diffusion Models

Title: Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM

Title: Generative Deep Learning Framework for Inverse Design of Fuels

Title: DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency

Title: Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection

Title: Generalized Visual Relation Detection with Diffusion Models

Title: A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction

Title: Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis

Title: RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning

Title: Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline

Title: d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Title: Coding-Prior Guided Diffusion Network for Video Deblurring

Title: Cobra: Efficient Line Art COlorization with BRoAder References

Title: SIDME: Self-supervised Image Demoiréing via Masked Encoder-Decoder Reconstruction

Title: VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Title: How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions

Title: SHeaP: Self-Supervised Head Geometry Predictor Learned via 2D Gaussians