2025-03-13

Title: Versatile Multimodal Controls for Whole-Body Talking Human Animation

Title: A Recipe for Improving Remote Sensing VLM Zero Shot Generalization

Title: Training Plug-n-Play Knowledge Modules with Deep Context Distillation

Title: Preserving Product Fidelity in Large Scale Image Recontextualization with Diffusion Models

Title: Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models

Title: Seal Your Backdoor with Variational Defense

Title: Contrastive Speaker-Aware Learning for Multi-party Dialogue Generation with LLMs

Title: Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs

Title: Multilevel Generative Samplers for Investigating Critical Phenomena

Title: Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model

Title: I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

Title: Image Encryption Using DNA Encoding, Snake Permutation and Chaotic Substitution Techniques

Title: Implicit Contrastive Representation Learning with Guided Stop-gradient

Title: Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows

Title: Multi-Modal Foundation Models for Computational Pathology: A Survey

Title: Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation

Title: Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge

Title: Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?

Title: AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks

Title: Generative Frame Sampler for Long Video Understanding

Title: Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

Title: WonderVerse: Extendable 3D Scene Generation with Video Generative Models

Title: Incomplete Multi-view Clustering via Diffusion Contrastive Generation

Title: Time-EAPCR: A Deep Learning-Based Novel Approach for Anomaly Detection Applied to the Environmental Field

Title: Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latant Space

Title: N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning

Title: Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets

Title: NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers

Title: UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Title: Unmask It! AI-Generated Product Review Detection in Dravidian Languages

Title: Detecting and Preventing Data Poisoning Attacks on AI Models

Title: Revealing the Implicit Noise-based Imprint of Generative Models

Title: Unified Dense Prediction of Video Diffusion

Title: Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X

Title: Towards Graph Foundation Models: A Transferability Perspective

Title: PerCoV2: Improved Ultra-Low Bit-Rate Perceptual Image Compression with Implicit Hierarchical Masked Image Modeling

Title: Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training

Title: ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation

Title: VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Title: Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation

Title: Monte Carlo Diffusion for Generalizable Learning-Based RANSAC

Title: Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

Title: Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation

Title: SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation

Title: Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models

Title: How Well Does Your Tabular Generator Learn the Structure of Tabular Data?

Title: Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness

Title: DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction

Title: Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection

Title: Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder

Title: MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions

Title: CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images

Title: Evaluating Visual Explanations of Attention Maps for Transformer-based Medical Imaging

Title: GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals

Title: TPDiff: Temporal Pyramid Video Diffusion Model

Title: Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Title: Minimax Optimality of the Probability Flow ODE for Diffusion Models

Title: PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

Title: RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling