2025-03-21

Title: Enforcing Cybersecurity Constraints for LLM-driven Robot Agents for Online Transactions

Title: Privacy-Aware RAG: Secure and Isolated Knowledge Retrieval

Title: GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction

Title: Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling

Title: Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Title: CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation

Title: DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis

Title: CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image

Title: GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving

Title: The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation

Title: Multi-focal Conditioned Latent Diffusion for Person Image Synthesis

Title: Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes

Title: RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models

Title: Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection

Title: EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation

Title: Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation

Title: Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion

Title: VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling

Title: UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations

Title: Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation

Title: Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation

Title: Text-Driven Diffusion Model for Sign Language Production

Title: Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras

Title: BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

Title: Multivariate Time Series Anomaly Detection in Industry 5.0

Title: Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation

Title: A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli

Title: DIPLI: Deep Image Prior Lucky Imaging for Blind Astronomical Image Restoration

Title: SenseExpo: Efficient Autonomous Exploration with Prediction Information from Lightweight Neural Networks

Title: Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models

Title: The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement

Title: Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Title: Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model

Title: PoseTraj: Pose-Aware Trajectory Control in Video Diffusion

Title: Cultural Alignment in Large Language Models Using Soft Prompt Tuning

Title: OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP

Title: Improving Discriminator Guidance in Diffusion Models

Title: FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing

Title: Towards Lighter and Robust Evaluation for Retrieval Augmented Generation

Title: Guardians of Generation: Dynamic Inference-Time Copyright Shielding with Adaptive Guidance for AI Image Generation

Title: VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis

Title: Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts

Title: M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation

Title: Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens

Title: SceneMI: Motion In-betweening for Modeling Human-Scene Interactions

Title: Unleashing Vecset Diffusion Model for Fast Shape Generation

Title: Structured-Noise Masked Modeling for Video, Audio and Beyond

Title: Ultra-Resolution Adaptation with Ease

Title: Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences

Title: JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Title: NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes

Title: LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images

Title: Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Title: SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation

Title: Scale-wise Distillation of Diffusion Models

Title: ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos

Title: DreamTexture: Shape from Virtual Texture with Analysis by Augmentation

Title: M3: 3D-Spatial MultiModal Memory

Title: InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Title: SynCity: Training-Free Generation of 3D Worlds

Title: Tokenize Image as a Set

Title: DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding

Title: Sonata: Self-Supervised Learning of Reliable Point Representations