2025-03-21

Title: GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction

Title: Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling

Title: CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation

Title: LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Title: Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and Vienna

Title: DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis

Title: The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation

Title: Multi-focal Conditioned Latent Diffusion for Person Image Synthesis

Title: Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes

Title: ATTENTION2D: Communication Efficient Distributed Self-Attention Mechanism

Title: AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models

Title: RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models

Title: DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs

Title: Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection

Title: EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation

Title: What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation?

Title: Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion

Title: VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling

Title: UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations

Title: MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving

Title: Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation

Title: UMIT: Unifying Medical Imaging Tasks via Vision-Language Models

Title: BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

Title: UniCrossAdapter: Multimodal Adaptation of CLIP for Radiology Report Generation

Title: Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation

Title: A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli

Title: DIPLI: Deep Image Prior Lucky Imaging for Blind Astronomical Image Restoration

Title: Automating 3D Dataset Generation with Neural Radiance Fields

Title: SenseExpo: Efficient Autonomous Exploration with Prediction Information from Lightweight Neural Networks

Title: Single Image Iterative Subject-driven Generation and Editing

Title: Closer to Ground Truth: Realistic Shape and Appearance Labeled Data Generation for Unsupervised Underwater Image Segmentation

Title: Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution

Title: Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Title: PoseTraj: Pose-Aware Trajectory Control in Video Diffusion

Title: MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures

Title: FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing

Title: Guardians of Generation: Dynamic Inference-Time Copyright Shielding with Adaptive Guidance for AI Image Generation

Title: Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction

Title: VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis

Title: Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts

Title: Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data

Title: Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens

Title: SceneMI: Motion In-betweening for Modeling Human-Scene Interactions

Title: Unleashing Vecset Diffusion Model for Fast Shape Generation

Title: Ultra-Resolution Adaptation with Ease

Title: UniSync: A Unified Framework for Audio-Visual Synchronization

Title: NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes

Title: LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images

Title: SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation

Title: Scale-wise Distillation of Diffusion Models

Title: ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos

Title: DreamTexture: Shape from Virtual Texture with Analysis by Augmentation

Title: InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Title: SynCity: Training-Free Generation of 3D Worlds

Title: MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

Title: Tokenize Image as a Set

Title: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation