2026-03-30

Title: A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation Learning

Title: Evaluating Synthetic Images as Effective Substitutes for Experimental Data in Surface Roughness Classification

Title: Pure and Physics-Guided Deep Learning Solutions for Spatio-Temporal Groundwater Level Prediction at Arbitrary Locations

Title: MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training

Title: ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Title: Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception

Title: Speech-Synchronized Whiteboard Generation via VLM-Driven Structured Drawing Representations

Title: DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease

Title: Automated Quality Assessment of Blind Sweep Obstetric Ultrasound for Improved Diagnosis

Title: World Reasoning Arena

Title: DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation

Title: JRM: Joint Reconstruction Model for Multiple Objects without Alignment

Title: Neighbor-Aware Localized Concept Erasure in Text-to-Image Diffusion Models

Title: FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants

Title: Constitutive parameterized deep energy method for solid mechanics problems with random material parameters

Title: Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays

Title: Pioneering Perceptual Video Fluency Assessment: A Novel Task with Benchmark Dataset and Baseline

Title: R-PGA: Robust Physical Adversarial Camouflage Generation via Relightable 3D Gaussian Splatting

Title: When Identities Collapse: A Stress-Test Benchmark for Multi-Subject Personalization

Title: TaxaAdapter: Vision Taxonomy Models are Key to Fine-grained Image Generation over the Tree of Life

Title: InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution

Title: IP-Bench: Benchmark for Image Protection Methods in Image-to-Video Generation Scenarios

Title: Provably Contractive and High-Quality Denoisers for Convergent Restoration

Title: Consistency Beyond Contrast: Enhancing Open-Vocabulary Object Detection Robustness via Contextual Consistency Learning

Title: MemCam: Memory-Augmented Camera Control for Consistent Video Generation

Title: Towards GUI Agents: Vision-Language Diffusion Models for GUI Grounding

Title: DRUM: Diffusion-based Raydrop-aware Unpaired Mapping for Sim2Real LiDAR Segmentation

Title: PhysVid: Physics Aware Local Conditioning for Generative Video Models

Title: Label-Free Cross-Task LoRA Merging with Null-Space Compression

Title: Verify Claimed Text-to-Image Models via Boundary-Aware Prompt Optimization

Title: Reflect to Inform: Boosting Multimodal Reasoning via Information-Gain-Driven Verification

Title: MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model

Title: A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models

Title: Generative Modeling in Protein Design: Neural Representations, Conditional Generation, and Evaluation Standards

Title: Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration

Title: SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras

Title: Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays

Title: AutoWeather4D: Autonomous Driving Video Weather Conversion via G-Buffer Dual-Pass Editing

Title: HolisticSemGes: Semantic Grounding of Holistic Co-Speech Gesture Generation with Contrastive Flow-Matching

Title: Generation Is Compression: Zero-Shot Video Coding via Stochastic Rectified Flow

Title: From Synthetic Data to Real Restorations: Diffusion Model for Patient-specific Dental Crown Completion

Title: Characterization and forecasting of national-scale solar power ramp events

Title: VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Title: Think over Trajectories: Leveraging Video Generation to Reconstruct GPS Trajectories from Cellular Signaling

Title: GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation