2025-05-29

Title: SIMCOPILOT: Evaluating Large Language Models for Copilot-Style Code Generation

Title: Do DeepFake Attribution Models Generalize?

Title: Learning Shared Representations from Unpaired Data

Title: Temporal Restoration and Spatial Rewiring for Source-Free Multivariate Time Series Domain Adaptation

Title: UniDB++: Fast Sampling of Unified Diffusion Bridge

Title: DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

Title: Vision Meets Language: A RAG-Augmented YOLOv8 Framework for Coffee Disease Diagnosis and Farmer Assistance

Title: Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

Title: Image Tokens Matter: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing

Title: Do We Need All the Synthetic Data? Towards Targeted Synthetic Image Augmentation via Diffusion Models

Title: BaryIR: Learning Multi-Source Unified Representation in Continuous Barycenter Space for Generalizable All-in-One Image Restoration

Title: Efficient Diffusion Models for Symmetric Manifolds

Title: Geometric Feature Prompting of Image Segmentation Models

Title: Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation

Title: PreGenie: An Agentic Framework for High-quality Visual Presentation Generation

Title: Efficient Controllable Diffusion via Optimal Classifier Guidance

Title: What happens when generative AI models train recursively on each others' generated outputs?

Title: OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions

Title: Simulating the Unseen: Crash Prediction Must Learn from What Did Not Happen

Title: Learning to See More: UAS-Guided Super-Resolution of Satellite Imagery for Precision Agriculture

Title: DualSchool: How Reliable are LLMs for Optimization Education?

Title: Memorization to Generalization: Emergence of Diffusion Models from Associative Memory

Title: Compositional Scene Understanding through Inverse Generative Modeling

Title: ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation

Title: HDRSDR-VQA: A Subjective Video Quality Dataset for HDR and SDR Comparative Evaluation

Title: UniMoGen: Universal Motion Generation

Title: SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training

Title: Compressing Sine-Activated Low-Rank Adapters through Post-Training Quantization

Title: Concentrate on Weakness: Mining Hard Prototypes for Few-Shot Medical Image Segmentation

Title: Reference-Guided Identity Preserving Face Restoration

Title: AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment

Title: Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image Generation

Title: One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

Title: DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinates-based Diffusion Model

Title: Two-Stage Feature Generation with Transformer and Reinforcement Learning

Title: Learning World Models for Interactive Video Generation

Title: PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms

Title: GL-PGENet: A Parameterized Generation Framework for Robust Document Image Enhancement

Title: OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning

Title: Detecting Undesired Process Behavior by Means of Retrieval Augmented Generation

Title: LatentMove: Towards Complex Human Movement Video Generation

Title: Differentiable Generalized Sliced Wasserstein Plans

Title: AquaMonitor: A multimodal multi-view image sequence dataset for real-life aquatic invertebrate biodiversity monitoring

Title: From Failures to Fixes: LLM-Driven Scenario Repair for Self-Evolving Autonomous Driving

Title: SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

Title: Real-Time Blind Defocus Deblurring for Earth Observation: The IMAGIN-e Mission Approach

Title: What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Title: FaceEditTalker: Interactive Talking Head Generation with Facial Attribute Editing

Title: Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers

Title: A Survey on Training-free Open-Vocabulary Semantic Segmentation

Title: Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation

Title: Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation

Title: YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction

Title: From Controlled Scenarios to Real-World: Cross-Domain Degradation Pattern Matching for All-in-One Image Restoration

Title: Neural Restoration of Greening Defects in Historical Autochrome Photographs Based on Purely Synthetic Data

Title: Versatile Cardiovascular Signal Generation with a Unified Diffusion Transformer

Title: Task-Driven Implicit Representations for Automated Design of LiDAR Systems

Title: Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

Title: Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion

Title: Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation

Title: PacTure: Efficient PBR Texture Generation on Packed Views with Visual Autoregressive Models

Title: Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation

Title: Frugal Incremental Generative Modeling using Variational Autoencoders

Title: Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Title: Scaling Reasoning without Attention

Title: Data-Driven Antenna Miniaturization: A Knowledge-Based System Integrating Quantum PSO and Predictive Machine Learning Models

Title: Position: All Current Generative Fidelity and Diversity Metrics are Flawed

Title: Understanding Adversarial Training with Energy-based Models

Title: ProCrop: Learning Aesthetic Image Cropping from Professional Compositions

Title: ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods

Title: PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

Title: Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo

Title: Thinking with Generated Images

Title: TabularQGAN: A Quantum Generative Model for Tabular Data

Title: Scaling-up Perceptual Video Quality Assessment

Title: Universal Visuo-Tactile Video Understanding for Embodied Interaction

Title: ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models

Title: Tell me Habibi, is it Real or Fake?

Title: RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

Title: SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation

Title: Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Title: Sherlock: Self-Correcting Reasoning in Vision-Language Models

Title: Training Free Stylized Abstraction