2025-05-29

Title: UniDB++: Fast Sampling of Unified Diffusion Bridge

Title: Self-Organizing Visual Prototypes for Non-Parametric Representation Learning

Title: Equivariant Flow Matching for Point Cloud Assembly

Title: DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

Title: Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

Title: Multi-instance Learning as Downstream Task of Self-Supervised Learning-based Pre-trained Model

Title: Diffusion Model-based Activity Completion for AI Motion Capture from Videos

Title: Do We Need All the Synthetic Data? Towards Targeted Synthetic Image Augmentation via Diffusion Models

Title: CellCLAT: Preserving Topology and Trimming Redundancy in Self-Supervised Cellular Contrastive Learning

Title: Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning

Title: Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion

Title: VideoMarkBench: Benchmarking Robustness of Video Watermarking

Title: Object Concepts Emerge from Motion

Title: Efficient Diffusion Models for Symmetric Manifolds

Title: Geometric Feature Prompting of Image Segmentation Models

Title: Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation

Title: Efficient Controllable Diffusion via Optimal Classifier Guidance

Title: What happens when generative AI models train recursively on each others' generated outputs?

Title: MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis

Title: A Joint Reconstruction-Triplet Loss Autoencoder Approach Towards Unseen Attack Detection in IoV Networks

Title: LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing

Title: What is Adversarial Training for Diffusion Models?

Title: Simulating the Unseen: Crash Prediction Must Learn from What Did Not Happen

Title: Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals

Title: DualSchool: How Reliable are LLMs for Optimization Education?

Title: Memorization to Generalization: Emergence of Diffusion Models from Associative Memory

Title: Compositional Scene Understanding through Inverse Generative Modeling

Title: VeriTrail: Closed-Domain Hallucination Detection with Traceability

Title: SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation

Title: ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation

Title: Representative Language Generation

Title: TuneComp: Joint Fine-tuning and Compression for Large Foundation Models

Title: UniMoGen: Universal Motion Generation

Title: FPAN: Mitigating Replication in Diffusion Models through the Fine-Grained Probabilistic Addition of Noise to Token Embeddings

Title: Revisiting Bayesian Model Averaging in the Era of Foundation Models

Title: EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Title: Hyperspectral Gaussian Splatting

Title: SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training

Title: CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation

Title: Reference-Guided Identity Preserving Face Restoration

Title: AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment

Title: Self-supervised Learning Method Using Transformer for Multi-dimensional Sensor Data Processing

Title: InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective

Title: FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design

Title: Beyond Completion: A Foundation Model for General Knowledge Graph Reasoning

Title: One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

Title: A2Seek: Towards Reasoning-Centric Benchmark for Aerial Anomaly Understanding

Title: DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinates-based Diffusion Model

Title: Learning World Models for Interactive Video Generation

Title: D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples

Title: VulBinLLM: LLM-powered Vulnerability Detection for Stripped Binaries

Title: PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms

Title: Weakly-Supervised Contrastive Learning for Imprecise Class Labels

Title: OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning

Title: Bringing CLIP to the Clinic: Dynamic Soft Labels and Negation-Aware Learning for Medical Analysis

Title: Autoregression-free video prediction using diffusion model for mitigating error propagation

Title: Multimodal Forecasting of Sparse Intraoperative Hypotension Events Powered by Language Model

Title: SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

Title: What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Title: FaceEditTalker: Interactive Talking Head Generation with Facial Attribute Editing

Title: InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

Title: Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes

Title: Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers

Title: An Augmentation-Aware Theory for Self-Supervised Contrastive Learning

Title: Investigating Mechanisms for In-Context Vision Language Binding

Title: LaMM: Semi-Supervised Pre-Training of Large-Scale Materials Models

Title: A Survey on Training-free Open-Vocabulary Semantic Segmentation

Title: Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation

Title: StateSpaceDiffuser: Bringing Long Context to Diffusion World Models

Title: Domain Adaptation of Attention Heads for Zero-shot Anomaly Detection

Title: Neural Restoration of Greening Defects in Historical Autochrome Photographs Based on Purely Synthetic Data

Title: Compensating for Data with Reasoning: Low-Resource Machine Translation with LLMs

Title: Versatile Cardiovascular Signal Generation with a Unified Diffusion Transformer

Title: A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective

Title: Task-Driven Implicit Representations for Automated Design of LiDAR Systems

Title: Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation

Title: PacTure: Efficient PBR Texture Generation on Packed Views with Visual Autoregressive Models

Title: Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation

Title: Frugal Incremental Generative Modeling using Variational Autoencoders

Title: Position: All Current Generative Fidelity and Diversity Metrics are Flawed

Title: Fostering Video Reasoning via Next-Event Prediction

Title: Understanding Adversarial Training with Energy-based Models

Title: ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods

Title: PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

Title: Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo

Title: TabularQGAN: A Quantum Generative Model for Tabular Data

Title: Scaling-up Perceptual Video Quality Assessment

Title: DES-LOC: Desynced Low Communication Adaptive Optimizers for Training Foundation Models

Title: ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models

Title: Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Title: ObjectClear: Complete Object Removal via Object-Effect Attention

Title: SimProcess: High Fidelity Simulation of Noisy ICS Physical Processes

Title: SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation