2025-07-04

Title: GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters

Title: Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model

Title: CROP: Circuit Retrieval and Optimization with Parameter Guidance using LLMs

Title: Generative Latent Diffusion for Efficient Spatiotemporal Data Reduction

Title: ESTR-CoT: Towards Explainable and Accurate Event Stream based Scene Text Recognition with Chain-of-Thought Reasoning

Title: SciGA: A Comprehensive Dataset for Designing Graphical Abstracts in Academic Papers

Title: Spotlighting Partially Visible Cinematic Language for Video-to-Audio Generation via Self-distillation

Title: DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation

Title: MAGIC: Mask-Guided Diffusion Inpainting with Multi-Level Perturbations and Context-Aware Alignment for Few-Shot Anomaly Generation

Title: Improving Constrained Generation in Language Models via Self-Distilled Twisted Sequential Monte Carlo

Title: Are Synthetic Videos Useful? A Benchmark for Retrieval-Centric Evaluation of Synthetic Videos

Title: Transformer-based EEG Decoding: A Survey

Title: Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Title: Holistic Tokenizer for Autoregressive Image Generation

Title: PosDiffAE: Position-aware Diffusion Auto-encoder For High-Resolution Brain Tissue Classification Incorporating Artifact Restoration

Title: Mesh Silksong: Auto-Regressive Mesh Generation as Weaving Silk

Title: RetrySQL: text-to-SQL training with retry data for self-correcting query generation

Title: Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation

Title: Medical Data Pecking: A Context-Aware Approach for Automated Quality Evaluation of Structured Medical Data

Title: High-Order Deep Meta-Learning with Category-Theoretic Interpretation

Title: OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding

Title: AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models

Title: Guided Generation for Developable Antibodies

Title: Embedding-Based Federated Data Sharing via Differentially Private Conditional VAEs

Title: UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation

Title: FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Title: Prompt learning with bounding box constraints for medical image segmentation

Title: RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation

Title: No time to train! Training-Free Reference-Based Instance Segmentation

Title: LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Title: AnyI2V: Animating Any Conditional Image with Motion Control

Title: Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Title: RefTok: Reference-Based Tokenization for Video Generation