2025-06-05

Title: Test-Time Scaling of Diffusion Models via Noise Trajectory Search

Title: PALADIN : Robust Neural Fingerprinting for Text-to-Image Diffusion Models

Title: FOLIAGE: Towards Physical Intelligence World Models Via Unbounded Surface Evolution

Title: Multimodal Generative AI with Autoregressive LLMs for Human Motion Understanding and Generation: A Way Forward

Title: FLEX: A Large-Scale Multi-Modal Multi-Action Dataset for Fitness Action Quality Assessment

Title: Channel-adaptive Cross-modal Generative Semantic Communication for Point Cloud Transmission

Title: DiaBlo: Diagonal Blocks Are Sufficient For Finetuning

Title: BadReward: Clean-Label Poisoning of Reward Models in Text-to-Image RLHF

Title: Chipmunk: Training-Free Acceleration of Diffusion Transformers with Dynamic Column-Sparse Deltas

Title: Robustness in Both Domains: CLIP Needs a Robust Text Encoder

Title: Adaptive Task Vectors for Large Language Models

Title: Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior

Title: RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions

Title: CHIME: Conditional Hallucination and Integrated Multi-scale Enhancement for Time Series Diffusion Model

Title: DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models

Title: Path Generation and Evaluation in Video Games: A Nonparametric Statistical Approach

Title: Learning Monotonic Probabilities with a Generative Cost Model

Title: VCDiag: Classifying Erroneous Waveforms for Failure Triage Acceleration

Title: Resolving Task Objective Conflicts in Unified Multimodal Understanding and Generation via Task-Aware Mixture-of-Experts

Title: ControlThinker: Unveiling Latent Semantics for Controllable Image Generation through Visual Reasoning

Title: Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision

Title: Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation

Title: EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation

Title: Out-of-Distribution Graph Models Merging

Title: PRJ: Perception-Retrieval-Judgement for Generated Images

Title: Advancements in Artificial Intelligence Applications for Cardiovascular Disease Research

Title: On the Closed-Form of Flow Matching: Generalization Does Not Arise from Target Stochasticity

Title: SAAT: Synergistic Alternating Aggregation Transformer for Image Super-Resolution

Title: Joint Video Enhancement with Deblurring, Super-Resolution, and Frame Interpolation Network

Title: Lower Ricci Curvature for Hypergraphs

Title: Solving Inverse Problems via Diffusion-Based Priors: An Approximation-Free Ensemble Sampling Approach

Title: Optimal Spiking Brain Compression: Improving One-Shot Post-Training Pruning and Quantization for Spiking Neural Networks

Title: Point Cloud Quality Assessment Using the Perceptual Clustering Weighted Graph (PCW-Graph) and Attention Fusion Network

Title: UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation

Title: Image Editing As Programs with Diffusion Models

Title: Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints

Title: Does Prompt Design Impact Quality of Data Imputation by LLMs?

Title: OpenThoughts: Data Recipes for Reasoning Models

Title: Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector

Title: FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers

Title: Sounding that Object: Interactive Object-Aware Image to Audio Generation

Title: UNIC: Unified In-Context Video Editing

Title: Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation

Title: LayerFlow: A Unified Model for Layer-aware Video Generation