2025-06-27

Title: Diffusion Tree Sampling: Scalable inference-time alignment of diffusion models

Title: On Convolutions, Intrinsic Dimension, and Diffusion Models

Title: Characterization and Mitigation of Training Instabilities in Microscaling Formats

Title: Stochastic and Non-local Closure Modeling for Nonlinear Dynamical Systems via Latent Score-based Generative Models

Title: Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models

Title: MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans

Title: LLM-guided Chemical Process Optimization with a Multi-Agent Approach

Title: PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

Title: DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing

Title: Rethink Sparse Signals for Pose-guided Text-to-image Generation

Title: Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Title: Distilling Normalizing Flows

Title: Bridging Video Quality Scoring and Justification via Large Multimodal Models

Title: HybridQ: Hybrid Classical-Quantum Generative Adversarial Network for Skin Disease Image Generation

Title: Multimodal Prompt Alignment for Facial Expression Recognition

Title: LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection

Title: Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation

Title: Efficient Skill Discovery via Regret-Aware Optimization

Title: Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features

Title: PoseMaster: Generating 3D Characters in Arbitrary Poses from a Single Image

Title: OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography

Title: Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges

Title: Learning to See in the Extremely Dark

Title: Generative Adversarial Evasion and Out-of-Distribution Detection for UAV Cyber-Attacks

Title: Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image

Title: Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design

Title: BitMark for Infinity: Watermarking Bitwise Autoregressive Image Generative Models

Title: Video Virtual Try-on with Conditional Diffusion Transformer Inpainter

Title: HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation

Title: DynamicBench: Evaluating Real-Time Report Generation in Large Language Models

Title: ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Title: CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations

Title: GenFlow: Interactive Modular System for Image Generation

Title: XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation

Title: Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

Title: Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

Title: Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration

Title: DeOcc-1-to-3: 3D De-Occlusion from a Single Image via Self-Supervised Multi-View Diffusion

Title: Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test