2025-12-23

Title: SuperFlow: Training Flow Matching Models with RL on the Fly

Title: FPBench: A Comprehensive Benchmark of Multimodal Large Language Models for Fingerprint Analysis

Title: SERA-H: Beyond Native Sentinel Spatial Limits for High-Resolution Canopy Height Mapping

Title: Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection

Title: Local Patches Meet Global Context: Scalable 3D Diffusion Priors for Computed Tomography Reconstruction

Title: MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Title: Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching

Title: Loom: Diffusion-Transformer for Interleaved Generation

Title: NOVA: Discovering Well-Conditioned Winograd Transforms through Numerical Optimization of Vandermonde Arithmetic

Title: Plasticine: A Traceable Diffusion Model for Medical Image Translation

Title: Self-organizing maps for water quality assessment in reservoirs and lakes: A systematic literature review

Title: Feature-Enhanced Graph Neural Networks for Classification of Synthetic Graph Generative Models: A Benchmarking Study

Title: Enhancing Medical Large Vision-Language Models via Alignment Distillation

Title: Comparing Dynamical Models Through Diffeomorphic Vector Field Alignment

Title: SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models

Title: Benchmarking neural surrogates on realistic spatiotemporal multiphysics flows

Title: SimpleCall: A Lightweight Image Restoration Agent in Label-Free Environments with MLLM Perceptual Feedback

Title: PTTA: A Pure Text-to-Animation Framework for High-Quality Creation

Title: Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers

Title: Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts

Title: Generating Risky Samples with Conformity Constraints via Diffusion Models

Title: $M^3-Verse$: A "Spot the Difference" Challenge for Large Multimodal Models

Title: Is Your Conditional Diffusion Model Actually Denoising?

Title: Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation

Title: MaskFocus: Focusing Policy Optimization on Critical Steps for Masked Image Generation

Title: In-Context Audio Control of Video Diffusion Transformers

Title: Tempo as the Stable Cue: Hierarchical Mixture of Tempo and Beat Experts for Music to 3D Dance Generation

Title: Revealing Perception and Generation Dynamics in LVLMs: Mitigating Hallucinations via Validated Dominance Correction

Title: EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer

Title: Generative Modeling through Spectral Analysis of Koopman Operator

Title: The Ensemble Schr{ö}dinger Bridge filter for Nonlinear Data Assimilation

Title: LouvreSAE: Sparse Autoencoders for Interpretable and Controllable Style Transfer

Title: When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models

Title: Symmetrization of 3D Generative Models

Title: Scaling Online Distributionally Robust Reinforcement Learning: Sample-Efficient Guarantees with General Function Approximation

Title: DVI: Disentangling Semantic and Visual Identity for Training-Free Personalized Generation

Title: CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Title: Finer-Personalization Rank: Fine-Grained Retrieval Examines Identity Preservation for Personalized Generation

Title: WaTeRFlow: Watermark Temporal Robustness via Flow Consistency

Title: Decoupled Generative Modeling for Human-Object Interaction Synthesis

Title: Efficient Personalization of Generative Models via Optimal Experimental Design

Title: Watch Closely: Mitigating Object Hallucinations in Large Vision-Language Models with Disentangled Decoding

Title: Generative Giants, Retrieval Weaklings: Why do Multimodal Large Language Models Fail at Multimodal Retrieval?

Title: OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions

Title: HippMetric: A skeletal-representation-based framework for cross-sectional and longitudinal hippocampal substructural morphometry

Title: Regression generation adversarial network based on dual data evaluation strategy for industrial application

Title: VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis

Title: 3SGen: Unified Subject, Style, and Structure-Driven Image Generation with Adaptive Task-specific Memory

Title: Is Visual Realism Enough? Evaluating Gait Biometric Fidelity in Generative AI Human Animation

Title: RMLer: Synthesizing Novel Objects across Diverse Categories via Reinforcement Mixing Learning

Title: MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture

Title: GANeXt: A Fully ConvNeXt-Enhanced Generative Adversarial Network for MRI- and CBCT-to-CT Synthesis

Title: Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation

Title: dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Title: Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation

Title: DK-STN: A Domain Knowledge Embedded Spatio-Temporal Network Model for MJO Forecast

Title: StoryMem: Multi-shot Long Video Storytelling with Memory

Title: ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars

Title: BabyFlow: 3D modeling of realistic and expressive infant faces

Title: MapTrace: Scalable Data Generation for Route Tracing on Maps

Title: Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment

Title: Over++: Generative Video Compositing for Layer Interaction Effects

Title: Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning

Title: WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Title: VA-$π$: Variational Policy Alignment for Pixel-Aware Autoregressive Generation

Title: From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs

Title: Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models

Title: Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models