2025-11-25

Title: Learning Straight Flows: Variational Flow Matching for Efficient Generation

Title: LLM-Powered Text-Attributed Graph Anomaly Detection via Retrieval-Augmented Reasoning

Title: Comparative Analysis of Large Language Model Inference Serving Systems: A Performance Study of vLLM and HuggingFace TGI

Title: Reconstruction-Driven Multimodal Representation Learning for Automated Media Understanding

Title: Energy-based Autoregressive Generation for Neural Population Dynamics

Title: Finding Pre-Injury Patterns in Triathletes from Lifestyle, Recovery and Load Dynamics Features

Title: AI-driven Generation of MALDI-TOF MS for Microbial Characterization

Title: Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression

Title: Plug-and-Play Multi-Concept Adaptive Blending for High-Fidelity Text-to-Image Synthesis

Title: Tensor Gauge Flow Models

Title: Foundational Question Generation for Video Question Answering via an Embedding-Integrated Approach

Title: Efficient Large-Scale Learning of Minimax Risk Classifiers

Title: Rectifying Mean-Shift in Cascaded Precipitation Nowcasting

Title: Efficient Score Pre-computation for Diffusion Models via Cross-Matrix Krylov Projection

Title: Model-to-Model Knowledge Transmission (M2KT): A Data-Free Framework for Cross-Model Understanding Transfer

Title: MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence

Title: SWITCH: Benchmarking Modeling and Handling of Tangible Interfaces in Long-horizon Embodied Scenarios

Title: GANGR: GAN-Assisted Scalable and Efficient Global Routing Parallelization

Title: VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning

Title: Deepfake Geography: Detecting AI-Generated Satellite Images

Title: QAL: A Loss for Recall Precision Balance in 3D Reconstruction

Title: Show Me: Unifying Instructional Image and Video Generation with Diffusion Models

Title: Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation

Title: Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction

Title: ArticFlow: Generative Simulation of Articulated Mechanisms

Title: Novel View Synthesis from A Few Glimpses via Test-Time Natural Video Completion

Title: Mitigating Catastrophic Forgetting in Streaming Generative and Predictive Learning via Stateful Replay

Title: VITAL: Vision-Encoder-centered Pre-training for LMMs in Visual Quality Assessment

Title: FeRA: Frequency-Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning

Title: Plan-X: Instruct Video Generation via Semantic Planning

Title: SD-PSFNet: Sequential and Dynamic Point Spread Function Network for Image Deraining

Title: RAISECity: A Multimodal Agent Framework for Reality-Aligned 3D World Generation at City-Scale

Title: State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection

Title: MambaX: Image Super-Resolution with State Predictive Control

Title: Curvature-Aware Safety Restoration In LLMs Fine-Tuning

Title: UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Title: IE-Critic-R1: Advancing the Explanatory Measurement of Text-Driven Image Editing for Human Perception Alignment

Title: Versatile Recompression-Aware Perceptual Image Super-Resolution

Title: Spotlight: Identifying and Localizing Video Generation Errors Using VLMs

Title: VCU-Bridge: Hierarchical Visual Connotation Understanding via Semantic Bridging

Title: Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models

Title: Video4Edit: Viewing Image Editing as a Degenerate Temporal Process

Title: UnfoldLDM: Deep Unfolding-based Blind Image Restoration with Latent Diffusion Priors

Title: Nested Unfolding Network for Real-World Concealed Object Segmentation

Title: EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses

Title: Early Lung Cancer Diagnosis from Virtual Follow-up LDCT Generation via Correlational Autoencoder and Latent Flow Matching

Title: ARIAL: An Agentic Framework for Document VQA with Precise Answer Localization

Title: InfiniBench: Infinite Benchmarking for Visual Spatial Reasoning with Customizable Scene Complexity

Title: Generating Synthetic Human Blastocyst Images for In-Vitro Fertilization Blastocyst Grading

Title: MammothModa2: A Unified AR-Diffusion Framework for Multimodal Understanding and Generation

Title: Beyond Words and Pixels: A Benchmark for Implicit World Knowledge Reasoning in Generative Models

Title: Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation

Title: TRIDENT: A Trimodal Cascade Generative Framework for Drug and RNA-Conditioned Cellular Morphology Synthesis

Title: MultiDiffNet: A Multi-Objective Diffusion Framework for Generalizable Brain Decoding

Title: Hierarchical Deep Research with Local-Web RAG: Toward Automated System-Level Materials Discovery

Title: DiVE-k: Differential Visual Reasoning for Fine-grained Image Recognition

Title: ScriptViT: Vision Transformer-Based Personalized Handwriting Generation

Title: DiM-TS: Bridge the Gap between Selective State Space Models and Time Series for Generative Modeling

Title: ConsistCompose: Unified Multimodal Layout Control for Image Composition

Title: FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacement

Title: MagicWand: A Universal Agent for Generation and Evaluation Aligned with User Preference

Title: TRANSPORTER: Transferring Visual Semantics from VLM Manifolds

Title: MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer

Title: Synthetic Curriculum Reinforces Compositional Text-to-Image Generation

Title: ViMix-14M: A Curated Multi-Source Video-Text Dataset with Long-Form, High-Quality Captions and Crawl-Free Access

Title: SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation

Title: When Generative Replay Meets Evolving Deepfakes: Domain-Aware Relative Weighting for Incremental Face Forgery Detection

Title: NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering

Title: Robust Posterior Diffusion-based Sampling via Adaptive Guidance Scale

Title: Breaking Forgetting: Training-Free Few-Shot Class-Incremental Learning via Conditional Diffusion

Title: Hyperspectral Variational Autoencoders for Joint Data Compression and Component Extraction

Title: Zero-Shot Video Deraining with Video Diffusion Models

Title: TimePre: Bridging Accuracy, Efficiency, and Stability in Probabilistic Time-Series Forecasting

Title: Zero-Reference Joint Low-Light Enhancement and Deblurring via Visual Autoregressive Modeling with VLM-Derived Modulation

Title: Generative Myopia: Why Diffusion Models Fail at Structure

Title: Functional Localization Enforced Deep Anomaly Detection Using Fundus Images

Title: Health system learning achieves generalist neuroimaging models

Title: From Healthy Scans to Annotated Tumors: A Tumor Fabrication Framework for 3D Brain MRI Synthesis

Title: Data Augmentation Strategies for Robust Lane Marking Detection

Title: Sphinx: Efficiently Serving Novel View Synthesis using Regression-Guided Selective Refinement

Title: Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers

Title: Neural Geometry Image-Based Representations with Optimal Transport (OT)

Title: Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation

Title: CoD: A Diffusion Foundation Model for Image Compression

Title: Seeing What Matters: Visual Preference Policy Optimization for Visual Generation

Title: LogSyn: A Few-Shot LLM Framework for Structured Insight Extraction from Unstructured General Aviation Maintenance Logs

Title: GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving

Title: Yo'City: Personalized and Boundless 3D Realistic City Scene Generation via Self-Critic Expansion

Title: Thinking Ahead: Foresight Intelligence in MLLMs and World Models

Title: ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion

Title: Any4D: Open-Prompt 4D Generation from Natural Language and Images

Title: NI-Tex: Non-isometric Image-based Garment Texture Generation

Title: ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection

Title: STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution

Title: Doubly Wild Refitting: Model-Free Evaluation of High Dimensional Black-Box Predictions under Convex Losses

Title: PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion

Title: Disc3D: Automatic Curation of High-Quality 3D Dialog Data via Discriminative Object Referring

Title: DiP: Taming Diffusion Models in Pixel Space

Title: Q-Save: Towards Scoring and Attribution for Generated Video Evaluation

Title: FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories

Title: FVAR: Visual Autoregressive Modeling via Next Focus Prediction

Title: Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling

Title: KernelBand: Boosting LLM-based Kernel Optimization with a Hierarchical and Hardware-aware Multi-armed Bandit

Title: HunyuanVideo 1.5 Technical Report

Title: MagicWorld: Interactive Geometry-driven Video World Exploration

Title: MFmamba: A Multi-function Network for Panchromatic Image Resolution Restoration Based on State-Space Model

Title: Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation

Title: One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control

Title: FineXtrol: Controllable Motion Generation via Fine-Grained Text

Title: VeCoR - Velocity Contrastive Regularization for Flow Matching

Title: Leveraging Adversarial Learning for Pathological Fidelity in Virtual Staining

Title: Eevee: Towards Close-up High-resolution Video-based Virtual Try-on

Title: AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention

Title: View-Consistent Diffusion Representations for 3D-Consistent Video Generation

Title: A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation

Title: Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling

Title: Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation

Title: Understanding, Accelerating, and Improving MeanFlow Training

Title: EnfoPath: Energy-Informed Analysis of Generative Trajectories in Flow Matching

Title: HABIT: Human Action Benchmark for Interactive Traffic in CARLA

Title: 3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion

Title: FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation

Title: ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation

Title: From Pixels to Posts: Retrieval-Augmented Fashion Captioning and Hashtag Generation

Title: Masked Diffusion Models are Secretly Learned-Order Autoregressive Models

Title: Test-Time Preference Optimization for Image Restoration

Title: Three-Dimensional Anatomical Data Generation Based on Artificial Neural Networks

Title: ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment

Title: Learning Plug-and-play Memory for Guiding Video Diffusion Models

Title: MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

Title: Solar-GECO: Perovskite Solar Cell Property Prediction with Geometric-Aware Co-Attention

Title: Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

Title: BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment

Title: CDLM: Consistency Diffusion Language Models For Faster Sampling

Title: Tiny-TSM: Efficiently Training a Lightweight SOTA Time Series Foundation Model

Title: ReMatch: Boosting Representation through Matching for Multimodal Retrieval

Title: Open-weight genome language model safeguards: Assessing robustness via adversarial fine-tuning

Title: SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis

Title: Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning

Title: Leveraging LLMs for reward function design in reinforcement learning control tasks

Title: Growing with the Generator: Self-paced GRPO for Video Generation

Title: DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

Title: Efficiency vs. Fidelity: A Comparative Analysis of Diffusion Probabilistic Models and Flow Matching on Low-Resource Hardware

Title: In-Video Instructions: Visual Signals as Generative Control

Title: UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

Title: SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation

Title: Flow Map Distillation Without Data

Title: Breaking the Likelihood-Quality Trade-off in Diffusion Models by Merging Pretrained Experts

Title: Are Image-to-Video Models Good Zero-Shot Image Editors?

Title: VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection

Title: LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context