2026-02-25

Title: Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

Title: When Backdoors Go Beyond Triggers: Semantic Drift in Diffusion Models Under Encoder Attacks

Title: Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling

Title: MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning

Title: Discrete Diffusion with Sample-Efficient Estimators for Conditionals

Title: Shape-informed cardiac mechanics surrogates in data-scarce regimes via geometric encoding and generative augmentation

Title: In-context Pre-trained Time-Series Foundation Models adapt to Unseen Tasks

Title: QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Title: GSNR: Graph Smooth Null-Space Representation for Inverse Problems

Title: Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

Title: BiRQA: Bidirectional Robust Quality Assessment for Images

Title: 3DSPA: A 3D Semantic Point Autoencoder for Evaluating Video Realism

Title: Momentum Guidance: Plug-and-Play Guidance for Flow Models

Title: SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images

Title: gQIR: Generative Quanta Image Reconstruction

Title: A Long-Short Flow-Map Perspective for Drifting Models

Title: CGSTA: Cross-Scale Graph Contrast with Stability-Aware Alignment for Multivariate Time-Series Anomaly Detection

Title: SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens

Title: VINA: Variational Invertible Neural Architectures

Title: LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration

Title: Probing and Bridging Geometry-Interaction Cues for Affordance Reasoning in Vision Foundation Models

Title: How Do Inpainting Artifacts Propagate to Language?

Title: Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning

Title: Actor-Curator: Co-adaptive Curriculum Learning via Policy-Improvement Bandits for RL Post-Training

Title: Sample-efficient evidence estimation of score based priors for model selection

Title: GENSR: Symbolic Regression Based in Equation Generative Space

Title: AIForge-Doc: A Benchmark for Detecting AI-Forged Tampering in Financial and Form Documents

Title: Efficient and Explainable End-to-End Autonomous Driving via Masked Vision-Language-Action Diffusion

Title: PropFly: Learning to Propagate via On-the-Fly Supervision from Pre-trained Video Diffusion Models

Title: TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer

Title: AnimeAgent: Is the Multi-Agent via Image-to-Video models a Good Disney Storytelling Artist?

Title: BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity

Title: CAMEL: Confidence-Gated Reflection for Reward Modeling

Title: GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generatio

Title: UrbanFM: Scaling Urban Spatio-Temporal Foundation Models

Title: Vanishing Watermarks: Diffusion-Based Image Editing Undermines Robust Invisible Watermarking

Title: RAYNOVA: 3D-Geometry-Free Auto-Regressive Driving World Modeling with Unified Spatio-Temporal Representation

Title: CleanStyle: Plug-and-Play Style Conditioning Purification for Text-to-Image Stylization

Title: Bridging Physically Based Rendering and Diffusion Models with Stochastic Differential Equation

Title: OrthoDiffusion: A Generalizable Multi-Task Diffusion Foundation Model for Musculoskeletal MRI Interpretation

Title: Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling

Title: VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Title: Training-Free Multi-Concept Image Editing

Title: When Safety Collides: Resolving Multi-Category Harmful Conflicts in Text-to-Image Diffusion via Adaptive Safety Guidance

Title: SpatiaLQA: A Benchmark for Evaluating Spatial Logical Reasoning in Vision-Language Models

Title: TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

Title: Estimation of Confidence Bounds in Binary Classification using Wilson Score Kernel Density Estimation

Title: See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

Title: Cycle-Consistent Tuning for Layered Image Decomposition

Title: From Perception to Action: An Interactive Benchmark for Vision Reasoning

Title: OmniOCR: Generalist OCR for Ethnic Minority Languages

Title: Skullptor: High Fidelity 3D Head Reconstruction in Seconds with Multi-View Normal Prediction

Title: SOM-VQ: Topology-Aware Tokenization for Interactive Generative Models

Title: Seeing Through Words: Controlling Visual Retrieval Quality with Language Models

Title: The Diffusion Duality, Chapter II: $Ψ$-Samplers and Efficient Curriculum

Title: Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning

Title: Human Video Generation from a Single Image with 3D Pose and View Control