2026-03-25

Title: Founder effects shape the evolutionary dynamics of multimodality in open LLM families

Title: Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks

Title: Mitigating Premature Discretization with Progressive Quantization for Robust Vector Tokenization

Title: Full waveform inversion method based on diffusion model

Title: UniFluids: Unified Neural Operator Learning with Conditional Flow-matching

Title: ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography

Title: Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge

Title: MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives

Title: Three Creates All: You Only Sample 3 Steps

Title: OsteoFlow: Lyapunov-Guided Flow Distillation for Predicting Bone Remodeling after Mandibular Reconstruction

Title: MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Title: Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing

Title: Tiny Inference-Time Scaling with Latent Verifiers

Title: Sketch2CT: Multimodal Diffusion for Structure-Aware 3D Medical Volume Generation

Title: Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Title: UrbanVGGT: Scalable Sidewalk Width Estimation from Street View Images

Title: Generalized multi-object classification and tracking with sparse feature resonator networks

Title: MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data

Title: TrajLoom: Dense Future Trajectory Generation from Video

Title: Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off

Title: A Vision Language Model for Generating Procedural Plant Architecture Representations from Simulated Images

Title: Q-Tacit: Image Quality Assessment via Latent Visual Reasoning

Title: GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning

Title: WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment

Title: TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation

Title: Behavioral Heterogeneity as Quantum-Inspired Representation

Title: From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery

Title: Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models

Title: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

Title: It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal

Title: URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection

Title: A Feature Shuffling and Restoration Strategy for Universal Unsupervised Anomaly Detection

Title: Designing to Forget: Deep Semi-parametric Models for Unlearning

Title: Caption Generation for Dongba Paintings via Prompt Learning and Semantic Fusion

Title: Few-Shot Generative Model Adaption via Identity Injection and Preservation

Title: WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion

Title: VQ-Jarvis: Retrieval-Augmented Video Restoration Agent with Sharp Vision and Fast Thought

Title: Zero-Shot Personalization of Objects via Textual Inversion

Title: A Sobering Look at Tabular Data Generation via Probabilistic Circuits

Title: Generative Event Pretraining with Foundation Model Alignment

Title: HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling

Title: MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding

Title: Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards

Title: InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Title: DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Title: VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution

Title: GSwap: Realistic Head Swapping with Dynamic Neural Gaussian Field

Title: Gimbal360: Differentiable Auto-Leveling for Canonicalized $360^\circ$ Panoramic Image Completion

Title: GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models

Title: A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling

Title: Permutation-Symmetrized Diffusion for Unconditional Molecular Generation

Title: Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning

Title: Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression

Title: Robustness Quantification for Discriminative Models: a New Robustness Metric and its Application to Dynamic Classifier Selection

Title: ViBe: Ultra-High-Resolution Video Synthesis Born from Pure Images

Title: An Explainable AI-Driven Framework for Automated Brain Tumor Segmentation Using an Attention-Enhanced U-Net

Title: ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Title: SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM

Title: Graph Energy Matching: Transport-Aligned Energy-Based Modeling for Graph Generation

Title: GeoSANE: Learning Geospatial Representations from Models, Not Data

Title: I3DM: Implicit 3D-aware Memory Retrieval and Injection for Consistent Video Scene Generation

Title: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Title: RealMaster: Lifting Rendered Scenes into Photorealistic Video

Title: InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting

Title: One View Is Enough! Monocular Training for In-the-Wild Novel View Generation

Title: Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation

Title: WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Title: DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Title: UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Title: MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage