2026-03-25

Title: Mitigating Premature Discretization with Progressive Quantization for Robust Vector Tokenization

Title: Full waveform inversion method based on diffusion model

Title: UniFluids: Unified Neural Operator Learning with Conditional Flow-matching

Title: ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography

Title: Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge

Title: MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives

Title: Three Creates All: You Only Sample 3 Steps

Title: Symbolic Graph Networks for Robust PDE Discovery from Noisy Sparse Data

Title: OsteoFlow: Lyapunov-Guided Flow Distillation for Predicting Bone Remodeling after Mandibular Reconstruction

Title: Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

Title: Static Scene Reconstruction from Dynamic Egocentric Videos

Title: MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Title: Model Context Protocol Threat Modeling and Analyzing Vulnerabilities to Prompt Injection with Tool Poisoning

Title: Tiny Inference-Time Scaling with Latent Verifiers

Title: Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning

Title: Sketch2CT: Multimodal Diffusion for Structure-Aware 3D Medical Volume Generation

Title: Adversarial Vulnerabilities in Neural Operator Digital Twins: Gradient-Free Attacks on Nuclear Thermal-Hydraulic Surrogates

Title: Generalized multi-object classification and tracking with sparse feature resonator networks

Title: MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data

Title: CanViT: Toward Active-Vision Foundation Models

Title: A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Title: Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off

Title: PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis

Title: Pretext Matters: An Empirical Study of SSL Methods in Medical Imaging

Title: Bounding Box Anomaly Scoring for simple and efficient Out-of-Distribution detection

Title: How Far Can VLMs Go for Visual Bug Detection? Studying 19,738 Keyframes from 41 Hours of Gameplay Videos

Title: Multitask-Informed Prior for In-Context Learning on Tabular Data: Application to Steel Property Prediction

Title: Multimodal Industrial Anomaly Detection via Geometric Prior

Title: DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona

Title: Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models

Title: Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis

Title: Cross-Slice Knowledge Transfer via Masked Multi-Modal Heterogeneous Graph Contrastive Learning for Spatial Gene Expression Inference

Title: URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection

Title: A Feature Shuffling and Restoration Strategy for Universal Unsupervised Anomaly Detection

Title: Template-Based Feature Aggregation Network for Industrial Anomaly Detection

Title: Balancing Safety and Efficiency in Aircraft Health Diagnosis: A Task Decomposition Framework with Heterogeneous Long-Micro Scale Cascading and Knowledge Distillation-based Interpretability

Title: Asymptotic Learning Curves for Diffusion Models with Random Features Score and Manifold Data

Title: Few-Shot Generative Model Adaption via Identity Injection and Preservation

Title: WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion

Title: Can Graph Foundation Models Generalize Over Architecture?

Title: Zero-Shot Personalization of Objects via Textual Inversion

Title: A Sobering Look at Tabular Data Generation via Probabilistic Circuits

Title: Generative Event Pretraining with Foundation Model Alignment

Title: Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment

Title: YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception

Title: HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling

Title: Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

Title: Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards

Title: NeuroSeg Meets DINOv3: Transferring 2D Self-Supervised Visual Priors to 3D Neuron Segmentation via DINOv3 Initialization

Title: Automatic Segmentation of 3D CT scans with SAM2 using a zero-shot approach

Title: PiCo: Active Manifold Canonicalization for Robust Robotic Visual Anomaly Detection

Title: HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature

Title: DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Title: Conformal Cross-Modal Active Learning

Title: GSwap: Realistic Head Swapping with Dynamic Neural Gaussian Field

Title: Gimbal360: Differentiable Auto-Leveling for Canonicalized $360^\circ$ Panoramic Image Completion

Title: Sparser, Faster, Lighter Transformer Language Models

Title: FDIF: Formula-Driven supervised Learning with Implicit Functions for 3D Medical Image Segmentation

Title: Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

Title: I Came, I Saw, I Explained: Benchmarking Multimodal LLMs on Figurative Meaning in Memes

Title: GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models

Title: Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguistic Style in English and Arabic with Six Large Language Models

Title: Permutation-Symmetrized Diffusion for Unconditional Molecular Generation

Title: SynForceNet: A Force-Driven Global-Local Latent Representation Framework for Lithium-Ion Battery Fault Diagnosis

Title: Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression

Title: Robustness Quantification for Discriminative Models: a New Robustness Metric and its Application to Dynamic Classifier Selection

Title: ViBe: Ultra-High-Resolution Video Synthesis Born from Pure Images

Title: ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Title: FG-Portrait: 3D Flow Guided Editable Portrait Animation

Title: From Feature Learning to Spectral Basis Learning: A Unifying and Flexible Framework for Efficient and Robust Shape Matching

Title: Graph Energy Matching: Transport-Aligned Energy-Based Modeling for Graph Generation

Title: GeoSANE: Learning Geospatial Representations from Models, Not Data

Title: DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection

Title: RealMaster: Lifting Rendered Scenes into Photorealistic Video

Title: InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting

Title: TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation

Title: Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation

Title: WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Title: DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Title: OccAny: Generalized Unconstrained Urban 3D Occupancy