2026-03-04

Title: Forecasting as Rendering: A 2D Gaussian Splatting Framework for Time Series Forecasting

Title: Generalized Discrete Diffusion with Self-Correction

Title: CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

Title: CamDirector: Towards Long-Term Coherent Video Trajectory Editing

Title: PRISM: Exploring Heterogeneous Pretrained EEG Foundation Model Transfer to Clinical Differential Diagnosis

Title: Graph Attention Based Prioritization of Disease Responsible Genes from Multimodal Alzheimer's Network

Title: Characterizing Memorization in Diffusion Language Models: Generalized Extraction and Sampling Effects

Title: Preconditioned Score and Flow Matching

Title: Diffusion-MPC in Discrete Domains: Feasibility Constraints, Horizon Effects, and Critic Alignment: Case study with Tetris

Title: MERG3R: A Divide-and-Conquer Approach to Large-Scale Neural Visual Geometry

Title: Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

Title: DINOv3 Visual Representations for Blueberry Perception Toward Robotic Harvesting

Title: Spectral Regularization for Diffusion Models

Title: NeighborMAE: Exploiting Spatial Dependencies between Neighboring Earth Observation Images in Masked Autoencoders Pretraining

Title: Bridging Diffusion Guidance and Anderson Acceleration via Hopfield Dynamics

Title: On Discriminative vs. Generative classifiers: Rethinking MLLMs for Action Understanding

Title: CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

Title: Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation

Title: Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving

Title: Direct Reward Fine-Tuning on Poses for Single Image to 3D Human in the Wild

Title: Towards an Incremental Unified Multimodal Anomaly Detection: Augmenting Multimodal Denoising From an Information Bottleneck Perspective

Title: Improving Diffusion Planners by Self-Supervised Action Gating with Energies

Title: Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Title: DREAM: Where Visual Understanding Meets Text-to-Image Generation

Title: ReCo-Diff: Residual-Conditioned Deterministic Sampling for Cold Diffusion in Sparse-View CT

Title: FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution

Title: Sensory-Aware Sequential Recommendation via Review-Distilled Representations

Title: MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration

Title: CoShadow: Multi-Object Shadow Generation for Image Compositing via Diffusion Model

Title: Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration

Title: Scores Know Bobs Voice: Speaker Impersonation Attack

Title: Designing UNICORN: a Unified Benchmark for Imaging in Computational Pathology, Radiology, and Natural Language

Title: ScribeTokens: Fixed-Vocabulary Tokenization of Digital Ink

Title: Toward Early Quality Assessment of Text-to-Image Diffusion Models

Title: Adapting Time Series Foundation Models through Data Mixtures

Title: A Browser-based Open Source Assistant for Multimodal Content Verification

Title: DSBA: Dynamic Stealthy Backdoor Attack with Collaborative Optimization in Self-Supervised Learning

Title: SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion

Title: SemanticDialect: Semantic-Aware Mixed-Format Quantization for Video Diffusion Transformers

Title: Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting

Title: ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization

Title: Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

Title: Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers

Title: GloPath: An Entity-Centric Foundation Model for Glomerular Lesion Assessment and Clinicopathological Insights

Title: TRACE: Task-Adaptive Reasoning and Representation Learning for Universal Multimodal Retrieval

Title: Contextual Latent World Models for Offline Meta Reinforcement Learning

Title: TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration

Title: Improving Anomaly Detection with Foundation-Model Synthesis and Wavelet-Domain Attention

Title: Spatial Autoregressive Modeling of DINOv3 Embeddings for Unsupervised Anomaly Detection

Title: Breaking the Prototype Bias Loop: Confidence-Aware Federated Contrastive Learning for Highly Imbalanced Clients

Title: BRIGHT: A Collaborative Generalist-Specialist Foundation Model for Breast Pathology

Title: Compact Prompting in Instruction-tuned LLMs for Joint Argumentative Component Detection

Title: MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection

Title: AWDiff: An a trous wavelet diffusion model for lung ultrasound image synthesis

Title: Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Title: Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations

Title: Kling-MotionControl Technical Report

Title: MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization

Title: Understanding and Mitigating Dataset Corruption in LLM Steering

Title: Inverse Reconstruction of Shock Time Series from Shock Response Spectrum Curves using Machine Learning

Title: On Geometry Regularization in Autoencoder Reduced-Order Models with Latent Neural ODE Dynamics

Title: COP-GEN: Latent Diffusion Transformer for Copernicus Earth Observation Data -- Generation Stochastic by Design

Title: UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Title: Using Learning Progressions to Guide AI Feedback for Science Learning

Title: DuoMo: Dual Motion Diffusion for World-Space Human Reconstruction

Title: LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Title: Beyond Language Modeling: An Exploration of Multimodal Pretraining

Title: CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance

Title: MIBURI: Towards Expressive Interactive Gesture Synthesis

Title: Utonia: Toward One Encoder for All Point Clouds