2026-03-03

Title: StaTS: Spectral Trajectory Schedule Learning for Adaptive Time Series Forecasting with Frequency Guided Denoiser

Title: Attn-QAT: 4-Bit Attention With Quantization-Aware Training

Title: Breaking the Factorization Barrier in Diffusion Language Models

Title: BiJEPA: Bi-directional Joint Embedding Predictive Architecture for Symmetric Representation Learning

Title: Knowledge-guided generative surrogate modeling for high-dimensional design optimization under scarce data

Title: M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

Title: VoxelDiffusionCut: Non-destructive Internal-part Extraction via Iterative Cutting and Structure Estimation

Title: NovaLAD: A Fast, CPU-Optimized Document Extraction Pipeline for Generative AI and Data Intelligence

Title: You Don't Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models

Title: Steering Away from Memorization: Reachability-Constrained Reinforcement Learning for Text-to-Image Diffusion

Title: GrapHist: Graph Self-Supervised Learning for Histopathology

Title: Disentangled Hierarchical VAE for 3D Human-Human Interaction Generation

Title: Physics-Consistent Diffusion for Efficient Fluid Super-Resolution via Multiscale Residual Correction

Title: Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!

Title: DINOv3 Meets YOLO26 for Weed Detection in Vegetable Crops

Title: Exploring the AI Obedience: Why is Generating a Pure Color Image Harder than CyberPunk?

Title: Summer-22B: A Systematic Approach to Dataset Engineering and Training at Scale for Video Foundation Model

Title: Infinite Self-Attention

Title: NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces

Title: Engineering FAIR Privacy-preserving Applications that Learn Histories of Disease

Title: Zero-Shot and Supervised Bird Image Segmentation Using Foundation Models: A Dual-Pipeline Approach with Grounding DINO~1.5, YOLOv11, and SAM~2.1

Title: ThreatFormer-IDS: Robust Transformer Intrusion Detection with Zero-Day Generalization and Explainable Attribution

Title: OSF: On Pre-training and Scaling of Sleep Foundation Models

Title: SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models

Title: TACIT Benchmark: A Programmatic Visual Reasoning Benchmark for Generative and Discriminative Models

Title: Physical Evaluation of Naturalistic Adversarial Patches for Camera-Based Traffic-Sign Detection

Title: Diffusion-Based Low-Light Image Enhancement with Color and Luminance Priors

Title: Distribution-Aware Companding Quantization of Large Language Models

Title: DiffSOS: Acoustic Conditional Diffusion Model for Speed-of-Sound Reconstruction in Ultrasound Computed Tomography

Title: TENG-BC: Unified Time-Evolving Natural Gradient for Neural PDE Solvers with General Boundary Conditions

Title: Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models

Title: TAP-SLF: Parameter-Efficient Adaptation of Vision Foundation Models for Multi-Task Ultrasound Image Analysis

Title: Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling

Title: SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment

Title: Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Title: Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

Title: DreamWorld: Unified World Modeling in Video Generation

Title: RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free Text-to-Image Alignment

Title: ArtiFixer: Enhancing and Extending 3D Reconstruction with Auto-Regressive Diffusion Models

Title: COG: Confidence-aware Optimal Geometric Correspondence for Unsupervised Single-reference Novel Object Pose Estimation

Title: Vision-TTT: Efficient and Expressive Visual Representation Learning with Test-Time Training

Title: Jano: Adaptive Diffusion Generation with Early-stage Convergence Awareness

Title: Phys-Diff: A Physics-Inspired Latent Diffusion Model for Tropical Cyclone Forecasting

Title: Bridge Matching Sampler: Scalable Sampling via Generalized Fixed-Point Diffusion Matching

Title: Spectral Condition for $μ$P under Width-Depth Scaling

Title: Weakly Supervised Video Anomaly Detection with Anomaly-Connected Components and Intention Reasoning

Title: AlignVAR: Towards Globally Consistent Visual Autoregression for Image Super-Resolution

Title: Learning to Explore: Policy-Guided Outlier Synthesis for Graph Out-of-Distribution Detection

Title: IdGlow: Dynamic Identity Modulation for Multi-Subject Generation

Title: Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Title: Adapt Data to Model: Adaptive Transformation Optimization for Domain-shared Time Series Foundation Models

Title: Position: Evaluation of Visual Processing Should Be Human-Centered, Not Metric-Centered

Title: Specializing Foundation Models via Mixture of Low-Rank Experts for Comprehensive Head CT Analysis

Title: Polynomial Mixing for Efficient Self-supervised Speech Encoders

Title: RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis

Title: SCOUT: Fast Spectral CT Imaging in Ultra LOw-data Regimes via PseUdo-label GeneraTion

Title: Diversity over Uniformity: Rethinking Representation in Generated Image Detection

Title: General Proximal Flow Networks

Title: Stroke outcome and evolution prediction from CT brain using a spatiotemporal diffusion autoencoder

Title: Analyzing and Improving Fast Sampling of Text-to-Image Diffusion Models

Title: Interpretable Cross-Network Attention for Resting-State fMRI Representation Learning

Title: COMBAT: Conditional World Models for Behavioral Agent Training

Title: MultiPUFFIN: A Multimodal Domain-Constrained Foundation Model for Molecular Property Prediction of Small Molecules

Title: AMDS: Attack-Aware Multi-Stage Defense System for Network Intrusion Detection with Two-Stage Adaptive Weight Learning

Title: Active Flow Matching

Title: Knowledge without Wisdom: Measuring Misalignment between LLMs and Intended Impact

Title: Probabilistic Learning and Generation in Deep Sequence Models

Title: Clawdrain: Exploiting Tool-Calling Chains for Stealthy Token Exhaustion in OpenClaw Agents

Title: Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards

Title: \textsc{Mobile-VTON}: High-Fidelity On-Device Virtual Try-On

Title: Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models

Title: EraseAnything++: Enabling Concept Erasure in Rectified Flow Transformers Leveraging Multi-Object Optimization

Title: Fake It Right: Injecting Anatomical Logic into Synthetic Supervised Pre-training for Medical Segmentation

Title: Event-Anchored Frame Selection for Effective Long-Video Understanding

Title: Foundation Models in Remote Sensing: Evolving from Unimodality to Multimodality

Title: MLRecon: Robust Markerless Freehand 3D Ultrasound Reconstruction via Coarse-to-Fine Pose Estimation

Title: Compensation-free Machine Unlearning in Text-to-Image Diffusion Models by Eliminating the Mutual Information

Title: Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer

Title: GeodesicNVS: Probability Density Geodesic Flow Matching for Novel View Synthesis

Title: BadRSSD: Backdoor Attacks on Regularized Self-Supervised Diffusion Models

Title: Vision-Language Feature Alignment for Road Anomaly Segmentation

Title: Evaluating GFlowNet from partial episodes for stable and flexible policy-based training

Title: LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model

Title: Flow Matching-enabled Test-Time Refinement for Unsupervised Cardiac MR Registration

Title: Unified Vision-Language Modeling via Concept Space Alignment

Title: Understanding LoRA as Knowledge Memory: An Empirical Analysis

Title: Data-Efficient Brushstroke Generation with Diffusion Models for Oil Painting

Title: GuiDINO: Rethinking Vision Foundation Model in Medical Image Segmentation

Title: Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations

Title: A Deep Learning Framework for Heat Demand Forecasting using Time-Frequency Representations of Decomposed Features

Title: Teacher-Guided Causal Interventions for Image Denoising: Orthogonal Content-Noise Disentanglement in Vision Transformers

Title: ArtLLM: Generating Articulated Assets via 3D LLM

Title: Reasoning or Rationalization? The Role of Justifications in Masked Diffusion Models for Fact Verification

Title: Generative AI & Fictionality: How Novels Power Large Language Models

Title: Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation

Title: Cross-Modal Guidance for Fast Diffusion-Based Computed Tomography

Title: Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models

Title: AG-VAS: Anchor-Guided Zero-Shot Visual Anomaly Segmentation with Large Multimodal Models

Title: You Only Need One Stage: Novel-View Synthesis From A Single Blind Face Image

Title: MetaState: Persistent Working Memory for Discrete Diffusion Language Models

Title: Perspective-Equivariant Fine-tuning for Multispectral Demosaicing without Ground Truth

Title: Provable and Practical In-Context Policy Optimization for Self-Improvement

Title: UTICA: Multi-Objective Self-Distllation Foundation Model Pretraining for Time Series Classification

Title: DUEL: Exact Likelihood for Masked Diffusion via Deterministic Unmasking

Title: Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

Title: One Operator to Rule Them All? On Boundary-Indexed Operator Families in Neural PDE Solvers

Title: UniTalking: A Unified Audio-Video Framework for Talking Portrait Generation

Title: LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval

Title: DOCFORGE-BENCH: A Comprehensive Benchmark for Document Forgery Detection and Analysis

Title: Autoregressive Synthesis of Sparse and Semi-Structured Mixed-Type Data

Title: Deepfake Forensics Adapter: A Dual-Stream Network for Generalizable Deepfake Detection

Title: Tri-path DINO: Feature Complementary Learning for Remote Sensing Multi-Class Change Detection

Title: Retrieval, Refinement, and Ranking for Text-to-Video Generation via Prompt Optimization and Test-Time Scaling

Title: FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation

Title: Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing

Title: RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry

Title: PathMoE: Interpretable Multimodal Interaction Experts for Pediatric Brain Tumor Classification

Title: Align-cDAE: Alzheimer's Disease Progression Modeling with Attention-Aligned Conditional Diffusion Auto-Encoder

Title: LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

Title: Cryo-Bench: Benchmarking Foundation Models for Cryosphere Applications

Title: SkeleGuide: Explicit Skeleton Reasoning for Context-Aware Human-in-Place Image Synthesis

Title: Markovian ODE-guided scoring can assess the quality of offline reasoning traces in language models

Title: FAST-DIPS: Adjoint-Free Analytic Steps and Hard-Constrained Likelihood Correction for Diffusion-Prior Inverse Problems

Title: Preference Score Distillation: Leveraging 2D Rewards to Align Text-to-3D Generation with Human Preference

Title: Sparse View Distractor-Free Gaussian Splatting

Title: Information-Theoretic Digital Twins for Stealthy Attack Detection in Industrial Control Systems: A Closed-Form KL Divergence Approach

Title: Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration

Title: PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts

Title: Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling

Title: A Diffusion-Driven Fine-Grained Nodule Synthesis Framework for Enhanced Lung Nodule Detection from Chest Radiographs

Title: FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters

Title: DiffusionXRay: A Diffusion and GAN-Based Approach for Enhancing Digitally Reconstructed Chest Radiographs

Title: CoopDiff: A Diffusion-Guided Approach for Cooperation under Corruptions

Title: Building a Strong Instruction Language Model for a Less-Resourced Language

Title: Dual Distillation for Few-Shot Anomaly Detection

Title: Bootstrapping Embeddings for Low Resource Languages

Title: Causal Circuit Tracing Reveals Distinct Computational Architectures in Single-Cell Foundation Models: Inhibitory Dominance, Biological Coherence, and Cross-Model Convergence

Title: Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Title: Modular Memory is the Key to Continual Learning Agents

Title: Efficient Test-Time Optimization for Depth Completion via Low-Rank Decoder Adaptation

Title: CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

Title: FreeAct: Freeing Activations for LLM Quantization

Title: LLM-as-an-Annotator: Training Lightweight Models with LLM-Annotated Examples for Aspect Sentiment Tuple Prediction

Title: D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

Title: Learning Shortest Paths with Generative Flow Networks

Title: Phase-Type Variational Autoencoders for Heavy-Tailed Data

Title: Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments

Title: Constrained Particle Seeking: Solving Diffusion Inverse Problems with Just Forward Passes

Title: Phishing the Phishers with SpecularNet: Hierarchical Graph Autoencoding for Reference-Free Web Phishing Detection

Title: CTForensics: A Comprehensive Dataset and Method for AI-Generated CT Image Detection

Title: Resolving Blind Inverse Problems under Dynamic Range Compression via Structured Forward Operator Modeling

Title: Generative Visual Chain-of-Thought for Image Editing

Title: Zero-shot Low-Field MRI Enhancement via Diffusion-Based Adaptive Contrast Transport

Title: AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth

Title: LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving

Title: Dream2Learn: Structured Generative Dreaming for Continual Learning

Title: Probabilistic Retrofitting of Learned Simulators

Title: Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment

Title: CoVAE: correlated multimodal generative modeling

Title: Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection

Title: Mitigating topology biases in Graph Diffusion via Counterfactual Intervention

Title: MAP-Diff: Multi-Anchor Guided Diffusion for Progressive 3D Whole-Body Low-Dose PET Denoising

Title: CausalWrap: Model-Agnostic Causal Constraint Wrappers for Tabular Synthetic Data

Title: PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking

Title: WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories

Title: ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks

Title: From Pixels to Patches: Pooling Strategies for Earth Embeddings

Title: LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation

Title: GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis

Title: Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance

Title: Sketch2Colab: Sketch-Conditioned Multi-Human Animation via Controllable Flow Distillation

Title: Frontier Models Can Take Actions at Low Probabilities