2026-03-03

Title: StaTS: Spectral Trajectory Schedule Learning for Adaptive Time Series Forecasting with Frequency Guided Denoiser

Title: Breaking the Factorization Barrier in Diffusion Language Models

Title: BiJEPA: Bi-directional Joint Embedding Predictive Architecture for Symmetric Representation Learning

Title: Knowledge-guided generative surrogate modeling for high-dimensional design optimization under scarce data

Title: VoxelDiffusionCut: Non-destructive Internal-part Extraction via Iterative Cutting and Structure Estimation

Title: Efficient Image Super-Resolution with Multi-Scale Spatial Adaptive Attention Networks

Title: NovaLAD: A Fast, CPU-Optimized Document Extraction Pipeline for Generative AI and Data Intelligence

Title: CT-Flow: Orchestrating CT Interpretation Workflow with Model Context Protocol Servers

Title: You Don't Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models

Title: Steering Away from Memorization: Reachability-Constrained Reinforcement Learning for Text-to-Image Diffusion

Title: From Scale to Speed: Adaptive Test-Time Scaling for Image Editing

Title: Disentangled Hierarchical VAE for 3D Human-Human Interaction Generation

Title: Physics-Consistent Diffusion for Efficient Fluid Super-Resolution via Multiscale Residual Correction

Title: EfficientPosterGen: Semantic-aware Efficient Poster Generation via Token Compression and Accurate Violation Detection

Title: FlowPortrait: Reinforcement Learning for Audio-Driven Portrait Video Generation

Title: SKINOPATHY AI: Smartphone-Based Ophthalmic Screening and Longitudinal Tracking Using Lightweight Computer Vision

Title: Exploring the AI Obedience: Why is Generating a Pure Color Image Harder than CyberPunk?

Title: NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces

Title: Engineering FAIR Privacy-preserving Applications that Learn Histories of Disease

Title: SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models

Title: TACIT Benchmark: A Programmatic Visual Reasoning Benchmark for Generative and Discriminative Models

Title: VisRef: Visual Refocusing while Thinking Improves Test-Time Scaling in Multi-Modal Large Reasoning Models

Title: Physical Evaluation of Naturalistic Adversarial Patches for Camera-Based Traffic-Sign Detection

Title: Adversarial Patch Generation for Visual-Infrared Dense Prediction Tasks via Joint Position-Color Optimization

Title: Percept-Aware Surgical Planning for Visual Cortical Prostheses with Vascular Avoidance

Title: DiffSOS: Acoustic Conditional Diffusion Model for Speed-of-Sound Reconstruction in Ultrasound Computed Tomography

Title: SSR: Pushing the Limit of Spatial Intelligence with Structured Scene Reasoning

Title: Station2Radar: query conditioned gaussian splatting for precipitation field

Title: An Interpretable Local Editing Model for Counterfactual Medical Image Generation

Title: Self-Correction Inside the Model: Leveraging Layer Attention to Mitigate Hallucinations in Large Vision Language Models

Title: Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling

Title: SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment

Title: Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Title: Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

Title: ReMoT: Reinforcement Learning with Motion Contrast Triplets

Title: DreamWorld: Unified World Modeling in Video Generation

Title: U-VLM: Hierarchical Vision Language Modeling for Report Generation

Title: RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free Text-to-Image Alignment

Title: ArtiFixer: Enhancing and Extending 3D Reconstruction with Auto-Regressive Diffusion Models

Title: Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning

Title: Vision-TTT: Efficient and Expressive Visual Representation Learning with Test-Time Training

Title: Jano: Adaptive Diffusion Generation with Early-stage Convergence Awareness

Title: Mesh-Pro: Asynchronous Advantage-guided Ranking Preference Optimization for Artist-style Quadrilateral Mesh Generation

Title: RAFM: Retrieval-Augmented Flow Matching for Unpaired CBCT-to-CT Translation

Title: Spectral Condition for $μ$P under Width-Depth Scaling

Title: WildActor: Unconstrained Identity-Preserving Video Generation

Title: AlignVAR: Towards Globally Consistent Visual Autoregression for Image Super-Resolution

Title: IdGlow: Dynamic Identity Modulation for Multi-Subject Generation

Title: Position: Evaluation of Visual Processing Should Be Human-Centered, Not Metric-Centered

Title: Direct low-field MRI super-resolution using undersampled k-space

Title: SCOUT: Fast Spectral CT Imaging in Ultra LOw-data Regimes via PseUdo-label GeneraTion

Title: Diversity over Uniformity: Rethinking Representation in Generated Image Detection

Title: General Proximal Flow Networks

Title: Interpretable Cross-Network Attention for Resting-State fMRI Representation Learning

Title: COMBAT: Conditional World Models for Behavioral Agent Training

Title: Neural Discrimination-Prompted Transformers for Efficient UHD Image Restoration and Enhancement

Title: Active Flow Matching

Title: Knowledge without Wisdom: Measuring Misalignment between LLMs and Intended Impact

Title: Probabilistic Learning and Generation in Deep Sequence Models

Title: pySpatial: Generating 3D Visual Programs for Zero-Shot Spatial Reasoning

Title: ShiftLUT: Spatial Shift Enhanced Look-Up Tables for Efficient Image Restoration

Title: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection

Title: Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards

Title: DriveCode: Domain Specific Numerical Encoding for LLM-Based Autonomous Driving

Title: Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos

Title: \textsc{Mobile-VTON}: High-Fidelity On-Device Virtual Try-On

Title: Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models

Title: PreciseCache: Precise Feature Caching for Efficient and High-fidelity Video Generation

Title: EraseAnything++: Enabling Concept Erasure in Rectified Flow Transformers Leveraging Multi-Object Optimization

Title: The Texture-Shape Dilemma: Boundary-Safe Synthetic Generation for 3D Medical Transformers

Title: Compensation-free Machine Unlearning in Text-to-Image Diffusion Models by Eliminating the Mutual Information

Title: Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer

Title: GeodesicNVS: Probability Density Geodesic Flow Matching for Novel View Synthesis

Title: One-Token Verification for Reasoning Correctness Estimation

Title: Reparameterized Tensor Ring Functional Decomposition for Multi-Dimensional Data Recovery

Title: From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing

Title: Evaluating GFlowNet from partial episodes for stable and flexible policy-based training

Title: MM-DeepResearch: A Simple and Effective Multimodal Agentic Search Baseline

Title: LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model

Title: Can Vision Language Models Assess Graphic Design Aesthetics? A Benchmark, Evaluation, and Dataset Perspective

Title: Understanding LoRA as Knowledge Memory: An Empirical Analysis

Title: Data-Efficient Brushstroke Generation with Diffusion Models for Oil Painting

Title: ClinCoT: Clinical-Aware Visual Chain-of-Thought for Medical Vision Language Models

Title: Teacher-Guided Causal Interventions for Image Denoising: Orthogonal Content-Noise Disentanglement in Vision Transformers

Title: ArtLLM: Generating Articulated Assets via 3D LLM

Title: Operator Learning Using Weak Supervision from Walk-on-Spheres

Title: RnG: A Unified Transformer for Complete 3D Modeling from Partial Observations

Title: JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks

Title: You Only Need One Stage: Novel-View Synthesis From A Single Blind Face Image

Title: DUEL: Exact Likelihood for Masked Diffusion via Deterministic Unmasking

Title: TIMI: Training-Free Image-to-3D Multi-Instance Generation with Spatial Fidelity

Title: Continuous Exposure-Time Modeling for Realistic Atmospheric Turbulence Synthesis

Title: UniTalking: A Unified Audio-Video Framework for Talking Portrait Generation

Title: DOCFORGE-BENCH: A Comprehensive Benchmark for Document Forgery Detection and Analysis

Title: Unifying Language-Action Understanding and Generation for Autonomous Driving

Title: Autoregressive Synthesis of Sparse and Semi-Structured Mixed-Type Data

Title: Deepfake Forensics Adapter: A Dual-Stream Network for Generalizable Deepfake Detection

Title: Retrieval, Refinement, and Ranking for Text-to-Video Generation via Prompt Optimization and Test-Time Scaling

Title: FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation

Title: Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing

Title: RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry

Title: Align-cDAE: Alzheimer's Disease Progression Modeling with Attention-Aligned Conditional Diffusion Auto-Encoder

Title: LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

Title: SkeleGuide: Explicit Skeleton Reasoning for Context-Aware Human-in-Place Image Synthesis

Title: Preference Score Distillation: Leveraging 2D Rewards to Align Text-to-3D Generation with Human Preference

Title: Dehallu3D: Hallucination-Mitigated 3D Generation from Single Image via Cyclic View Consistency Refinement

Title: Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration

Title: DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving

Title: QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image

Title: Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling

Title: FreeGNN: Continual Source-Free Graph Neural Network Adaptation for Renewable Energy Forecasting

Title: A Diffusion-Driven Fine-Grained Nodule Synthesis Framework for Enhanced Lung Nodule Detection from Chest Radiographs

Title: FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters

Title: DiffusionXRay: A Diffusion and GAN-Based Approach for Enhancing Digitally Reconstructed Chest Radiographs

Title: Learning Domain-Aware Task Prompt Representations for Multi-Domain All-in-One Image Restoration

Title: NeuroSymb-MRG: Differentiable Abductive Reasoning with Active Uncertainty Minimization for Radiology Report Generation

Title: StepVAR: Structure-Texture Guided Pruning for Visual Autoregressive Models

Title: CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

Title: D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

Title: Learning Shortest Paths with Generative Flow Networks

Title: Phase-Type Variational Autoencoders for Heavy-Tailed Data

Title: Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments

Title: Constrained Particle Seeking: Solving Diffusion Inverse Problems with Just Forward Passes

Title: FireRed-OCR Technical Report

Title: Tide: A Customisable Dataset Generator for Anti-Money Laundering Research

Title: CTForensics: A Comprehensive Dataset and Method for AI-Generated CT Image Detection

Title: Generative Visual Chain-of-Thought for Image Editing

Title: LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving

Title: Dream2Learn: Structured Generative Dreaming for Continual Learning

Title: Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment

Title: CoVAE: correlated multimodal generative modeling

Title: Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection

Title: Mitigating topology biases in Graph Diffusion via Counterfactual Intervention

Title: Noise-Calibrated Inference from Differentially Private Sufficient Statistics in Exponential Families

Title: MAP-Diff: Multi-Anchor Guided Diffusion for Progressive 3D Whole-Body Low-Dose PET Denoising

Title: Latent attention on masked patches for flow reconstruction

Title: Expanding LLM Agent Boundaries with Strategy-Guided Exploration

Title: NICO-RAG: Multimodal Hypergraph Retrieval-Augmented Generation for Understanding the Nicotine Public Health Crisis

Title: WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories

Title: ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks

Title: LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation

Title: SimRecon: SimReady Compositional Scene Reconstruction from Real Videos

Title: OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Title: GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis

Title: Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance

Title: Multi-Head Low-Rank Attention