2026-01-01

Title: Enriching Historical Records: An OCR and AI-Driven Approach for Database Integration

Title: A Review of Diffusion-based Simulation-Based Inference: Foundations and Applications in Non-Ideal Data Scenarios

Title: Geometric Scaling of Bayesian Inference in LLMs

Title: HINTS: Extraction of Human Insights from Time-Series Without External Sources

Title: A Survey on Graph Neural Networks for Fraud Detection in Ride Hailing Platforms

Title: Leveraging Synthetic Priors for Monocular Depth Estimation in Specular Surgical Environments

Title: Exploiting the Prior of Generative Time Series Imputation

Title: Flow Matching Neural Processes

Title: Lifelong Domain Adaptive 3D Human Pose Estimation

Title: Scaling Remote Sensing Foundation Models: Data Domain Tradeoffs at the Peta-Scale

Title: T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models

Title: Efficient Context Scaling with LongCat ZigZag Attention

Title: Assured Autonomy: How Operations Research Powers and Orchestrates Generative AI Systems

Title: DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation

Title: Anomaly detection in satellite imagery through temporal inpainting

Title: Bridging Structure and Appearance: Topological Features for Robust Self-Supervised Segmentation

Title: Tracing the Heart's Pathways: ECG Representation Learning from a Cardiac Conduction Perspective

Title: On Exact Editing of Flow-Based Diffusion Models

Title: PipeFlow: Pipelined Processing and Motion-Aware Frame Selection for Long-Form Video Editing

Title: Reinforced Diffusion: Learning to Push the Limits of Anisotropic Diffusion for Image Denoising

Title: RainFusion2.0: Temporal-Spatial Awareness and Hardware-Efficient Block-wise Sparse Attention

Title: Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks

Title: GARDO: Reinforcing Diffusion Models without Reward Hacking

Title: Activation Steering for Masked Diffusion Language Models

Title: Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Title: Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset

Title: DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Title: Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges

Title: Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Title: CorGi: Contribution-Guided Block-Wise Interval Caching for Training-Free Acceleration of Diffusion Transformers

Title: Medical Image Classification on Imbalanced Data Using ProGAN and SMA-Optimized ResNet: Application to COVID-19

Title: ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation

Title: Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes

Title: MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model

Title: Physically-Grounded Manifold Projection with Foundation Priors for Metal Artifact Reduction in Dental CBCT

Title: Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Title: One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training

Title: Virtual-Eyes: Quantitative Validation of a Lung CT Quality-Control Pipeline for Foundation-Model Cancer Risk Prediction

Title: Skim-Aware Contrastive Learning for Efficient Document Representation

Title: Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Title: DyStream: Streaming Dyadic Talking Heads Generation via Flow Matching-based Autoregressive Model

Title: Generative forecasting with joint probability models

Title: F2IDiff: Real-world Image Super-resolution using Feature to Image Diffusion Foundation Model

Title: Training-Free Color-Aware Adversarial Diffusion Sanitization for Diffusion Stegomalware Defense at Security Gateways

Title: Using Large Language Models To Translate Machine Results To Human Results

Title: Do Large Language Models Know What They Are Capable Of?

Title: HeteroHBA: A Generative Structure-Manipulating Backdoor Attack on Heterogeneous Graphs

Title: Nested Learning: The Illusion of Deep Learning Architectures

Title: BandiK: Efficient Multi-Task Decomposition Using a Multi-Bandit Framework

Title: Self-Supervised Neural Architecture Search for Multimodal Deep Neural Networks

Title: AODDiff: Probabilistic Reconstruction of Aerosol Optical Depth via Diffusion-based Bayesian Inference

Title: Characterization of Transfer Using Multi-task Learning Curves

Title: Towards Provably Secure Generative AI: Reliable Consensus Sampling

Title: HaineiFRDM: Explore Diffusion to Restore Defects in Fast-Movement Films

Title: ProDM: Synthetic Reality-driven Property-aware Progressive Diffusion Model for Coronary Calcium Motion Correction in Non-gated Chest CT

Title: VIPER: Process-aware Evaluation for Generative Video Reasoning

Title: ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands

Title: FoundationSLAM: Unleashing the Power of Depth Foundation Models for End-to-End Dense Visual SLAM

Title: Diffusion Language Models are Provably Optimal Parallel Samplers

Title: Generative Classifiers Avoid Shortcut Solutions

Title: From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing

Title: GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

Title: SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time