2026-03-17

Title: Translational Gaps in Graph Transformers for Longitudinal EHR Prediction: A Critical Appraisal of GT-BEHRT

Title: Your Code Agent Can Grow Alongside You with Structured Memory

Title: Knowledge, Rules and Their Embeddings: Two Paths towards Neuro-Symbolic JEPA

Title: CAMEL-CLIP: Channel-aware Multimodal Electroencephalography-text Alignment for Generalizable Brain Foundation Models

Title: A Stability-Aware Frozen Euler Autoencoder for Physics-Informed Tracking in Continuum Mechanics (SAFE-PIT-CM)

Title: Do Diffusion Models Dream of Electric Planes? Discrete and Continuous Simulation-Based Inference for Aircraft Design

Title: TAS-GNN: A Status-Aware Signed Graph Neural Network for Anomaly Detection in Bitcoin Trust Systems

Title: ICPRL: Acquiring Physical Intuition from Interactive Control

Title: DreamReader: An Interpretability Toolkit for Text-to-Image Models

Title: Safety-Guided Flow (SGF): A Unified Framework for Negative Guidance in Safe Generation

Title: Benchmarking Compact VLMs for Clip-Level Surveillance Anomaly Detection Under Weak Supervision

Title: LightningRL: Breaking the Accuracy-Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning

Title: RBF-Solver: A Multistep Sampler for Diffusion Probabilistic Models via Radial Basis Functions

Title: Local Precise Refinement: A Dual-Gated Mixture-of-Experts for Enhancing Foundation Model Generalization against Spectral Shifts

Title: AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

Title: Bi-CamoDiffusion: A Boundary-informed Diffusion Approach for Camouflaged Object Detection

Title: Graph2Video: Leveraging Video Models to Model Dynamic Graph Evolution

Title: Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Title: Real-Time Monocular Scene Analysis for UAV in Outdoor Environments

Title: Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection

Title: InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization

Title: DINOv3 with Test-Time Calibration for Automated Carotid Intima-Media Thickness Measurement on CUBS v1

Title: Layout-Guided Controllable Pathology Image Generation with In-Context Diffusion Transformers

Title: High-Fidelity Text-to-Image Generation from Pre-Trained Vision-Language Models via Distribution-Conditioned Diffusion Decoding

Title: Colony Grounded SAM2: Zero-shot detection and segmentation of bacterial colonies using foundation models

Title: Language-Guided Token Compression with Reinforcement Learning in Large Vision-Language Models

Title: SERUM: Simple, Efficient, Robust, and Unifying Marking for Diffusion-based Image Generation

Title: MAD: Microenvironment-Aware Distillation -- A Pretraining Strategy for Virtual Spatial Omics from Microscopy

Title: Anchor Forcing: Anchor Memory and Tri-Region RoPE for Interactive Streaming Video Diffusion

Title: Diffusion Models Generalize but Not in the Way You Might Think

Title: Generalization and Memorization in Rectified Flow

Title: Self-Flow-Matching assisted Full Waveform Inversion

Title: CHIMERA-Bench: A Benchmark Dataset for Epitope-Specific Antibody Design

Title: Modality-free Graph In-context Alignment

Title: CtrlAttack: A Unified Attack on World-Model Control in Diffusion Models

Title: Vision-Language Based Expert Reporting for Painting Authentication and Defect Detection

Title: Draft-and-Target Sampling for Video Generation Policy

Title: Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching

Title: LADR: Locality-Aware Dynamic Rescue for Efficient Text-to-Image Generation with Diffusion Large Language Models

Title: Reconciling In-Context and In-Weight Learning via Dual Representation Space Encoding

Title: Purifying Generative LLMs from Backdoors without Prior Knowledge or Clean Reference

Title: Synthetic Melanoma Image Generation and Evaluation Using Generative Adversarial Networks

Title: ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning

Title: LibraGen: Playing a Balance Game in Subject-Driven Video Generation

Title: MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection

Title: A Systematic Benchmark of GAN Architectures for MRI-to-CT Synthesis

Title: Probabilistic Gaussian Homotopy: A Probability-Space Continuation Framework for Nonconvex Optimization

Title: NumColor: Precise Numeric Color Control in Text-to-Image Generation

Title: Scalable Classification of Course Information Sheets Using Large Language Models: A Reusable Institutional Method for Academic Quality Assurance

Title: Privacy-Preserving Machine Learning for IoT: A Cross-Paradigm Survey and Future Roadmap

Title: DiveUp: Learning Feature Upsampling from Diverse Vision Foundation Models

Title: Privacy-Preserving Federated Fraud Detection in Payment Transactions with NVIDIA FLARE

Title: SemRep: Generative Code Representation Learning with Code Transformations

Title: PLUME: Building a Network-Native Foundation Model for Wireless Traces via Protocol-Aware Tokenization

Title: FMS$^2$: Unified Flow Matching for Segmentation and Synthesis of Thin Structures

Title: Learning Generalizable 3D Medical Image Representations from Mask-Guided Self-Supervision

Title: PDE-SSM: A Spectral State Space Approach to Spatial Mixing in Diffusion Transformers

Title: SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

Title: RSEdit: Text-Guided Image Editing for Remote Sensing

Title: Can We Trust LLMs on Memristors? Diving into Reasoning Ability under Non-Ideality

Title: Ransomware and Artificial Intelligence: A Comprehensive Systematic Review of Reviews

Title: UniVid: Pyramid Diffusion Model for High Quality Video Generation

Title: Multi-Object Advertisement Creative Generation

Title: Manifold-Orthogonal Dual-spectrum Extrapolation for Parameterized Physics-Informed Neural Networks

Title: PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment

Title: AD-Copilot: A Vision-Language Assistant for Industrial Anomaly Detection via Visual In-context Comparison

Title: Learning through Creation: A Hash-Free Framework for On-the-Fly Category Discovery

Title: On Interpolation Formulas Describing Neural Network Generalization

Title: GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent

Title: CT-Conditioned Diffusion Prior with Physics-Constrained Sampling for PET Super-Resolution

Title: Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition

Title: Scene Generation at Absolute Scale: Utilizing Semantic and Geometric Guidance From Text for Accurate and Interpretable 3D Indoor Scene Generation

Title: Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video

Title: Discriminative Flow Matching Via Local Generative Predictors

Title: Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing

Title: DCP-CLIP:A Coarse-to-Fine Framework for Open-Vocabulary Semantic Segmentation with Dual Interaction

Title: IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation

Title: VID-AD: A Dataset for Image-Level Logical Anomaly Detection under Vision-Induced Distraction

Title: VAD4Space: Visual Anomaly Detection for Planetary Surface Imagery

Title: Human-like Object Grouping in Self-supervised Vision Transformers

Title: Benchmarking Open-Source PPG Foundation Models for Biological Age Prediction

Title: EyeWorld: A Generative World Model of Ocular State and Dynamics

Title: TMPDiff: Temporal Mixed-Precision for Diffusion Models

Title: Self-Supervised Uncertainty Estimation For Super-Resolution of Satellite Images

Title: Effective Feature Learning for 3D Medical Registration via Domain-Specialized DINO Pretraining

Title: Soft Mean Expected Calibration Error (SMECE): A Calibration Metric for Probabilistic Labels

Title: Not All Latent Spaces Are Flat: Hyperbolic Concept Control

Title: Revisiting the Perception-Distortion Trade-off with Spatial-Semantic Guided Super-Resolution

Title: Diffusion Reinforcement Learning via Centered Reward Distillation

Title: Seeing Through the PRISM: Compound & Controllable Restoration of Scientific Images

Title: SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation

Title: TACTIC for Navigating the Unknown: Tabular Anomaly deteCTion via In-Context inference

Title: Artificial intelligence-enabled single-lead ECG for non-invasive hyperkalemia detection: development, multicenter validation, and proof-of-concept deployment

Title: Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models

Title: Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis

Title: DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution

Title: ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control

Title: FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

Title: Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

Title: FOCUS: Bridging Fine-Grained Recognition and Open-World Discovery across Domains

Title: CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control

Title: GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies

Title: DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Title: Toward Clinically Ready Foundation Models in Medical Image Analysis: Adaptation Mechanisms and Deployment Trade-offs

Title: Seeking Physics in Diffusion Noise

Title: Early Failure Detection and Intervention in Video Diffusion Models

Title: AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

Title: Representation Alignment for Just Image Transformers is not Easier than You Think

Title: The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

Title: ES-Merging: Biological MLLM Merging via Embedding Space Signals

Title: Graph-Based Deep Learning for Intelligent Detection of Energy Losses, Theft, and Operational Inefficiencies in Oil & Gas Production Networks

Title: Towards One-for-All Anomaly Detection for Tabular Data

Title: PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis

Title: On the (Generative) Linear Sketching Problem

Title: V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Title: WorldVLM: Combining World Model Forecasting and Vision-Language Reasoning

Title: Mapping Dark-Matter Clusters via Physics-Guided Diffusion Models

Title: Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models

Title: Unlocking the Latent Canvas: Eliciting and Benchmarking Symbolic Visual Expression in LLMs

Title: LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion

Title: Interp3R: Continuous-time 3D Geometry Estimation with Frames and Events

Title: Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders

Title: Learning to Order: Task Sequencing as In-Context Optimization

Title: A Multi-Scale Graph Learning Framework with Temporal Consistency Constraints for Financial Fraud Detection in Transaction Networks under Non-Stationary Conditions

Title: $PA^3$: $\textbf{P}$olicy-$\textbf{A}$ware $\textbf{A}$gent $\textbf{A}$lignment through Chain-of-Thought

Title: Make it SING: Analyzing Semantic Invariants in Classifiers

Title: A Heterogeneous Ensemble for Multi-Center COVID-19 Classification from Chest CT Scans

Title: Continual Few-shot Adaptation for Synthetic Fingerprint Detection

Title: Spectrum Matching: a Unified Perspective for Superior Diffusability in Latent Diffusion

Title: Comparative Analysis of 3D Convolutional and 2.5D Slice-Conditioned U-Net Architectures for MRI Super-Resolution via Elucidated Diffusion Models

Title: MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model

Title: AURORA-KITTI: Any-Weather Depth Completion and Denoising in the Wild

Title: Fractal Autoregressive Depth Estimation with Continuous Token Diffusion

Title: Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning

Title: Cross-RAG: Zero-Shot Retrieval-Augmented Time Series Forecasting via Cross-Attention

Title: Training-Free Generation of Protein Sequences from Small Family Alignments via Stochastic Attention

Title: Automated Diabetic Screening via Anterior Segment Ocular Imaging: A Deep Learning and Explainable AI Approach

Title: DeFRiS: Silo-Cooperative IoT Applications Scheduling via Decentralized Federated Reinforcement Learning

Title: PHAC: Promptable Human Amodal Completion

Title: POLCA: Stochastic Generative Optimization with LLM

Title: AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas

Title: Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling

Title: RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models

Title: IntegratingWeather Foundation Model and Satellite to Enable Fine-Grained Solar Irradiance Forecasting

Title: From Artefact to Insight: Efficient Low-Rank Adaptation of BrushNet for Scanning Probe Microscopy Image Restoration

Title: Architecture-Agnostic Feature Synergy for Universal Defense Against Heterogeneous Generative Threats

Title: IgPose: A Generative Data-Augmented Pipeline for Robust Immunoglobulin-Antigen Binding Prediction

Title: Seismic full-waveform inversion based on a physics-driven generative adversarial network

Title: SpiralDiff: Spiral Diffusion with LoRA for RGB-to-RAW Conversion Across Cameras

Title: Workflow-Aware Structured Layer Decomposition for Illustration Production

Title: Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework

Title: LLM as Graph Kernel: Rethinking Message Passing on Text-Rich Graphs

Title: FAR-Drive: Frame-AutoRegressive Video Generation in Closed-Loop Autonomous Driving

Title: CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models

Title: GeoNVS: Geometry Grounded Video Diffusion for Novel View Synthesis

Title: Edit2Interp: Adapting Image Foundation Models from Spatial Editing to Video Frame Interpolation with Few-Shot Learning

Title: TrajFlow: Nation-wide Pseudo GPS Trajectory Generation with Flow Matching Models

Title: Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods

Title: Interpretable Predictability-Based AI Text Detection: A Replication Study

Title: Writer-R1: Enhancing Generative Writing in LLMs via Memory-augmented Replay Policy Optimization

Title: ReactMotion: Generating Reactive Listener Motions from Speaker Utterance

Title: Learning from Limited and Incomplete Data: A Multimodal Framework for Predicting Pathological Response in NSCLC

Title: VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents

Title: A Tutorial on ALOS2 SAR Utilization: Dataset Preparation, Self-Supervised Pretraining, and Semantic Segmentation

Title: Next-Frame Decoding for Ultra-Low-Bitrate Image Compression with Video Diffusion Priors

Title: WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation

Title: Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies

Title: SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation

Title: PiGRAND: Physics-informed Graph Neural Diffusion for Intelligent Additive Manufacturing

Title: Towards Foundation Models for Consensus Rank Aggregation

Title: Multi-turn Physics-informed Vision-language Model for Physics-grounded Anomaly Detection

Title: In-Context Symbolic Regression for Robustness-Improved Kolmogorov-Arnold Networks

Title: IConE: Batch Independent Collapse Prevention for Self-Supervised Representation Learning

Title: Exemplar Diffusion: Improving Medical Object Detection with Opportunistic Labels

Title: Self-Supervised ImageNet Representations for In Vivo Confocal Microscopy: Tortuosity Grading without Segmentation Maps

Title: Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models

Title: Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling

Title: GATE-AD: Graph Attention Network Encoding For Few-Shot Industrial Visual Anomaly Detection

Title: Generative Video Compression with One-Dimensional Latent Representation

Title: DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models

Title: Unsupervised Cross-Protocol Anomaly Analysis in Mobile Core Networks via Multi-Embedding Models Consensus

Title: Conditional Rectified Flow-based End-to-End Rapid Seismic Inversion Method

Title: A PPO-Based Bitrate Allocation Conditional Diffusion Model for Remote Sensing Image Compression

Title: Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation

Title: AI Evasion and Impersonation Attacks on Facial Re-Identification with Activation Map Explanations

Title: AnyCrowd: Instance-Isolated Identity-Pose Binding for Arbitrary Multi-Character Animation

Title: Physics-informed fine-tuning of foundation models for partial differential equations

Title: MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts

Title: ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Title: RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance

Title: Kimodo: Scaling Controllable Human Motion Generation

Title: Self-Distillation of Hidden Layers for Self-Supervised Representation Learning

Title: Learning Latent Proxies for Controllable Single-Image Relighting

Title: Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

Title: Grounding World Simulation Models in a Real-World Metropolis

Title: Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion

Title: Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models