2026-03-11

Title: Are Expressive Encoders Necessary for Discrete Graph Generation?

Title: HECTOR: Hybrid Editable Compositional Object References for Video Generation

Title: NetDiffuser: Deceiving DNN-Based Network Attack Detection Systems with Diffusion-Generated Adversarial Traffic

Title: TIDE: Text-Informed Dynamic Extrapolation with Step-Aware Temperature Control for Diffusion Transformers

Title: Using Vision Language Foundation Models to Generate Plant Simulation Configurations via In-Context Learning

Title: Semantic Level of Detail: Multi-Scale Knowledge Representation via Heat Kernel Diffusion on Hyperbolic Manifolds

Title: SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Title: Diffusion-Based Authentication of Copy Detection Patterns: A Multimodal Framework with Printer Signature Conditioning

Title: Security Considerations for Multi-agent Systems

Title: Spectral-Structured Diffusion for Single-Image Rain Removal

Title: Chain of Event-Centric Causal Thought for Physically Plausible Video Generation

Title: Training-free Motion Factorization for Compositional Video Generation

Title: QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model

Title: Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning

Title: TubeMLLM: A Foundation Model for Topology Knowledge Exploration in Vessel-like Anatomy

Title: UniField: A Unified Field-Aware MRI Enhancement Framework

Title: BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off

Title: RAE-NWM: Navigation World Model in Dense Visual Representation Space

Title: When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection

Title: From Ideal to Real: Stable Video Object Removal under Imperfect Conditions

Title: Learning Convex Decomposition via Feature Fields

Title: CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation

Title: TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection

Title: Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework

Title: ProvAgent: Threat Detection Based on Identity-Behavior Binding and Multi-Agent Collaborative Attack Investigation

Title: M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition

Title: MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification

Title: EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation

Title: Training-Free Coverless Multi-Image Steganography with Access Control

Title: Reviving ConvNeXt for Efficient Convolutional Diffusion Models

Title: Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers

Title: ShapeMark: Robust and Diversity-Preserving Watermarking for Diffusion Models

Title: Component-Aware Sketch-to-Image Generation Using Self-Attention Encoding and Coordinate-Preserving Fusion

Title: Streaming Autoregressive Video Generation via Diagonal Distillation

Title: Temporal-Conditioned Normalizing Flows for Multivariate Time Series Anomaly Detection

Title: Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning

Title: BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers

Title: A saccade-inspired approach to image classification using visiontransformer attention maps

Title: Grounding Synthetic Data Generation With Vision and Language Models

Title: X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models

Title: Well Log-Guided Synthesis of Subsurface Images from Sparse Petrography Data Using cGANs

Title: When to Lock Attention: Training-Free KV Control in Video Diffusion

Title: GNNs for Time Series Anomaly Detection: An Open-Source Framework and a Critical Evaluation

Title: Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records

Title: TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR

Title: FrameDiT: Diffusion Transformer with Frame-Level Matrix Attention for Efficient Video Generation

Title: FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis

Title: LAP: A Language-Aware Planning Model For Procedure Planning In Instructional Videos

Title: LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control

Title: Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning

Title: ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation

Title: Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Title: WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition

Title: Adaptive Clinical-Aware Latent Diffusion for Multimodal Brain Image Generation and Missing Modality Imputation

Title: Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective

Title: SignalMC-MED: A Multimodal Benchmark for Evaluating Biosignal Foundation Models on Single-Lead ECG and PPG

Title: From Semantics to Pixels: Coarse-to-Fine Masked Autoencoders for Hierarchical Visual Understanding