2026-03-10

Title: vLLM Hook v0: A Plug-in for Programming Model Internals on vLLM

Title: Switchable Activation Networks

Title: Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection

Title: Correlation Analysis of Generative Models

Title: Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

Title: Evo: Autoregressive-Diffusion Large Language Models with Evolving Balance

Title: Advances in GRPO for Generation Models: A Survey

Title: SmartBench: Evaluating LLMs in Smart Homes with Anomalous Device States and Behavioral Contexts

Title: HEARTS: Benchmarking LLM Reasoning on Health Time Series

Title: HURRI-GAN: A Novel Approach for Hurricane Bias-Correction Beyond Gauge Stations using Generative Adversarial Networks

Title: EnsAug: Augmentation-Driven Ensembles for Human Motion Sequence Analysis

Title: HyperTokens: Controlling Token Dynamics for Continual Video-Language Understanding

Title: Accelerating Video Generation Inference with Sequential-Parallel 3D Positional Encoding Using a Global Time Index

Title: SJD-PV: Speculative Jacobi Decoding with Phrase Verification for Autoregressive Image Generation

Title: Does Semantic Noise Initialization Transfer from Images to Videos? A Paired Diagnostic Study

Title: Chart Deep Research in LVLMs via Parallel Relative Policy Optimization

Title: VB: Visibility Benchmark for Visibility and Perspective Reasoning in Images

Title: ECHO: Event-Centric Hypergraph Operations via Multi-Agent Collaboration for Multimedia Event Extraction

Title: One step further with Monte-Carlo sampler to guide diffusion better

Title: Narrative Weaver: Towards Controllable Long-Range Visual Consistency with Multi-Modal Conditioning

Title: SIQA: Toward Reliable Scientific Image Quality Assessment

Title: From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories

Title: Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment

Title: Vessel-Aware Deep Learning for OCTA-Based Detection of AMD

Title: Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention

Title: Heterogeneous Decentralized Diffusion Models

Title: Improved Constrained Generation by Bridging Pretrained Generative Models

Title: Enhancing Instruction Following of LLMs via Activation Steering with Dynamic Rejection

Title: Implementation of Quantum Implicit Neural Representation in Deterministic and Probabilistic Autoencoders for Image Reconstruction/Generation Tasks

Title: Failure Detection in Chemical Processes using Symbolic Machine Learning: A Case Study on Ethylene Oxidation

Title: xaitimesynth: A Python Package for Evaluating Attribution Methods for Time Series with Synthetic Ground Truth

Title: Physics-Informed Diffusion Model for Generating Synthetic Extreme Rare Weather Events Data

Title: NEST: Network- and Memory-Aware Device Placement For Distributed Deep Learning

Title: Stochastic Attention via Langevin Dynamics on the Modern Hopfield Energy

Title: Learning From Design Procedure To Generate CAD Programs for Data Augmentation

Title: XGenBoost: Synthesizing Small and Large Tabular Datasets with XGBoost

Title: HIERAMP: Coarse-to-Fine Autoregressive Amplification for Generative Dataset Distillation

Title: Joint MDPs and Reinforcement Learning in Coupled-Dynamics Environments

Title: SurgCUT3R: Surgical Scene-Aware Continuous Understanding of Temporal 3D Representation

Title: Conditional Unbalanced Optimal Transport Maps: An Outlier-Robust Framework for Conditional Generative Modeling

Title: Diffusion Controller: Framework, Algorithms and Parameterization

Title: AdaGen: Learning Adaptive Policy for Image Synthesis

Title: Resource-Adaptive Federated Text Generation with Differential Privacy

Title: SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer

Title: MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering

Title: Physics-Guided VLM Priors for All-Cloud Removal

Title: Entropy-Aware On-Policy Distillation of Language Models

Title: Facial Expression Generation Aligned with Human Preference for Natural Dyadic Interaction

Title: TIQA: Human-Aligned Text Quality Assessment in Generated Images

Title: CanoVerse: 3D Object Scalable Canonicalization and Dataset for Generation and Pose

Title: LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models

Title: Agentic Planning with Reasoning for Image Styling via Offline RL

Title: Class Visualizations and Activation Atlases for Enhancing Interpretability in Deep Learning-Based Computational Pathology

Title: FastSTAR: Spatiotemporal Token Pruning for Efficient Autoregressive Video Synthesis

Title: Retrieval-Augmented Generation for Predicting Cellular Responses to Gene Perturbation

Title: Single Image Super-Resolution via Bivariate `A Trous Wavelet Diffusion

Title: FabricGen: Microstructure-Aware Woven Fabric Generation

Title: PresentBench: A Fine-Grained Rubric-Based Benchmark for Slide Generation

Title: Variational Flow Maps: Make Some Noise for One-Step Conditional Generation

Title: MAviS: A Multimodal Conversational Assistant For Avian Species

Title: A Lightweight Digital-Twin-Based Framework for Edge-Assisted Vehicle Tracking and Collision Prediction

Title: Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems

Title: ConfHit: Conformal Generative Design with Oracle Free Guarantees

Title: AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions

Title: VIVECaption: A Split Approach to Caption Quality Improvement

Title: Prompt-Based Caption Generation for Single-Tooth Dental Images Using Vision-Language Models

Title: UnSCAR: Universal, Scalable, Controllable, and Adaptable Image Restoration

Title: Context Channel Capacity: An Information-Theoretic Framework for Understanding Catastrophic Forgetting

Title: Disentangled Textual Priors for Diffusion-based Image Super-Resolution

Title: Image Generation Models: A Technical History

Title: Contact-Guided 3D Genome Structure Generation of E. coli via Diffusion Transformers

Title: EVLF: Early Vision-Language Fusion for Generative Dataset Distillation

Title: RobustSCI: Beyond Reconstruction to Restoration for Snapshot Compressive Imaging under Real-World Degradations

Title: High-Fidelity Medical Shape Generation via Skeletal Latent Diffusion

Title: Reinforcement learning-based dynamic cleaning scheduling framework for solar energy system

Title: One-for-All Model Initialization with Frequency-Domain Knowledge

Title: Generative prediction of laser-induced rocket ignition with dynamic latent space representations

Title: How Long Can Unified Multimodal Models Generate Images Reliably? Taming Long-Horizon Interleaved Image Generation via Context Curation

Title: CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization

Title: DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration

Title: ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction

Title: Brain-WM: Brain Glioblastoma World Model

Title: GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module

Title: Constraints Matrix Diffusion based Generative Neural Solver for Vehicle Routing Problems

Title: Integration of deep generative Anomaly Detection algorithm in high-speed industrial line

Title: EmbedTalk: Triplane-Free Talking Head Synthesis using Embedding-Driven Gaussian Deformation

Title: Looking Into the Water by Unsupervised Learning of the Surface Shape

Title: Compression as Adaptation: Implicit Visual Representation with Diffusion Foundation Models

Title: Evaluating Synthetic Data for Baggage Trolley Detection in Airport Logistics

Title: Compressed-Domain-Aware Online Video Super-Resolution

Title: Learning Context-Adaptive Motion Priors for Masked Motion Diffusion Models with Efficient Kinematic Attention Aggregation

Title: TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward

Title: PARSE: Part-Aware Relational Spatial Modeling

Title: Uncertainty-Gated Generative Modeling

Title: Geometric Knowledge-Assisted Federated Dual Knowledge Distillation Approach Towards Remote Sensing Satellite Imagery

Title: Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

Title: HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Title: Guess & Guide: Gradient-Free Zero-Shot Diffusion Guidance

Title: LeJOT-AutoML: LLM-Driven Feature Engineering for Job Execution Time Prediction in Databricks Cost Optimization

Title: Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning

Title: Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis

Title: ELLMob: Event-Driven Human Mobility Generation with Self-Aligned LLM Framework

Title: SGG-R$^{\rm 3}$: From Next-Token Prediction to End-to-End Unbiased Scene Graph Generation

Title: On the Feasibility and Opportunity of Autoregressive 3D Object Detection

Title: AutoTraces: Autoregressive Trajectory Forecasting via Multimodal Large Language Models

Title: Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared

Title: VSDiffusion: Taming Ill-Posed Shadow Generation via Visibility-Constrained Diffusion

Title: Not Like Transformers: Drop the Beat Representation for Dance Generation with Mamba-Based Diffusion Model

Title: Controllable Complex Human Motion Video Generation via Text-to-Skeleton Cascades

Title: QualiTeacher: Quality-Conditioned Pseudo-Labeling for Real-World Image Restoration

Title: GCGNet: Graph-Consistent Generative Network for Time Series Forecasting with Exogenous Variables

Title: ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning

Title: Evaluating Generative Models via One-Dimensional Code Distributions

Title: Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models

Title: DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation

Title: Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows

Title: Fast Low-light Enhancement and Deblurring for 3D Dark Scenes

Title: C$^2$FG: Control Classifier-Free Guidance via Score Discrepancy Analysis

Title: Learning Hierarchical Knowledge in Text-Rich Networks with Taxonomy-Informed Representation Learning

Title: Video2LoRA: Unified Semantic-Controlled Video Generation via Per-Reference-Video LoRA

Title: SRNeRV: A Scale-wise Recursive Framework for Neural Video Representation

Title: GarmentPainter: Efficient 3D Garment Texture Synthesis with Character-Guided Diffusion Model

Title: Exploring Deep Learning and Ultra-Widefield Imaging for Diabetic Retinopathy and Macular Edema

Title: WaDi: Weight Direction-aware Distillation for One-step Image Synthesis

Title: Prototype-Guided Concept Erasure in Diffusion Models

Title: Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation

Title: HDR-NSFF: High Dynamic Range Neural Scene Flow Fields

Title: $Δ$VLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation

Title: Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation

Title: SPIRAL: A Closed-Loop Framework for Self-Improving Action World Models via Reflective Planning Agents

Title: LycheeCluster: Efficient Long-Context Inference with Structure-Aware Chunking and Hierarchical KV Indexing

Title: Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Title: X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection

Title: SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution

Title: BioGait-VLM: A Tri-Modal Vision-Language-Biomechanics Framework for Interpretable Clinical Gait Assessment

Title: PRISM: Streaming Human Motion Generation with Per-Joint Latent Decomposition

Title: CAST: Modeling Visual State Transitions for Consistent Video Retrieval

Title: HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising