2025-11-11

Title: Token Is All You Need: Cognitive Planning through Sparse Intent Alignment

Title: AGRAG: Advanced Graph-based Retrieval-Augmented Generation for LLMs

Title: In-Context-Learning-Assisted Quality Assessment Vision-Language Models for Metal Additive Manufacturing

Title: EVLP:Learning Unified Embodied Vision-Language Planner with Reinforced Supervised Fine-Tuning

Title: Effective Test-Time Scaling of Discrete Diffusion through Iterative Refinement

Title: Automatic Extraction of Road Networks by using Teacher-Student Adaptive Structural Deep Belief Network and Its Application to Landslide Disaster

Title: C3-Diff: Super-resolving Spatial Transcriptomics via Cross-modal Cross-content Contrastive Diffusion Modelling

Title: Video Text Preservation with Synthetic Text-Rich Videos

Title: DiffSwap++: 3D Latent-Controlled Diffusion for Identity-Preserving Face Swapping

Title: Fine-Tuning Vision-Language Models for Multimodal Polymer Property Prediction

Title: Depth-induced NTK: Bridging Over-parameterized Neural Networks and Deep Neural Kernels

Title: GRAVER: Generative Graph Vocabularies for Robust Graph Foundation Models Fine-tuning

Title: AutoHood3D: A Multi-Modal Benchmark for Automotive Hood Design and Fluid-Structure Interaction

Title: Walking the Schrödinger Bridge: A Direct Trajectory for Text-to-3D Generation

Title: Pose-Aware Multi-Level Motion Parsing for Action Quality Assessment

Title: KLASS: KL-Guided Fast Inference in Masked Diffusion Models

Title: Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale

Title: Position-Prior-Guided Network for System Matrix Super-Resolution in Magnetic Particle Imaging

Title: Catching Contamination Before Generation: Spectral Kill Switches for Agents

Title: AiEDA: An Open-Source AI-Aided Design Library for Design-to-Vector

Title: Understanding Cross Task Generalization in Handwriting-Based Alzheimer's Screening via Vision Language Adaptation

Title: Enhancing Diffusion Model Guidance through Calibration and Regularization

Title: Point Cloud Segmentation of Integrated Circuits Package Substrates Surface Defects Using Causal Inference: Dataset Construction and Methodology

Title: Predicting the Future by Retrieving the Past

Title: CGCE: Classifier-Guided Concept Erasure in Generative Models

Title: Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning

Title: AD-DAE: Unsupervised Modeling of Longitudinal Alzheimer's Disease Progression with Diffusion Auto-Encoder

Title: Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation

Title: A Dual-Mode ViT-Conditioned Diffusion Framework with an Adaptive Conditioning Bridge for Breast Cancer Segmentation

Title: MALeR: Improving Compositional Fidelity in Layout-Guided Generation

Title: MiVID: Multi-Strategic Self-Supervision for Video Frame Interpolation using Diffusion Model

Title: Lethe: Layer- and Time-Adaptive KV Cache Pruning for Reasoning-Intensive LLM Serving

Title: Advancing Ocean State Estimation with efficient and scalable AI

Title: Neodragon: Mobile Video Generation using Diffusion Transformer

Title: Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration

Title: Adapting Web Agents with Synthetic Supervision

Title: Latent Refinement via Flow Matching for Training-free Linear Inverse Problem Solving

Title: MambaOVSR: Multiscale Fusion with Global Motion Modeling for Chinese Opera Video Super-Resolution

Title: NURBGen: High-Fidelity Text-to-CAD Generation through LLM-Driven NURBS Modeling

Title: Scene-Aware Urban Design: A Human-AI Recommendation Framework Using Co-Occurrence Embeddings and Vision-Language Models

Title: Physics-Informed Image Restoration via Progressive PDE Integration

Title: Gait Recognition via Collaborating Discriminative and Generative Diffusion Models

Title: Test-Time Iterative Error Correction for Efficient Diffusion Models

Title: Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra

Title: RelightMaster: Precise Video Relighting with Multi-plane Light Images

Title: LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Title: Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective

Title: DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Title: Adaptive 3D Reconstruction via Diffusion Priors and Forward Curvature-Matching Likelihood Updates

Title: BuildingWorld: A Structured 3D Building Dataset for Urban Foundation Models

Title: AesTest: Measuring Aesthetic Intelligence from Perception to Production

Title: Route Experts by Sequence, not by Token

Title: TriShGAN: Enhancing Sparsity and Robustness in Multivariate Time Series Counterfactuals Explanation

Title: Practical Policy Distillation for Reinforcement Learning in Radio Access Networks

Title: Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity Masks

Title: AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer

Title: K-Stain: Keypoint-Driven Correspondence for H&E-to-IHC Virtual Staining

Title: SinSEMI: A One-Shot Image Generation Model and Data-Efficient Evaluation Framework for Semiconductor Inspection Equipment

Title: Image Restoration via Primal Dual Hybrid Gradient and Flow Generative Model

Title: TiS-TSL: Image-Label Supervised Surgical Video Stereo Matching via Time-Switchable Teacher-Student Learning

Title: Integrating Reweighted Least Squares with Plug-and-Play Diffusion Priors for Noisy Image Restoration

Title: MUGSQA: Novel Multi-Uncertainty-Based Gaussian Splatting Quality Assessment Method, Dataset, and Benchmarks

Title: ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search

Title: Contact Wasserstein Geodesics for Non-Conservative Schrodinger Bridges

Title: VAEVQ: Enhancing Discrete Visual Tokenization through Variational Modeling

Title: Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions

Title: A Two-Stage System for Layout-Controlled Image Generation using Large Language Models and Diffusion Models

Title: FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection

Title: PADM: A Physics-aware Diffusion Model for Attenuation Correction

Title: Oh That Looks Familiar: A Novel Similarity Measure for Spreadsheet Template Discovery

Title: CoLM: Collaborative Large Models via A Client-Server Paradigm

Title: Performance Decay in Deepfake Detection: The Limitations of Training on Outdated Data

Title: Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation

Title: RaLD: Generating High-Resolution 3D Radar Point Clouds with Latent Diffusion

Title: How Bias Binds: Measuring Hidden Associations for Bias Control in Text-to-Image Compositions

Title: GEWDiff: Geometric Enhanced Wavelet-based Diffusion Model for Hyperspectral Image Super-resolution

Title: On the Joint Minimization of Regularization Loss Functions in Deep Variational Bayesian Methods for Attribute-Controlled Symbolic Music Generation

Title: ProcGen3D: Learning Neural Procedural Graph Representations for Image-to-3D Reconstruction

Title: Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation

Title: Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning

Title: LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors

Title: Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative Trigger Optimization

Title: Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images

Title: 4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D Generation

Title: Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

Title: LMM-IQA: Image Quality Assessment for Low-Dose CT Imaging

Title: Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training

Title: Inference-Time Scaling of Diffusion Models for Infrared Data Generation

Title: Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training

Title: Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion

Title: A Diffusion Model to Shrink Proteins While Maintaining Their Function

Title: StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

Title: DIMO: Diverse 3D Motion Generation for Arbitrary Objects

Title: Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs