2025-08-06

Title: ECGTwin: Personalized ECG Generation Using Controllable Diffusion Model

Title: DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework

Title: Clinically Grounded Agent-based Report Evaluation: An Interpretable Metric for Radiology Report Generation

Title: Elucidating the Role of Feature Normalization in IJEPA

Title: Learning from B Cell Evolution: Adaptive Multi-Expert Diffusion for Antibody Design via Online Optimization

Title: Highlight & Summarize: RAG without the jailbreaks

Title: CauKer: classification time series foundation models can be pretrained on synthetic data only

Title: RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation

Title: How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution

Title: GrandJury: A Collaborative Machine Learning Model Evaluation Protocol for Dynamic Quality Rubrics

Title: X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio

Title: Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver

Title: Diffusion Models with Adaptive Negative Sampling Without External Resources

Title: Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models

Title: Multi-Granularity Feature Calibration via VFM for Domain Generalized Semantic Segmentation

Title: MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention

Title: When Algorithms Meet Artists: Topic Modeling the AI-Art Debate, 2013-2025

Title: Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling

Title: A Novel Multimodal Framework for Early Detection of Alzheimers Disease Using Deep Learning

Title: Untraceable DeepFakes via Traceable Fingerprint Elimination

Title: HiTeC: Hierarchical Contrastive Learning on Text-Attributed Hypergraph with Semantic-Aware Augmentation

Title: H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction

Title: UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying

Title: SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance

Title: Quantum Spectral Reasoning: A Non-Neural Architecture for Interpretable Machine Learning

Title: SAVER: Mitigating Hallucinations in Large Vision-Language Models via Style-Aware Visual Early Revision

Title: Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance

Title: ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow

Title: BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models

Title: Zero-shot Shape Classification of Nanoparticles in SEM Images using Vision Foundation Models

Title: Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion

Title: V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models

Title: Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation

Title: Investigating Gender Bias in LLM-Generated Stories via Psychological Stereotypes

Title: Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation

Title: Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation

Title: FedPromo: Federated Lightweight Proxy Models at the Edge Bring New Domains to Foundation Models

Title: Thinking with Nothinking Calibration: A New In-Context Learning Paradigm in Reasoning Large Language Models

Title: Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration

Title: SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models

Title: Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN

Title: R2GenKG: Hierarchical Multi-modal Knowledge Graph for LLM-based Radiology Report Generation

Title: AI on the Pulse: Real-Time Health Anomaly Detection with Wearable and Ambient Intelligence

Title: Spatial Imputation Drives Cross-Domain Alignment for EEG Classification

Title: MedCAL-Bench: A Comprehensive Benchmark on Cold-Start Active Learning with Foundation Models for Medical Image Analysis

Title: RAAG: Ratio Aware Adaptive Guidance

Title: CoPS: Conditional Prompt Synthesis for Zero-Shot Anomaly Detection

Title: Cropping outperforms dropout as an augmentation strategy for training self-supervised text embeddings

Title: READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation

Title: VideoGuard: Protecting Video Content from Unauthorized Editing

Title: Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models

Title: When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models

Title: LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation

Title: ParticleSAM: Small Particle Segmentation for Material Quality Monitoring in Recycling Processes

Title: MAUP: Training-free Multi-center Adaptive Uncertainty-aware Prompting for Cross-domain Few-shot Medical Image Segmentation

Title: Semantic Mosaicing of Histo-Pathology Image Fragments using Visual Foundation Models

Title: MoKA: Mixture of Kronecker Adapters

Title: EmbedGrad: Gradient-Based Prompt Optimization in Embedding Space for Large Language Models

Title: CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation

Title: Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection

Title: SAM2-UNeXT: An Improved High-Resolution Baseline for Adapting Foundation Models to Downstream Segmentation Tasks

Title: Zero-Variance Gradients for Variational Autoencoders

Title: VITA: Variational Pretraining of Transformers for Climate-Robust Crop Yield Forecasting

Title: evTransFER: A Transfer Learning Framework for Event-based Facial Expression Recognition

Title: A DbC Inspired Neurosymbolic Layer for Trustworthy Agent Design

Title: OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World

Title: Veila: Panoramic LiDAR Generation from a Monocular RGB Image

Title: La La LiDAR: Large-Scale Layout Generation from LiDAR Data

Title: LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences