2025-08-04

Title: ECG Latent Feature Extraction with Autoencoders for Downstream Prediction Tasks

Title: World Consistency Score: A Unified Metric for Video Generation Quality

Title: Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs

Title: DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission

Title: EMA Without the Lag: Bias-Corrected Iterate Averaging Schemes

Title: Learning Personalised Human Internal Cognition from External Expressive Behaviours for Real Personality Recognition

Title: Guided Depth Map Super-Resolution via Multi-Scale Fusion U-shaped Mamba Network

Title: Instruction-Grounded Visual Projectors for Continual Learning of Generative Vision-Language Models

Title: AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer

Title: Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence

Title: Exploring Fourier Prior and Event Collaboration for Low-Light Image Enhancement

Title: GV-VAD : Exploring Video Generation for Weakly-Supervised Video Anomaly Detection

Title: Steering Guidance for Personalized Text-to-Image Diffusion Models

Title: PnP-DA: Towards Principled Plug-and-Play Integration of Variational Data Assimilation and Generative Models

Title: BOOD: Boundary-based Out-Of-Distribution Data Generation

Title: Analyze-Prompt-Reason: A Collaborative Agent-Based Framework for Multi-Image Vision-Language Reasoning

Title: $MV_{Hybrid}$: Improving Spatial Transcriptomics Prediction with Hybrid State Space-Vision Transformer Backbone in Pathology Vision Foundation Models

Title: Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency

Title: PMR: Physical Model-Driven Multi-Stage Restoration of Turbulent Dynamic Videos

Title: Sortblock: Similarity-Aware Feature Reuse for Diffusion Model

Title: DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space

Title: TopoTTA: Topology-Enhanced Test-Time Adaptation for Tubular Structure Segmentation

Title: PIF-Net: Ill-Posed Prior Guided Multispectral and Hyperspectral Image Fusion via Invertible Mamba and Fusion-Aware LoRA

Title: Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution

Title: A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces

Title: LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer

Title: Court of LLMs: Evidence-Augmented Generation via Multi-LLM Collaboration for Text-Attributed Graph Anomaly Detection

Title: Video Color Grading via Look-Up Table Generation

Title: Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints

Title: Learning Potential Energy Surfaces of Hydrogen Atom Transfer Reactions in Peptides

Title: Wukong Framework for Not Safe For Work Detection in Text-to-Image systems

Title: Backdoor Attacks on Deep Learning Face Detection

Title: Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification

Title: Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network

Title: D3: Training-Free AI-Generated Video Detection Using Second-Order Features

Title: Democratizing Tabular Data Access with an Open$\unicode{x2013}$Source Synthetic$\unicode{x2013}$Data SDK

Title: YOLO-Count: Differentiable Object Counting for Text-to-Image Generation

Title: Is It Really You? Exploring Biometric Verification Scenarios in Photorealistic Talking-Head Avatar Videos

Title: SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation