2025-10-16

Title: An Investigation of Memorization Risk in Healthcare Foundation Models

Title: Epistemic-aware Vision-Language Foundation Model for Fetal Ultrasound Interpretation

Title: Reference-Specific Unlearning Metrics Can Hide the Truth: A Reality Check

Title: SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding

Title: SeqBench: Benchmarking Sequential Narrative Generation in Text-to-Video Models

Title: SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion

Title: NeuroRVQ: Multi-Scale EEG Tokenization for Generative Large Brainwave Models

Title: Counting Hallucinations in Diffusion Models

Title: VPREG: An Optimal Control Formulation for Diffeomorphic Image Registration Based on the Variational Principle Grid Generation Method

Title: On the Reasoning Abilities of Masked Diffusion Language Models

Title: MimicParts: Part-aware Style Injection for Speech-Driven 3D Motion Generation

Title: Prompt-based Adaptation in Large-scale Vision Models: A Survey

Title: CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation

Title: End-to-End Multi-Modal Diffusion Mamba

Title: Universal Image Restoration Pre-training via Masked Degradation Classification

Title: Federated Conditional Conformal Prediction via Generative Models

Title: Km-scale dynamical downscaling through conformalized latent diffusion models

Title: Self-Augmented Visual Contrastive Decoding

Title: No-Reference Rendered Video Quality Assessment: Dataset and Metrics

Title: Assessing the robustness of heterogeneous treatment effects in survival analysis under informative censoring

Title: Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation

Title: Neural Sum-of-Squares: Certifying the Nonnegativity of Polynomials with Transformers

Title: Near-Infrared Hyperspectral Imaging Applications in Food Analysis -- Improving Algorithms and Methodologies

Title: VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator

Title: Tahakom LLM guidelines and receipts: from pre-training data to an Arabic LLM

Title: ProtoTopic: Prototypical Network for Few-Shot Medical Topic Modeling

Title: Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings

Title: Challenges, Advances, and Evaluation Metrics in Medical Image Enhancement: A Systematic Literature Review

Title: Local-Global Context-Aware and Structure-Preserving Image Super-Resolution

Title: EditCast3D: Single-Frame-Guided 3D Editing with Video Propagation and View Selection

Title: CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas

Title: FlashWorld: High-quality 3D Scene Generation within Seconds

Title: Generating healthy counterfactuals with denoising diffusion bridge models

Title: MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion

Title: Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling

Title: Cyclic Self-Supervised Diffusion for Ultra Low-field to High-field MRI Synthesis

Title: UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy

Title: InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue

Title: RECODE: Reasoning Through Code Generation for Visual Question Answering

Title: Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Title: NoisePrints: Distortion-Free Watermarks for Authorship in Private Diffusion Models

Title: Generative Universal Verifier as Multimodal Meta-Reasoner

Title: PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning