2025-10-16

Title: A\textsuperscript{2}FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

Title: An Investigation of Memorization Risk in Healthcare Foundation Models

Title: Epistemic-aware Vision-Language Foundation Model for Fetal Ultrasound Interpretation

Title: CADE 2.5 - ZeResFDG: Frequency-Decoupled, Rescaled and Zero-Projected Guidance for SD/SDXL Latent Diffusion Models

Title: A Connection Between Score Matching and Local Intrinsic Dimension

Title: Reference-Specific Unlearning Metrics Can Hide the Truth: A Reality Check

Title: Machine Learning-Based Ultrasonic Weld Characterization Using Hierarchical Wave Modeling and Diffusion-Driven Distribution Alignment

Title: SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion

Title: True Self-Supervised Novel View Synthesis is Transferable

Title: NeuroRVQ: Multi-Scale EEG Tokenization for Generative Large Brainwave Models

Title: Counting Hallucinations in Diffusion Models

Title: Edit-Your-Interest: Efficient Video Editing via Feature Most-Similar Propagation

Title: On the Reasoning Abilities of Masked Diffusion Language Models

Title: DP-TTA: Test-time Adaptation for Transient Electromagnetic Signal Denoising via Dictionary-driven Prior Regularization

Title: Text Anomaly Detection with Simplified Isolation Kernel

Title: CleverCatch: A Knowledge-Guided Weak Supervision Model for Fraud Detection

Title: Prompt-based Adaptation in Large-scale Vision Models: A Survey

Title: CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation

Title: End-to-End Multi-Modal Diffusion Mamba

Title: Higher Satisfaction, Lower Cost: A Technical Report on How LLMs Revolutionize Meituan's Intelligent Interaction Systems

Title: Federated Conditional Conformal Prediction via Generative Models

Title: Km-scale dynamical downscaling through conformalized latent diffusion models

Title: LLM one-shot style transfer for Authorship Attribution and Verification

Title: Isolation-based Spherical Ensemble Representations for Anomaly Detection

Title: Group-Wise Optimization for Self-Extensible Codebooks in Vector Quantized Models

Title: Document Intelligence in the Era of Large Language Models: A Survey

Title: Contrastive Learning-Based Dependency Modeling for Anomaly Detection in Cloud Services

Title: Generalizing WiFi Gesture Recognition via Large-Model-Aware Semantic Distillation and Alignment

Title: Doing Things with Words: Rethinking Theory of Mind Simulation in Large Language Models

Title: Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation

Title: Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter

Title: Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings

Title: AVAR-Net: A Lightweight Audio-Visual Anomaly Recognition Framework with a Benchmark Dataset

Title: Towards Adversarial Robustness and Uncertainty Quantification in DINOv2-based Few-Shot Anomaly Detection

Title: Local-Global Context-Aware and Structure-Preserving Image Super-Resolution

Title: EditCast3D: Single-Frame-Guided 3D Editing with Video Propagation and View Selection

Title: Time Series Foundation Models: Benchmarking Challenges and Requirements

Title: Axial Neural Networks for Dimension-Free Foundation Models

Title: CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas

Title: FlashWorld: High-quality 3D Scene Generation within Seconds

Title: Generating healthy counterfactuals with denoising diffusion bridge models

Title: MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion

Title: NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching

Title: Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling

Title: Cyclic Self-Supervised Diffusion for Ultra Low-field to High-field MRI Synthesis

Title: UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy

Title: Scaling Vision Transformers for Functional MRI with Flat Maps

Title: UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations

Title: Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation

Title: NoisePrints: Distortion-Free Watermarks for Authorship in Private Diffusion Models

Title: BRIEF-Pro: Universal Context Compression with Short-to-Long Synthesis for Fast and Accurate Multi-Hop Reasoning

Title: Generative Universal Verifier as Multimodal Meta-Reasoner