2025-09-10

Title: VLMs-in-the-Wild: Bridging the Gap Between Academic Benchmarks and Enterprise Reality

Title: Visible Yet Unreadable: A Systematic Blind Spot of Vision Language Models Across Writing Systems

Title: K-Syn: K-space Data Synthesis in Ultra Low-data Regimes

Title: Human-in-the-Loop: Quantitative Evaluation of 3D Models Generation by Large Language Models

Title: Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models

Title: Automated Evaluation of Gender Bias Across 13 Large Multimodal Models

Title: PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design

Title: Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion

Title: Reconstruction Alignment Improves Unified Multimodal Models

Title: CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation

Title: The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Title: DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation

Title: ANYPORTAL: Zero-Shot Consistent Video Background Replacement

Title: EHWGesture -- A dataset for multimodal understanding of clinical gestures

Title: PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image

Title: $ΔL$ Normalization: Rethink Loss Aggregation in RLVR

Title: uGMM-NN: Univariate Gaussian Mixture Model Neural Network

Title: Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation

Title: Beyond Rebalancing: Benchmarking Binary Classifiers Under Class Imbalance Without Rebalancing Techniques

Title: Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer's Disease

Title: Self-Supervised Cross-Encoder for Neurodegenerative Disease Diagnosis

Title: Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity

Title: Faster, Self-Supervised Super-Resolution for Anisotropic Multi-View MRI Using a Sparse Coordinate Loss

Title: Feature Space Analysis by Guided Diffusion Model

Title: Bringing Multi-Modal Multi-Task Federated Foundation Models to Education Domain: Prospects and Challenges

Title: Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Title: One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation