2025-11-05

Title: Challenging DINOv3 Foundation Model under Low Inter-Class Variability: A Case Study on Fetal Brain Ultrasound

Title: Dynamic Population Distribution Aware Human Trajectory Generation with Diffusion Model

Title: Locally-Supervised Global Image Restoration

Title: Quantum-Enhanced Generative Models for Rare Event Prediction

Title: Text-VQA Aug: Pipelined Harnessing of Large Multimodal Models for Automated Synthesis

Title: Beyond Static Cutoffs: One-Shot Dynamic Thresholding for Diffusion Language Models

Title: Watermarking Discrete Diffusion Language Models

Title: Energy Loss Functions for Physical Systems

Title: Natural Building Blocks for Structured World Models: Theory, Evidence, and Scaling

Title: Language-Enhanced Generative Modeling for PET Synthesis from MRI and Blood Biomarkers

Title: Can Foundation Models Revolutionize Mobile AR Sparse Sensing?

Title: Federated Quantum Kernel Learning for Anomaly Detection in Multivariate IoT Time-Series

Title: Let Multimodal Embedders Learn When to Augment Query via Adaptive Query Augmentation

Title: CoCoVa: Chain of Continuous Vision-Language Thought for Latent Space Reasoning

Title: Self-Supervised Moving Object Segmentation of Sparse and Noisy Radar Point Clouds

Title: Purrturbed but Stable: Human-Cat Invariant Representations Across CNNs, ViTs and Self-Supervised ViTs

Title: KAO: Kernel-Adaptive Optimization in Diffusion for Satellite Image

Title: Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization

Title: DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding

Title: Adapting General-Purpose Foundation Models for X-ray Ptychography in Low-Data Regimes

Title: Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data

Title: Forecasting Future Anatomies: Longitudianl Brain Mri-to-Mri Prediction

Title: TAUE: Training-free Noise Transplant and Cultivation Diffusion Model

Title: Zero-Shot Multi-Animal Tracking in the Wild

Title: UniChange: Unifying Change Detection with Multimodal Large Language Model

Title: A Non-Adversarial Approach to Idempotent Generative Modelling

Title: VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models

Title: AI Diffusion in Low Resource Language Countries

Title: STAR-VAE: Latent Variable Transformers for Scalable and Controllable Molecular Generation

Title: AI-Generated Image Detection: An Empirical Study and Future Research Directions

Title: TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models

Title: Oolong: Evaluating Long Context Reasoning and Aggregation Capabilities

Title: PLUTO-4: Frontier Pathology Foundation Models

Title: GeoCrossBench: Cross-Band Generalization for Remote Sensing