2025-09-12

Title: Recurrence Meets Transformers for Universal Multimodal Retrieval

Title: PromptGuard: An Orchestrated Prompting Framework for Principled Synthetic Text Generation for Vulnerable Populations using LLMs with Enhanced Safety, Fairness, and Controllability

Title: Discovering Divergent Representations between Text-to-Image Models

Title: Integrating Anatomical Priors into a Causal Diffusion Model

Title: ALL-PET: A Low-resource and Low-shot PET Foundation Model in the Projection Domain

Title: Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation

Title: HISPASpoof: A New Dataset For Spanish Speech Forensics

Title: Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios

Title: VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results

Title: Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis

Title: Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization

Title: Fine-Grained Customized Fashion Design with Image-into-Prompt benchmark and dataset from LMM

Title: FS-Diff: Semantic guidance and clarity-aware simultaneous multimodal image fusion and super-resolution

Title: Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation

Title: OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection

Title: Region-Wise Correspondence Prediction between Manga Line Art Images

Title: Generative Diffusion Contrastive Network for Multi-View Clustering

Title: Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders

Title: InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

Title: Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Title: Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth

Title: ReBaNO: Reduced Basis Neural Operator Mitigating Generalization Gaps and Achieving Discretization Invariance

Title: Can Understanding and Generation Truly Benefit Together -- or Just Coexist?

Title: Geometric Neural Distance Fields for Learning Human Motion Priors

Title: Locality in Image Diffusion Models Emerges from Data Statistics

Title: FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark