2025-09-12

Title: Diffusion-Based Action Recognition Generalizes to Untrained Domains

Title: Live(r) Die: Predicting Survival in Colorectal Liver Metastasis

Title: Discovering Divergent Representations between Text-to-Image Models

Title: Deep Context-Conditioned Anomaly Detection for Tabular Data

Title: Integrating Anatomical Priors into a Causal Diffusion Model

Title: Enhancing 3D Medical Image Understanding with Pretraining Aided by 2D Multimodal Large Language Models

Title: MR-UIE: Multi-Perspective Reasoning with Reinforcement Learning for Universal Information Extraction

Title: S-BEVLoc: BEV-based Self-supervised Framework for Large-scale LiDAR Global Localization

Title: Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval

Title: Automated Classification of Tutors' Dialogue Acts Using Generative AI: A Case Study Using the CIMA Corpus

Title: ALL-PET: A Low-resource and Low-shot PET Foundation Model in the Projection Domain

Title: Video Understanding by Design: How Datasets Shape Architectures and Insights

Title: Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios

Title: GmSLM : Generative Marmoset Spoken Language Modeling

Title: Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and Enhancement

Title: Data Driven Discovery of Emergent Dynamics in Reaction Diffusion Systems from Sparse and Noisy Observations

Title: Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization

Title: Fine-Grained Customized Fashion Design with Image-into-Prompt benchmark and dataset from LMM

Title: Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment

Title: Plug-and-play Diffusion Models for Image Compressive Sensing with Data Consistency Projection

Title: Unsupervised Integrated-Circuit Defect Segmentation via Image-Intrinsic Normality

Title: Semantic Concentration for Self-Supervised Dense Representations Learning

Title: GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models

Title: Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation

Title: FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model

Title: Prompt Pirates Need a Map: Stealing Seeds helps Stealing Prompts

Title: OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection

Title: Towards Explainable Job Title Matching: Leveraging Semantic Textual Relatedness and Knowledge Graphs

Title: DeMeVa at LeWiDi-2025: Modeling Perspectives with In-Context Learning and Label Distribution Learning

Title: Generative Diffusion Contrastive Network for Multi-View Clustering

Title: DualTrack: Sensorless 3D Ultrasound needs Local and Global Context

Title: Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders

Title: InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

Title: PeftCD: Leveraging Vision Foundation Models with Parameter-Efficient Fine-Tuning for Remote Sensing Change Detection

Title: Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth

Title: ReBaNO: Reduced Basis Neural Operator Mitigating Generalization Gaps and Achieving Discretization Invariance

Title: Geometric Neural Distance Fields for Learning Human Motion Priors

Title: Locality in Image Diffusion Models Emerges from Data Statistics