2025-08-26

Title: CrystalDiT: A Diffusion Transformer for Crystal Generation

Title: A Novel Unified Extended Matrix for Graph Signal Processing: Theory and Application

Title: Enhancing Transformer-Based Foundation Models for Time Series Forecasting via Bagging, Boosting and Statistical Ensembles

Title: From Classical Probabilistic Latent Variable Models to Modern Generative AI: A Unified Perspective

Title: CountLoop: Training-Free High-Instance Image Generation via Iterative Agent Guidance

Title: A Laplace diffusion-based transformer model for heart rate forecasting within daily activity context

Title: OASIS: Open-world Adaptive Self-supervised and Imbalanced-aware System

Title: Trust but Verify! A Survey on Verification Design for Test-time Scaling

Title: FAIRWELL: Fair Multimodal Self-Supervised Learning for Wellbeing Prediction

Title: A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers

Title: GAICo: A Deployed and Extensible Framework for Evaluating Diverse and Multimodal Generative AI Outputs

Title: Toward Socially Aware Vision-Language Models: Evaluating Cultural Competence Through Multimodal Story Generation

Title: Latent Graph Learning in Generative Models of Neural Signals

Title: Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data

Title: Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities

Title: Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

Title: NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows

Title: Delta-SVD: Efficient Compression for Personalized Text-to-Image Models

Title: Structural Energy-Guided Sampling for View-Consistent Text-to-3D

Title: Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter

Title: RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze

Title: HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching

Title: Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation

Title: A Novel Local Focusing Mechanism for Deepfake Detection Generalization

Title: Styleclone: Face Stylization with Diffusion Based Data Augmentation

Title: PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models

Title: REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework

Title: SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation

Title: Linguistic Neuron Overlap Patterns to Facilitate Cross-lingual Transfer on Low-resource Languages

Title: Two Birds with One Stone: Enhancing Uncertainty Quantification and Interpretability with Graph Functional Neural Process

Title: GRASP: Geospatial pixel Reasoning viA Structured Policy learning

Title: Geolocation-Aware Robust Spoken Language Identification

Title: SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation

Title: 4D Visual Pre-training for Robot Learning

Title: Deep Learning-Assisted Detection of Sarcopenia in Cross-Sectional Computed Tomography Imaging

Title: Quickly Tuning Foundation Models for Image Segmentation

Title: FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising

Title: PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing

Title: ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation

Title: No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection

Title: DiCache: Let Diffusion Model Determine Its Own Cache

Title: ShaLa: Multimodal Shared Latent Space Modelling

Title: Retrieval Capabilities of Large Language Models Scale with Pretraining FLOPs

Title: DS@GT at CheckThat! 2025: A Simple Retrieval-First, LLM-Backed Framework for Claim Normalization

Title: Mutual Information Surprise: Rethinking Unexpectedness in Autonomous Systems

Title: E-BayesSAM: Efficient Bayesian Adaptation of SAM with Self-Optimizing KAN-Based Interpretation for Uncertainty-Aware Ultrasonic Segmentation

Title: Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling

Title: TinySR: Pruning Diffusion for Real-World Image Super-Resolution

Title: A Synthetic Dataset for Manometry Recognition in Robotic Applications

Title: Social-MAE: A Transformer-Based Multimodal Autoencoder for Face and Voice

Title: DinoTwins: Combining DINO and Barlow Twins for Robust, Label-Efficient Vision Transformers

Title: Modeling Irregular Astronomical Time Series with Neural Stochastic Delay Differential Equations

Title: OmniMRI: A Unified Vision--Language Foundation Model for Generalist MRI Interpretation

Title: In-Context Algorithm Emulation in Fixed-Weight Transformers

Title: IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data

Title: HERO: Hierarchical Extrapolation and Refresh for Efficient World Models

Title: JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on

Title: ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion

Title: Finding Outliers in a Haystack: Anomaly Detection for Large Pointcloud Scenes

Title: Longitudinal Progression Prediction of Alzheimer's Disease with Tabular Foundation Model

Title: Towards Synthesizing Normative Data for Cognitive Assessments Using Generative Multimodal Large Language Models

Title: Robustness Feature Adapter for Efficient Adversarial Training

Title: Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery

Title: On the Edge of Memorization in Diffusion Models

Title: Copyright Protection for 3D Molecular Structures with Watermarking

Title: CATformer: Contrastive Adversarial Transformer for Image Super-Resolution

Title: F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model

Title: Instant Preference Alignment for Text-to-Image Diffusion Models

Title: Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework

Title: SMITE: Enhancing Fairness in LLMs through Optimal In-Context Example Selection via Dynamic Validation

Title: Randomly Removing 50% of Dimensions in Text Embeddings has Minimal Impact on Retrieval and Classification Tasks

Title: SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling

Title: CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation

Title: Proximal Supervised Fine-Tuning

Title: Robust Anomaly Detection in Industrial Environments via Meta-Learning

Title: Multi-domain Distribution Learning for De Novo Drug Design

Title: UniSino: Physics-Driven Foundational Model for Universal CT Sinogram Standardization

Title: A Contrastive Learning-Guided Confident Meta-learning for Zero Shot Anomaly Detection

Title: Diffusion-Based Data Augmentation for Medical Image Segmentation

Title: Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs

Title: Edge-Enhanced Vision Transformer Framework for Accurate AI-Generated Image Detection

Title: Evaluating the Representation of Vowels in Wav2Vec Feature Extractor: A Layer-Wise Analysis Using MFCCs

Title: EndoUFM: Utilizing Foundation Models for Monocular depth estimation of endoscopic images

Title: Generative Feature Imputing - A Technique for Error-resilient Semantic Communication

Title: A Novel Framework for Uncertainty Quantification via Proper Scores for Classification and Beyond

Title: Fence off Anomaly Interference: Cross-Domain Distillation for Fully Unsupervised Anomaly Detection

Title: Towards Continual Visual Anomaly Detection in the Medical Domain

Title: FCR: Investigating Generative AI models for Forensic Craniofacial Reconstruction

Title: Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images

Title: How Quantization Shapes Bias in Large Language Models

Title: Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study

Title: Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem

Title: Provable Mixed-Noise Learning with Flow-Matching

Title: Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation

Title: $AutoGuardX$: A Comprehensive Cybersecurity Framework for Connected Vehicles

Title: SpotEdit: Evaluating Visually-Guided Image Editing Methods

Title: Amortized Sampling with Transferable Normalizing Flows

Title: Leveraging Large Language Models for Accurate Sign Language Translation in Low-Resource Scenarios

Title: Better Language Model-Based Judging Reward Modeling through Scaling Comprehension Boundaries

Title: Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance

Title: Sealing The Backdoor: Unlearning Adversarial Text Triggers In Diffusion Models Using Knowledge Distillation

Title: Interpretable Evaluation of AI-Generated Content with Language-Grounded Sparse Encoders

Title: ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models