2025-08-26

Title: Towards High-Precision Depth Sensing via Monocular-Aided iToF and RGB Integration

Title: CrystalDiT: A Diffusion Transformer for Crystal Generation

Title: A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction

Title: From Classical Probabilistic Latent Variable Models to Modern Generative AI: A Unified Perspective

Title: CountLoop: Training-Free High-Instance Image Generation via Iterative Agent Guidance

Title: QA-VLM: Providing human-interpretable quality assessment for wire-feed laser additive manufacturing parts with Vision Language Models

Title: Multidimensional Distributional Neural Network Output Demonstrated in Super-Resolution of Surface Wind Speed

Title: A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers

Title: WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation

Title: Latent Graph Learning in Generative Models of Neural Signals

Title: Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data

Title: Delta-SVD: Efficient Compression for Personalized Text-to-Image Models

Title: AWM-Fuse: Multi-Modality Image Fusion for Adverse Weather via Global and Local Text Perception

Title: MDIQA: Unified Image Quality Assessment for Multi-dimensional Evaluation and Restoration

Title: Structural Energy-Guided Sampling for View-Consistent Text-to-3D

Title: NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability

Title: Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter

Title: Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Title: RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze

Title: HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching

Title: Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation

Title: A Novel Local Focusing Mechanism for Deepfake Detection Generalization

Title: Styleclone: Face Stylization with Diffusion Based Data Augmentation

Title: PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models

Title: REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework

Title: SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation

Title: Two Birds with One Stone: Enhancing Uncertainty Quantification and Interpretability with Graph Functional Neural Process

Title: PlantVillageVQA: A Visual Question Answering Dataset for Benchmarking Vision-Language Models in Plant Science

Title: Structural Damage Detection Using AI Super Resolution and Visual Language Model

Title: MMCIG: Multimodal Cover Image Generation for Text-only Documents and Its Dataset Construction via Pseudo-labeling

Title: How to make Medical AI Systems safer? Simulating Vulnerabilities, and Threats in Multimodal Medical RAG System

Title: Explain Before You Answer: A Survey on Compositional Visual Reasoning

Title: PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing

Title: Defending Deepfake via Texture Feature Perturbation

Title: SpecGen: Neural Spectral BRDF Generation via Spectral-Spatial Tri-plane Aggregation

Title: ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation

Title: No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection

Title: DiCache: Let Diffusion Model Determine Its Own Cache

Title: Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation

Title: ShaLa: Multimodal Shared Latent Space Modelling

Title: Enhancing Underwater Images via Deep Learning: A Comparative Study of VGG19 and ResNet50-Based Approaches

Title: MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling

Title: Constrained Prompt Enhancement for Improving Zero-Shot Generalization of Vision-Language Models

Title: Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling

Title: TinySR: Pruning Diffusion for Real-World Image Super-Resolution

Title: An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing

Title: TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Title: A Synthetic Dataset for Manometry Recognition in Robotic Applications

Title: T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Title: OmniMRI: A Unified Vision--Language Foundation Model for Generalist MRI Interpretation

Title: MetaGen: A DSL, Database, and Benchmark for VLM-Assisted Metamaterial Generation

Title: IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data

Title: HERO: Hierarchical Extrapolation and Refresh for Efficient World Models

Title: ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning

Title: JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on

Title: ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion

Title: Longitudinal Progression Prediction of Alzheimer's Disease with Tabular Foundation Model

Title: Hierarchical Vision-Language Learning for Medical Out-of-Distribution Detection

Title: Towards Synthesizing Normative Data for Cognitive Assessments Using Generative Multimodal Large Language Models

Title: Characterizing the Behavior of Training Mamba-based State Space Models on GPUs

Title: Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery

Title: Copyright Protection for 3D Molecular Structures with Watermarking

Title: CATformer: Contrastive Adversarial Transformer for Image Super-Resolution

Title: F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model

Title: Instant Preference Alignment for Text-to-Image Diffusion Models

Title: Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework

Title: Randomly Removing 50% of Dimensions in Text Embeddings has Minimal Impact on Retrieval and Classification Tasks

Title: Multi-layer Abstraction for Nested Generation of Options (MANGO) in Hierarchical Reinforcement Learning

Title: SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling

Title: CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation

Title: Multi-domain Distribution Learning for De Novo Drug Design

Title: HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation

Title: Diffusion-Based Data Augmentation for Medical Image Segmentation

Title: Edge-Enhanced Vision Transformer Framework for Accurate AI-Generated Image Detection

Title: UniAPO: Unified Multimodal Automated Prompt Optimization

Title: Generative Feature Imputing - A Technique for Error-resilient Semantic Communication

Title: A Novel Framework for Uncertainty Quantification via Proper Scores for Classification and Beyond

Title: AQ-PCDSys: An Adaptive Quantized Planetary Crater Detection System for Autonomous Space Exploration

Title: FCR: Investigating Generative AI models for Forensic Craniofacial Reconstruction

Title: Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation

Title: Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem

Title: Provable Mixed-Noise Learning with Flow-Matching

Title: SpotEdit: Evaluating Visually-Guided Image Editing Methods

Title: Amortized Sampling with Transferable Normalizing Flows

Title: Sealing The Backdoor: Unlearning Adversarial Text Triggers In Diffusion Models Using Knowledge Distillation

Title: Interpretable Evaluation of AI-Generated Content with Language-Grounded Sparse Encoders