2025-08-05

Title: PCS Workflow for Veridical Data Science in the Age of AI

Title: A Residual Guided strategy with Generative Adversarial Networks in training Physics-Informed Transformer Networks

Title: Phase-fraction guided denoising diffusion model for augmenting multiphase steel microstructure segmentation via micrograph image-mask pair synthesis

Title: Beyond Benchmarks: Dynamic, Automatic And Systematic Red-Teaming Agents For Trustworthy Medical Language Models

Title: From Generator to Embedder: Harnessing Innate Abilities of Multimodal LLMs via Building Zero-Shot Discriminative Embedding Model

Title: VAULT: Vigilant Adversarial Updates via LLM-Driven Retrieval-Augmented Generation for NLI

Title: Masked Omics Modeling for Multimodal Representation Learning across Histopathology and Molecular Profiles

Title: ROVI: A VLM-LLM Re-Captioned Dataset for Open-Vocabulary Instance-Grounded Text-to-Image Generation

Title: v-PuNNs: van der Put Neural Networks for Transparent Ultrametric Representation Learning

Title: Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans

Title: Flow Matching for Probabilistic Learning of Dynamical Systems from Missing or Noisy Data

Title: A hierarchy tree data structure for behavior-based user segment representation

Title: UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation

Title: Transformers in Pseudo-Random Number Generation: A Dual Perspective on Theory and Practice

Title: Personalized Safety Alignment for Text-to-Image Diffusion Models

Title: LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Title: RSPO: Risk-Seeking Policy Optimization for Pass@k and Max@k Metrics in Large Language Models

Title: BSL: A Unified and Generalizable Multitask Learning Platform for Virtual Drug Discovery from Design to Synthesis

Title: StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling

Title: NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection

Title: SpatioTemporal Difference Network for Video Depth Super-Resolution

Title: Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling

Title: PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation

Title: GMAT: Grounded Multi-Agent Clinical Description Generation for Text Encoder in Vision-Language MIL for Whole Slide Image Classification

Title: Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion

Title: Effective Damage Data Generation by Fusing Imagery with Human Knowledge Using Vision-Language Models

Title: A Full-Stage Refined Proposal Algorithm for Suppressing False Positives in Two-Stage CNN-Based Detection Methods

Title: ForenX: Towards Explainable AI-Generated Image Detection with Multimodal Large Language Models

Title: Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation

Title: Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians

Title: ESM: A Framework for Building Effective Surrogate Models for Hardware-Aware Neural Architecture Search

Title: A Reward-Directed Diffusion Framework for Generative Design Optimization

Title: Canoe Paddling Quality Assessment Using Smart Devices: Preliminary Machine Learning Study

Title: MiraGe: Multimodal Discriminative Representation Learning for Generalizable AI-Generated Image Detection

Title: E-VRAG: Enhancing Long Video Understanding with Resource-Efficient Retrieval Augmented Generation

Title: A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Title: EvoVLMA: Evolutionary Vision-Language Model Adaptation

Title: A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction

Title: Diffusion Models for Future Networks and Communications: A Comprehensive Survey

Title: Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences

Title: Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment

Title: TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data

Title: Privacy-Preserving Inference for Quantized BERT Models

Title: StrandDesigner: Towards Practical Strand Generation with Sketch Guidance

Title: DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing

Title: Versatile Transition Generation with Image-to-Video Diffusion

Title: TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding

Title: Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization

Title: Improving Noise Efficiency in Privacy-preserving Dataset Distillation

Title: DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion

Title: Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems

Title: DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization

Title: Optimizing Day-Ahead Energy Trading with Proximal Policy Optimization and Blockchain

Title: How Does Controllability Emerge In Language Models During Pretraining?

Title: Proactive Disentangled Modeling of Trigger-Object Pairings for Backdoor Defense

Title: Accelerating LLM Reasoning via Early Rejection with Partial Reward Modeling

Title: Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization

Title: Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention

Title: An Evolving Scenario Generation Method based on Dual-modal Driver Model Trained by Multi-Agent Reinforcement Learning

Title: Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving

Title: Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction

Title: StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion

Title: S-RRG-Bench: Structured Radiology Report Generation with Fine-Grained Evaluation Framework

Title: CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search

Title: Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis

Title: AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation

Title: Amber Pruner: Leveraging N:M Activation Sparsity for Efficient Prefill in Large Language Models

Title: A Neural Quality Metric for BRDF Models

Title: AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models

Title: DreamPainter: Image Background Inpainting for E-commerce Scenarios

Title: Subject or Style: Adaptive and Training-Free Mixture of LoRAs

Title: After the Party: Navigating the Mapping From Color to Ambient Lighting

Title: CAAD: Context-Aware Adaptive Decoding for Truthful Text Generation

Title: Balancing Information Accuracy and Response Timeliness in Networked LLMs

Title: Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor

Title: Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning

Title: CellForge: Agentic Design of Virtual Cell Models

Title: CAPO: Towards Enhancing LLM Reasoning through Verifiable Generative Credit Assignment

Title: Qwen-Image Technical Report

Title: MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

Title: Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering

Title: Uni-Layout: Integrating Human Feedback in Unified Layout Generation and Evaluation

Title: MindShot: Multi-Shot Video Reconstruction from fMRI with LLM Decoding

Title: Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation

Title: Federated Graph Unlearning

Title: AnalogCoder-Pro: Unifying Analog Circuit Generation and Optimization via Multi-modal LLMs

Title: StructSynth: Leveraging LLMs for Structure-Aware Tabular Data Synthesis in Low-Data Regimes

Title: ReMoMask: Retrieval-Augmented Masked Motion Generation

Title: DeepKoopFormer: A Koopman Enhanced Transformer Based Architecture for Time Series Forecasting