2025-07-04

Title: Continuous Wavelet Transform and Siamese Network-Based Anomaly Detection in Multi-variate Semiconductor Process Time Series

Title: Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges

Title: GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters

Title: Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model

Title: Energy-Based Transformers are Scalable Learners and Thinkers

Title: Can Artificial Intelligence solve the blockchain oracle problem? Unpacking the Challenges and Possibilities

Title: Generative Latent Diffusion for Efficient Spatiotemporal Data Reduction

Title: Non-exchangeable Conformal Prediction for Temporal Graph Neural Networks

Title: Understanding Trade offs When Conditioning Synthetic Data

Title: SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement

Title: Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization

Title: DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation

Title: MAGIC: Mask-Guided Diffusion Inpainting with Multi-Level Perturbations and Context-Aware Alignment for Few-Shot Anomaly Generation

Title: Transformer-based EEG Decoding: A Survey

Title: Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Title: Offline Reinforcement Learning with Penalized Action Noise Injection

Title: Evaluating Language Models For Threat Detection in IoT Security Logs

Title: Wildlife Target Re-Identification Using Self-supervised Learning in Non-Urban Settings

Title: PosDiffAE: Position-aware Diffusion Auto-encoder For High-Resolution Brain Tissue Classification Incorporating Artifact Restoration

Title: AvatarMakeup: Realistic Makeup Transfer for 3D Animatable Head Avatars

Title: CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real Scenarios

Title: Temporally-Aware Supervised Contrastive Learning for Polyp Counting in Colonoscopy

Title: RetrySQL: text-to-SQL training with retry data for self-correcting query generation

Title: Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning

Title: Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning

Title: Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation

Title: High-Order Deep Meta-Learning with Category-Theoretic Interpretation

Title: Guided Generation for Developable Antibodies

Title: Embedding-Based Federated Data Sharing via Differentially Private Conditional VAEs

Title: Learning few-step posterior samplers by unfolding and distillation of diffusion models

Title: APT: Adaptive Personalized Training for Diffusion Models with Limited Data

Title: UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation

Title: FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Title: Prompt learning with bounding box constraints for medical image segmentation

Title: RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation

Title: No time to train! Training-Free Reference-Based Instance Segmentation

Title: Multimodal Mathematical Reasoning with Diverse Solving Perspective

Title: LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Title: USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network

Title: Answer Matching Outperforms Multiple Choice for Language Model Evaluation

Title: AnyI2V: Animating Any Conditional Image with Motion Control

Title: Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching