2025-05-22

Title: CrypticBio: A Large Multimodal Dataset for Visually Confusing Biodiversity

Title: DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance

Title: Time Series Similarity Score Functions to Monitor and Interact with the Training and Denoising Process of a Time Series Diffusion Model applied to a Human Activity Recognition Dataset based on IMUs

Title: Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

Title: Large Language Models for Data Synthesis

Title: This Time is Different: An Observability Perspective on Time Series Foundation Models

Title: KO: Kinetics-inspired Neural Optimizer with PDE Simulation Approaches

Title: Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation

Title: Leveraging Generative AI Models to Explore Human Identity

Title: A self-regulated convolutional neural network for classifying variable stars

Title: In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties

Title: Foundations of Unknown-aware Machine Learning

Title: Anomaly Detection Based on Critical Paths for Deep Neural Networks

Title: Flattening Hierarchies with Policy Bootstrapping

Title: Meta-Design Matters: A Self-Design Multi-Agent System

Title: One-Layer Transformers are Provably Optimal for In-context Reasoning and Distributional Association Learning in Next-Token Prediction Tasks

Title: RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning

Title: Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Title: Improving the fact-checking performance of language models by relying on their entailment ability

Title: Generalization Through Growth: Hidden Dynamics Controls Depth Dependence

Title: Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories

Title: Data Augmentation and Resolution Enhancement using GANs and Diffusion Models for Tree Segmentation

Title: Mechanistic evaluation of Transformers and state space models

Title: Graph Foundation Models: A Comprehensive Survey

Title: Filtering Learning Histories Enhances In-Context Reinforcement Learning

Title: From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation

Title: Time Tracker: Mixture-of-Experts-Enhanced Foundation Time Series Forecasting Model with Decoupled Training Pipelines

Title: Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation

Title: MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models

Title: Leveraging Foundation Models for Multimodal Graph-Based Action Recognition

Title: Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection

Title: Multimodal Conditional Information Bottleneck for Generalizable AI-Generated Image Detection

Title: Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework

Title: VET-DINO: Learning Anatomical Understanding Through Multi-View Distillation in Veterinary Imaging

Title: Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets

Title: An Efficient Private GPT Never Autoregressively Decodes

Title: gen2seg: Generative Models Enable Generalizable Instance Segmentation

Title: Scaling Diffusion Transformers Efficiently via $μ$P

Title: FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion

Title: Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification

Title: My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping

Title: Revealing Language Model Trajectories via Kullback-Leibler Divergence

Title: Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT

Title: Responsible Diffusion Models via Constraining Text Embeddings within Safe Regions

Title: Bridging Sign and Spoken Languages: Pseudo Gloss Generation for Sign Language Translation

Title: Stronger ViTs With Octic Equivariance

Title: Comprehensive Evaluation and Analysis for NSFW Concept Erasure in Text-to-Image Diffusion Models

Title: Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution Shifts

Title: NOMAD Projection

Title: Detection of Underwater Multi-Targets Based on Self-Supervised Learning and Deformable Path Aggregation Feature Pyramid Network

Title: PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting

Title: Bridging the Domain Gap in Equation Distillation with Reinforcement Feedback

Title: Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade-off

Title: FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models

Title: The Devil is in Fine-tuning and Long-tailed Problems: A New Benchmark for Scene Text Detection

Title: Graph Conditional Flow Matching for Relational Data Generation

Title: UniErase: Unlearning Token as a Universal Erasure Primitive for Language Models

Title: Constructing a 3D Town from a Single Image

Title: dKV-Cache: The Cache for Diffusion Language Models

Title: Large Language Models as Computable Approximations to Solomonoff Induction

Title: VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Title: Interspatial Attention for Efficient 4D Human Video Generation

Title: The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation

Title: Neural Conditional Transport Maps

Title: MMaDA: Multimodal Large Diffusion Language Models

Title: On the creation of narrow AI: hierarchy and nonlocality of neural network skills

Title: Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization

Title: Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex