2025-08-14

Title: To Theoretically Understand Transformer-Based In-Context Learning for Optimizing CSMA

Title: Motif 2.6B Technical Report

Title: A Rolling Stone Gathers No Moss: Adaptive Policy Optimization for Stable Self-Evaluation in Large Multimodal Models

Title: Physics-Constrained Fine-Tuning of Flow-Matching Models for Generation and Inverse Problems

Title: EvaDrive: Evolutionary Adversarial Policy Optimization for End-to-End Autonomous Driving

Title: An Unsupervised Deep XAI Framework for Localization of Concurrent Replay Attacks in Nuclear Reactor Signals

Title: Generating Feasible and Diverse Synthetic Populations Using Diffusion Models

Title: IAD-R1: Reinforcing Consistent Reasoning in Industrial Anomaly Detection

Title: From Values to Tokens: An LLM-Driven Framework for Context-aware Time Series Forecasting via Symbolic Discretization

Title: Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing

Title: Multi-Objective Instruction-Aware Representation Learning in Procedural Content Generation RL

Title: Learning to Detect Unknown Jailbreak Attacks in Large Vision-Language Models: A Unified and Accurate Approach

Title: Towards Scalable Training for Handwritten Mathematical Expression Recognition

Title: FineState-Bench: A Comprehensive Benchmark for Fine-Grained State Control in GUI Agents

Title: Leveraging Large Language Models for Rare Disease Named Entity Recognition

Title: Lung-DDPM+: Efficient Thoracic CT Image Synthesis using Diffusion Probabilistic Model

Title: The Human-AI Hybrid Delphi Model: A Structured Framework for Context-Rich, Expert Consensus in Complex Domains

Title: Flow-SLM: Joint Learning of Linguistic and Acoustic Information for Spoken Language Modeling

Title: X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents

Title: Understanding Dementia Speech Alignment with Diffusion-Based Image Generation

Title: Graph Neural Network and Transformer Integration for Unsupervised System Anomaly Discovery

Title: Distilling LLM Prior to Flow Model for Generalizable Agent's Imagination in Object Goal Navigation

Title: RASR: Retrieval-Augmented Super Resolution for Practical Reference-based Image Restoration

Title: A Unified Contrastive-Generative Framework for Time Series Classification

Title: HyperKD: Distilling Cross-Spectral Knowledge in Masked Autoencoders via Inverse Domain Shift with Spatial-Aware Masking and Specialized Loss

Title: Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy

Title: CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios

Title: EGGS-PTP: An Expander-Graph Guided Structured Post-training Pruning Method for Large Language Models

Title: Leveraging Failed Samples: A Few-Shot and Training-Free Framework for Generalized Deepfake Detection

Title: CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection

Title: SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images

Title: SARE: Semantic-Aware Reconstruction Error for Generalizable Diffusion-Generated Image Detection

Title: Large-Small Model Collaborative Framework for Federated Continual Learning

Title: Causal Graph Profiling via Structural Divergence for Robust Anomaly Detection in Cyber-Physical Systems

Title: LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation

Title: Generation of Indian Sign Language Letters, Numbers, and Words

Title: Decentralized Rank Scheduling for Energy-Constrained Multi-Task Federated Fine-Tuning in Edge-Assisted IoV Networks

Title: Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification

Title: Edge General Intelligence Through World Models and Agentic AI: Fundamentals, Solutions, and Challenges

Title: Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion

Title: Images Speak Louder Than Scores: Failure Mode Escape for Enhancing Generative Quality

Title: MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography

Title: Semantic-aware DropSplat: Adaptive Pruning of Redundant Gaussians for 3D Aerial-View Segmentation

Title: NegFaceDiff: The Power of Negative Context in Identity-Conditioned Diffusion for Synthetic Face Generation

Title: GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors

Title: MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers

Title: GraphTreeGen: Subtree-Centric Approach to Efficient and Supervised Graph Generation

Title: NEURAL: Attention-Guided Pruning for Unified Multimodal Resource-Constrained Clinical Evaluation

Title: Generative Modeling with Multi-Instance Reward Learning for E-commerce Creative Optimization

Title: Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection

Title: Provable In-Context Vector Arithmetic via Retrieving Task Concepts

Title: KonfAI: A Modular and Fully Configurable Framework for Deep Learning in Medical Imaging

Title: Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Title: Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance

Title: PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts

Title: HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics

Title: Modern Neural Networks for Small Tabular Datasets: The New Default for Field-Scale Digital Soil Mapping?

Title: Rare anomalies require large datasets: About proving the existence of anomalies

Title: SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection

Title: Prototype-Guided Diffusion: Visual Conditioning without External Memory

Title: Quo Vadis Handwritten Text Generation for Handwritten Text Recognition?

Title: AST-n: A Fast Sampling Approach for Low-Dose CT Reconstruction using Diffusion Models

Title: Stable Diffusion Models are Secretly Good at Visual In-Context Learning

Title: MOC: Meta-Optimized Classifier for Few-Shot Whole Slide Image Classification

Title: Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Title: PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image

Title: A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation

Title: Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Title: Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation