2025-05-22

Title: DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance

Title: FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the Edge

Title: The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents

Title: Time Series Similarity Score Functions to Monitor and Interact with the Training and Denoising Process of a Time Series Diffusion Model applied to a Human Activity Recognition Dataset based on IMUs

Title: Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

Title: Large Language Models for Data Synthesis

Title: Leveraging Generative AI Models to Explore Human Identity

Title: A self-regulated convolutional neural network for classifying variable stars

Title: Programmatic Video Prediction Using Large Language Models

Title: STree: Speculative Tree Decoding for Hybrid State-Space Models

Title: Flattening Hierarchies with Policy Bootstrapping

Title: RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning

Title: Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories

Title: Khan-GCL: Kolmogorov-Arnold Network Based Graph Contrastive Learning with Hard Negatives

Title: BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

Title: CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation

Title: Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation

Title: Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation

Title: AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection

Title: MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models

Title: Intentional Gesture: Deliver Your Intentions with Gestures for Speech

Title: KernelOracle: Predicting the Linux Scheduler's Next Move with Deep Learning

Title: Multimodal Conditional Information Bottleneck for Generalizable AI-Generated Image Detection

Title: Continuous Representation Methods, Theories, and Applications: An Overview and Perspectives

Title: Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets

Title: gen2seg: Generative Models Enable Generalizable Instance Segmentation

Title: Scaling Diffusion Transformers Efficiently via $μ$P

Title: GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation

Title: BadSR: Stealthy Label Backdoor Attacks on Image Super-Resolution

Title: FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion

Title: My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping

Title: Bridging Sign and Spoken Languages: Pseudo Gloss Generation for Sign Language Translation

Title: FRN: Fractal-Based Recursive Spectral Reconstruction Network

Title: Comprehensive Evaluation and Analysis for NSFW Concept Erasure in Text-to-Image Diffusion Models

Title: NOMAD Projection

Title: PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting

Title: seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation

Title: Impact of Data Sparsity on Machine Learning for Fault Detection in Power System Protection

Title: Bridging the Domain Gap in Equation Distillation with Reinforcement Feedback

Title: Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks

Title: FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models

Title: Graph Conditional Flow Matching for Relational Data Generation

Title: RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction

Title: Constructing a 3D Town from a Single Image

Title: IA-T2I: Internet-Augmented Text-to-Image Generation

Title: VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Title: Interspatial Attention for Efficient 4D Human Video Generation

Title: Neural Conditional Transport Maps

Title: MMaDA: Multimodal Large Diffusion Language Models

Title: InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition