2025-07-29

Title: Beyond 9-to-5: A Generative Model for Augmenting Mobility Data of Underrepresented Shift Workers

Title: Enhancing Spatiotemporal Networks with xLSTM: A Scalar LSTM Approach for Cellular Traffic Forecasting

Title: Language Models for Controllable DNA Sequence Design

Title: Kolmogorov Arnold Network Autoencoder in Medicine

Title: Efficient and Scalable Agentic AI with Heterogeneous Systems

Title: SynPAIN: A Synthetic Dataset of Pain and Non-Pain Facial Expressions

Title: Salsa as a Nonverbal Embodied Language -- The CoMPAS3D Dataset and Benchmarks

Title: Disjoint Generative Models

Title: Bias Analysis for Synthetic Face Detection: A Case Study of the Impact of Facial Attribute

Title: Beyond Nearest Neighbors: Semantic Compression and Graph-Augmented Retrieval for Enhanced Vector Search

Title: MoFRR: Mixture of Diffusion Models for Face Retouching Restoration

Title: Large Language Model Agent for Structural Drawing Generation Using ReAct Prompt Engineering and Retrieval Augmented Generation

Title: JDATT: A Joint Distillation Framework for Atmospheric Turbulence Mitigation and Target Detection

Title: DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation

Title: ForCenNet: Foreground-Centric Network for Document Image Rectification

Title: SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models

Title: VAE-GAN Based Price Manipulation in Coordinated Local Energy Markets

Title: FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing

Title: OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration

Title: All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior

Title: A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction

Title: HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly

Title: MambaVesselNet++: A Hybrid CNN-Mamba Architecture for Medical Image Segmentation

Title: LLMControl: Grounded Control of Text-to-Image Diffusion-based Synthesis with Multimodal LLMs

Title: SCALAR: Scale-wise Controllable Visual Autoregressive Learning

Title: FROSS: Faster-than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images

Title: PERRY: Policy Evaluation with Confidence Intervals using Auxiliary Data

Title: The Devil is in the EOS: Sequence Training for Detailed Image Captioning

Title: KB-DMGen: Knowledge-Based Global Guidance and Dynamic Pose Masking for Human Image Generation

Title: Local Prompt Adaptation for Style-Consistent Multi-Object Generation in Diffusion Models

Title: Generative molecule evolution using 3D pharmacophore for efficient Structure-Based Drug Design

Title: AnimeColor: Reference-based Animation Colorization with Diffusion Transformers

Title: Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning

Title: PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks

Title: LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks

Title: Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models

Title: Protein-SE(3): Benchmarking SE(3)-based Generative Models for Protein Structure Design

Title: Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training

Title: Generative Pre-training for Subjective Tasks: A Diffusion Transformer-Based Framework for Facial Beauty Prediction

Title: MagicAnime: A Hierarchically Annotated, Multimodal and Multitasking Dataset with Benchmarks for Cartoon Animation Generation

Title: WBHT: A Generative Attention Architecture for Detecting Black Hole Anomalies in Backbone Networks

Title: VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving

Title: BioNeuralNet: A Graph Neural Network based Multi-Omics Network Data Analysis Tool

Title: Frequency-Aware Autoregressive Modeling for Efficient High-Resolution Image Synthesis

Title: GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections

Title: Kernel Learning for Sample Constrained Black-Box Optimization

Title: T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

Title: Annotation-Free Human Sketch Quality Assessment

Title: Reminiscence Attack on Residuals: Exploiting Approximate Machine Unlearning for Privacy

Title: AV-Deepfake1M++: A Large-Scale Audio-Visual Deepfake Benchmark with Real-World Perturbations

Title: Harnessing Diffusion-Yielded Score Priors for Image Restoration

Title: PhaseNAS: Language-Model Driven Architecture Search with Dynamic Phase Adaptation

Title: Deep Generative Models of Evolution: SNP-level Population Adaptation by Genomic Linkage Incorporation

Title: Learning Only with Images: Visual Reinforcement Learning with Reasoning, Rendering, and Visual Feedback

Title: FantasyID: A dataset for detecting digital manipulations of ID-documents

Title: First Hallucination Tokens Are Different from Conditional Ones

Title: Towards Explainable Deep Clustering for Time Series Data

Title: Compositional Video Synthesis by Temporal Object-Centric Learning

Title: Exploring text-to-image generation for historical document image retrieval

Title: Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation

Title: PROVCREATOR: Synthesizing Complex Heterogenous Graphs with Node and Edge Attributes

Title: Model-Agnostic Gender Bias Control for Text-to-Image Generation via Sparse Autoencoder

Title: Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision

Title: JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1

Title: LoRA-PAR: A Flexible Dual-System LoRA Partitioning Approach to Efficient LLM Fine-Tuning

Title: Flow Matching Policy Gradients