2025-07-09

Title: Structured Captions Improve Prompt Adherence in Text-to-Image Models (Re-LAION-Caption 19M)

Title: CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection

Title: Enhancing Underwater Images Using Deep Learning with Subjective Image Quality Integration

Title: Neural-Driven Image Editing

Title: Motion Generation: A Survey of Generative Approaches and Benchmarks

Title: Navigating Sparse Molecular Data with Stein Diffusion Guidance

Title: Cloud Diffusion Part 1: Theory and Motivation

Title: LoomNet: Enhancing Multi-View Image Generation via Latent Space Weaving

Title: Simulating Refractive Distortions and Weather-Induced Artifacts for Resource-Constrained Autonomous Perception

Title: ReLayout: Integrating Relation Reasoning for Content-aware Layout Generation with Multi-modal Large Language Models

Title: Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization

Title: Semi-Supervised Defect Detection via Conditional Diffusion and CLIP-Guided Noise Filtering

Title: Rethinking Layered Graphic Design Generation with a Top-Down Approach

Title: Kernel Density Steering: Inference-Time Scaling via Mode Seeking for Image Restoration

Title: Generative Head-Mounted Camera Captures for Photorealistic Avatars

Title: AdaptaGen: Domain-Specific Image Generation through Hierarchical Semantic Optimization Framework

Title: Graph Learning

Title: MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

Title: LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion

Title: DreamArt: Generating Interactable Articulated Objects from a Single Image

Title: SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning

Title: DREAM: Document Reconstruction via End-to-end Autoregressive Model

Title: Towards Solar Altitude Guided Scene Illumination

Title: 2D Instance Editing in 3D Space

Title: USIGAN: Unbalanced Self-Information Feature Transport for Weakly Paired Image IHC Virtual Staining

Title: Diffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data

Title: Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation

Title: Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval

Title: MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding

Title: ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models

Title: Omni-Video: Democratizing Unified Video Understanding and Generation

Title: OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion