2025-04-22

Title: Generative System Dynamics in Recurrent Neural Networks

Title: Multiscale Tensor Summation Factorization as a New Neural Network Layer (MTS Layer) for Multidimensional Data Processing

Title: Entropy Rectifying Guidance for Diffusion and Flow Models

Title: Deep Learning on Graphs for Mobile Network Topology Generation

Title: Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation

Title: A synthetic dataset of French electric load curves with temperature conditioning

Title: Personalizing Exposure Therapy via Reinforcement Learning

Title: Point-Driven Interactive Text and Image Layer Editing Using Diffusion Models

Title: BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution

Title: Transforming hyperspectral images into chemical maps: A new deep learning based approach to hyperspectral image processing

Title: Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach

Title: Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis

Title: Towards Explainable Fake Image Detection with Multi-Modal Large Language Models

Title: Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation

Title: ColorVein: Colorful Cancelable Vein Biometrics

Title: Cross-attention for State-based model RWKV-7

Title: Generative emulation of chaotic dynamics with coherent prior

Title: Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

Title: Towards NSFW-Free Text-to-Image Generation via Safety-Constraint Direct Preference Optimization

Title: Learning and Generating Diverse Residential Load Patterns Using GAN with Weakly-Supervised Training and Weight Selection

Title: Manipulating Multimodal Agents via Cross-Modal Prompt Injection

Title: Improving RL Exploration for LLM Reasoning through Retrospective Replay

Title: Do You Really Need Public Data? Surrogate Public Data for Differential Privacy on Tabular Data

Title: SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Title: ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations

Title: Causal Disentanglement for Robust Long-tail Medical Image Generation

Title: LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation

Title: Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Title: STARS: Sparse Learning Correlation Filter with Spatio-temporal Regularization and Super-resolution Reconstruction for Thermal Infrared Target Tracking

Title: Less is More: Adaptive Coverage for Synthetic Training Data

Title: FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models

Title: VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control

Title: NTIRE 2025 Challenge on Image Super-Resolution ($\times$4): Methods and Results

Title: Using street view imagery and deep generative modeling for estimating the health of urban forests

Title: Generative Auto-Bidding with Value-Guided Explorations

Title: NTIRE 2025 Challenge on Real-World Face Restoration: Methods and Results

Title: AlphaZero-Edu: Making AlphaZero Accessible to Everyone

Title: Relation-R1: Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relational Comprehension

Title: LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

Title: Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Title: SuperCL: Superpixel Guided Contrastive Learning for Medical Image Segmentation Pre-training

Title: Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model

Title: When Cloud Removal Meets Diffusion Model in Remote Sensing

Title: Enhanced Data-driven Topology Design Methodology with Multi-level Mesh and Correlation-based Mutation for Stress-related Multi-objective Optimization

Title: Edge-boosted graph learning for functional brain connectivity analysis

Title: Verifying Robust Unlearning: Probing Residual Knowledge in Unlearned Models

Title: What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale

Title: Distribution-aware Dataset Distillation for Efficient Image Restoration

Title: Twin Co-Adaptive Dialogue for Progressive Image Generation

Title: Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection

Title: Latent Bayesian Optimization via Autoregressive Normalizing Flows

Title: Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Title: POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications

Title: TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models

Title: Efficient Document Retrieval with G-Retriever

Title: Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization

Title: NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study

Title: Insert Anything: Image Insertion via In-Context Editing in DiT

Title: Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models

Title: DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation

Title: VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation

Title: Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN

Title: Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration

Title: DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution

Title: FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

Title: Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform

Title: Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation

Title: StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians