2025-04-16

Title: GPT Meets Graphs and KAN Splines: Testing Novel Frameworks on Multitask Fine-Tuned GPT-2 with LoRA

Title: Beyond the Generative Learning Trilemma: Generative Model Assessment in Data Scarcity Domains

Title: VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification

Title: Enhancing Image Restoration through Learning Context-Rich and Detail-Accurate Features

Title: H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models

Title: Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling

Title: Relation-Rich Visual Document Generator for Visual Information Extraction

Title: H-MoRe: Learning Human-centric Motion Representation for Action Analysis

Title: The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Title: SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models

Title: ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models

Title: Power-scaled Bayesian Inference with Score-based Generative mModels

Title: IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism

Title: OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding

Title: LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation

Title: Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task

Title: Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models

Title: PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving

Title: InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Title: An Efficient and Mixed Heterogeneous Model for Image Restoration

Title: AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images

Title: ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings

Title: Leveraging Vertical Public-Private Split for Improved Synthetic Data Generation

Title: AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era

Title: Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation

Title: Defending Against Frequency-Based Attacks with Diffusion Models

Title: UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques

Title: Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting

Title: Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models

Title: Taming Consistency Distillation for Accelerated Human Image Animation

Title: TerraMind: Large-Scale Generative Multimodality for Earth Observation

Title: Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance

Title: Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution

Title: UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Title: Autoregressive Distillation of Diffusion Transformers

Title: Looking beyond the next token

Title: Seedream 3.0 Technical Report

Title: DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation

Title: Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model

Title: ADT: Tuning Diffusion Models with Adversarial Supervision

Title: Elucidating the Design Space of Multimodal Protein Language Models

Title: SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Title: Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception