2024-12-20

Title: Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing

Title: PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation

Title: Personalized Generative Low-light Image Denoising and Enhancement

Title: Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters

Title: A Unifying Information-theoretic Perspective on Evaluating Generative Models

Title: Surrealistic-like Image Generation with Vision-Language Models

Title: Enhancing Diffusion Models for High-Quality Image Generation

Title: IntroStyle: Training-Free Introspective Style Attribution using Diffusion Features

Title: GenHMR: Generative Human Mesh Recovery

Title: Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

Title: LEDiff: Latent Exposure Diffusion for HDR Generation

Title: LiftRefine: Progressively Refined View Synthesis from 3D Lifting with Volume-Triplane Representations

Title: DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On

Title: DirectorLLM for Human-Centric Video Generation

Title: Content-style disentangled representation for controllable artistic image stylization and generation

Title: Consistent Human Image and Video Generation with Spatially Conditioned Diffusion

Title: DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching

Title: Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images

Title: ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model

Title: DiffSim: Taming Diffusion Models for Evaluating Visual Similarity

Title: Successive optimization of optics and post-processing with differentiable coherent PSF operator and field information

Title: Qua$^2$SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models

Title: Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model

Title: LoLaFL: Low-Latency Federated Learning via Forward-only Propagation

Title: MUSTER: Longitudinal Deformable Registration by Composition of Consecutive Deformations

Title: FiVL: A Framework for Improved Vision-Language Alignment

Title: EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space

Title: Generative AI for Banks: Benchmarks and Algorithms for Synthetic Financial Transaction Data

Title: Extending TWIG: Zero-Shot Predictive Hyperparameter Selection for KGEs based on Graph Structure

Title: Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

Title: MagicNaming: Consistent Identity Generation by Finding a "Name Space" in T2I Diffusion Models

Title: Movie2Story: A framework for understanding videos and telling stories in the form of novel text

Title: Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations

Title: DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space

Title: Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion

Title: Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation

Title: Parallelized Autoregressive Visual Generation

Title: Jet: A Modern Transformer-Based Normalizing Flow

Title: Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Title: OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Title: Rethinking Uncertainty Estimation in Natural Language Generation

Title: Tiled Diffusion

Title: AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Title: DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation

Title: FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching

Title: Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation

Title: Flowing from Words to Pixels: A Framework for Cross-Modality Evolution