2024-12-12

Title: Mogo: RQ Hierarchical Causal Transformer for High-Quality 3D Human Motion Generation

Title: Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor

Title: Boosting Alignment for Post-Unlearning Text-to-Image Generative Models

Title: Pix2Poly: A Sequence Prediction Method for End-to-end Polygonal Building Footprint Extraction from Remote Sensing Imagery

Title: Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning

Title: Non-Normal Diffusion Models

Title: Phase-aware Training Schedule Simplifies Learning in Flow-Based Generative Models

Title: MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents

Title: NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods

Title: DynamicPAE: Generating Scene-Aware Physical Adversarial Examples in Real-Time

Title: Federated In-Context LLM Agent Learning

Title: Statistical Downscaling via High-Dimensional Distribution Matching with Generative Models

Title: Generative Zoo

Title: Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models

Title: Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models

Title: DiffRaman: A Conditional Latent Denoising Diffusion Probabilistic Model for Bacterial Raman Spectroscopy Identification Under Limited Data Conditions

Title: AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting

Title: Analyzing and Improving Model Collapse in Rectified Flow Models

Title: GN-FR: Generalizable Neural Radiance Fields for Flare Removal

Title: Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming

Title: Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations

Title: Video Summarization using Denoising Diffusion Probabilistic Model

Title: Adversarial Purification by Consistency-aware Latent Space Optimization on Data Manifolds

Title: Physical Informed Driving World Model

Title: Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views

Title: Federated Learning for Traffic Flow Prediction with Synthetic Data Augmentation

Title: CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis

Title: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

Title: InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models

Title: Learning Flow Fields in Attention for Controllable Person Image Generation

Title: StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

Title: Watermarking Training Data of Music Generation Models

Title: Can We Generate Visual Programs Without Prompting LLMs?

Title: GenPlan: Generative sequence models as adaptive planners

Title: TryOffAnyone: Tiled Cloth Generation from a Dressed Person

Title: LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Title: Fair Primal Dual Splitting Method for Image Inverse Problems

Title: GPD-1: Generative Pre-training for Driving

Title: ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation