2024-12-05

Title: DYffCast: Regional Precipitation Nowcasting Using IMERG Satellite Data. A case study over South America

Title: Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications

Title: Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning

Title: Grayscale to Hyperspectral at Any Resolution Using a Phase-Only Lens

Title: Minimization of Boolean Complexity in In-Context Concept Learning

Title: Effortless Efficiency: Low-Cost Pruning of Diffusion Models

Title: MAGMA: Manifold Regularization for MAEs

Title: GUESS: Generative Uncertainty Ensemble for Self Supervision

Title: Panoptic Diffusion Models: co-generation of images and segmentation maps

Title: Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution

Title: Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference

Title: CLAS: A Machine Learning Enhanced Framework for Exploring Large 3D Design Datasets

Title: AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?

Title: Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Title: Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach

Title: Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection

Title: UTSD: Unified Time Series Diffusion Model

Title: Analytic Study of Text-Free Speech Synthesis for Raw Audio using a Self-Supervised Learning Model

Title: Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Title: Mimir: Improving Video Diffusion Models for Precise Text Understanding

Title: MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Title: Few-Shot Learning with Adaptive Weight Masking in Conditional GANs

Title: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis

Title: PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

Title: Beyond [cls]: Exploring the true potential of Masked Image Modeling representations

Title: MaterialPicker: Multi-Modal Material Generation with Diffusion Transformers

Title: DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Title: RFSR: Improving ISR Diffusion Models via Reward Feedback Learning

Title: Intent-driven In-context Learning for Few-shot Dialogue State Tracking

Title: AntLM: Bridging Causal and Masked Language Models

Title: Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models

Title: Equivariant Representation Learning for Augmentation-based Self-Supervised Learning via Image Reconstruction

Title: Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis

Title: UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection

Title: DIVE: Taming DINO for Subject-Driven Video Editing

Title: Fairer Analysis and Demographically Balanced Face Generation for Fairer Face Verification

Title: TASR: Timestep-Aware Diffusion Model for Image Super-Resolution

Title: Implicit Priors Editing in Stable Diffusion via Targeted Token Adjustment

Title: Skel3D: Skeleton Guided Novel View Synthesis

Title: Assessing Foundation Models' Transferability to Physiological Signals in Precision Medicine

Title: SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model

Title: CleanDIFT: Diffusion Features without Noise

Title: State Frequency Estimation for Anomaly Detection

Title: Pre-trained Multiple Latent Variable Generative Models are good defenders against Adversarial Attacks

Title: Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective

Title: Distillation of Diffusion Features for Semantic Correspondence

Title: Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

Title: NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Title: Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention

Title: NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model

Title: MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Title: FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes

Title: Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis

Title: Navigation World Models