2024-12-05

Title: DYffCast: Regional Precipitation Nowcasting Using IMERG Satellite Data. A case study over South America

Title: WxC-Bench: A Novel Dataset for Weather and Climate Downstream Tasks

Title: Temporally Consistent Dynamic Scene Graphs: An End-to-End Approach for Action Tracklet Generation

Title: FLAME 3 Dataset: Unleashing the Power of Radiometric Thermal UAV Imagery for Wildfire Management

Title: GUESS: Generative Uncertainty Ensemble for Self Supervision

Title: Panoptic Diffusion Models: co-generation of images and segmentation maps

Title: Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution

Title: Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference

Title: MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting

Title: Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Title: EchoONE: Segmenting Multiple echocardiography Planes in One Model

Title: CLAS: A Machine Learning Enhanced Framework for Exploring Large 3D Design Datasets

Title: AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?

Title: Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Title: Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach

Title: UTSD: Unified Time Series Diffusion Model

Title: TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

Title: Mimir: Improving Video Diffusion Models for Precise Text Understanding

Title: MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Title: Few-Shot Learning with Adaptive Weight Masking in Conditional GANs

Title: Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting

Title: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis

Title: PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

Title: Parametric Enhancement of PerceptNet: A Human-Inspired Approach for Image Quality Assessment

Title: Semi-Supervised Transfer Boosting (SS-TrBoosting)

Title: Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges

Title: MaterialPicker: Multi-Modal Material Generation with Diffusion Transformers

Title: Task-driven Image Fusion with Learnable Fusion Loss

Title: DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Title: GERD: Geometric event response data generation

Title: RFSR: Improving ISR Diffusion Models via Reward Feedback Learning

Title: Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis

Title: DIVE: Taming DINO for Subject-Driven Video Editing

Title: Fairer Analysis and Demographically Balanced Face Generation for Fairer Face Verification

Title: TASR: Timestep-Aware Diffusion Model for Image Super-Resolution

Title: Mapping using Transformers for Volumes -- Network for Super-Resolution with Long-Range Interactions

Title: Implicit Priors Editing in Stable Diffusion via Targeted Token Adjustment

Title: Skel3D: Skeleton Guided Novel View Synthesis

Title: PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Title: SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model

Title: Pre-trained Multiple Latent Variable Generative Models are good defenders against Adversarial Attacks

Title: Urban4D: Semantic-Guided 4D Gaussian Splatting for Urban Scene Reconstruction

Title: Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective

Title: Distillation of Diffusion Features for Semantic Correspondence

Title: NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Title: Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention

Title: NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model

Title: Imagine360: Immersive 360 Video Generation from Perspective Anchor

Title: PaliGemma 2: A Family of Versatile VLMs for Transfer

Title: MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Title: FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes

Title: Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis

Title: Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation

Title: Navigation World Models