2024-12-20

Title: Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing

Title: Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data

Title: PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation

Title: TRecViT: A Recurrent Video Transformer

Title: Personalized Generative Low-light Image Denoising and Enhancement

Title: Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters

Title: A Unifying Information-theoretic Perspective on Evaluating Generative Models

Title: Surrealistic-like Image Generation with Vision-Language Models

Title: ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling

Title: Enhancing Diffusion Models for High-Quality Image Generation

Title: FedPIA -- Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning

Title: IntroStyle: Training-Free Introspective Style Attribution using Diffusion Features

Title: GenHMR: Generative Human Mesh Recovery

Title: CLDG: Contrastive Learning on Dynamic Graphs

Title: Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

Title: LEDiff: Latent Exposure Diffusion for HDR Generation

Title: From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research

Title: Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion

Title: LiftRefine: Progressively Refined View Synthesis from 3D Lifting with Volume-Triplane Representations

Title: DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On

Title: Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles

Title: Content-style disentangled representation for controllable artistic image stylization and generation

Title: Guided Diffusion Model for Sensor Data Obfuscation

Title: Efficient Self-Supervised Video Hashing with Selective State Spaces

Title: Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

Title: Consistent Human Image and Video Generation with Spatially Conditioned Diffusion

Title: ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting

Title: Downscaling Precipitation with Bias-informed Conditional Diffusion Model

Title: Global Spatio-Temporal Fusion-based Traffic Prediction Algorithm with Anomaly Aware

Title: DiffSim: Taming Diffusion Models for Evaluating Visual Similarity

Title: Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties

Title: LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining

Title: Qua$^2$SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models

Title: Robust PCA Based on Adaptive Weighted Least Squares and Low-Rank Matrix Factorization

Title: Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model

Title: Event-assisted 12-stop HDR Imaging of Dynamic Scene

Title: EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space

Title: Generative AI for Banks: Benchmarks and Algorithms for Synthetic Financial Transaction Data

Title: Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

Title: Explainable Tampered Text Detection via Multimodal Large Models

Title: DS$^2$-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis

Title: Zero-Shot Artifact2Artifact: Self-incentive artifact removal for photoacoustic imaging without any data

Title: Diffusion priors for Bayesian 3D reconstruction from incomplete measurements

Title: MagicNaming: Consistent Identity Generation by Finding a "Name Space" in T2I Diffusion Models

Title: Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation

Title: DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space

Title: Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion

Title: MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance

Title: Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation

Title: Jet: A Modern Transformer-Based Normalizing Flow

Title: Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Title: OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Title: Tiled Diffusion

Title: LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Title: AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Title: DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation

Title: LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Title: Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation

Title: Scaling 4D Representations

Title: Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Title: LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis