2024-12-06

Title: HunyuanVideo: A Systematic Framework For Large Video Generative Models

Title: MV-Adapter: Multi-view Consistent Image Generation Made Easy

Title: A Water Efficiency Dataset for African Data Centers

Title: HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution

Title: Multi-view Image Diffusion via Coordinate Noise and Fourier Attention

Title: Advancing Auto-Regressive Continuation for Video Frames

Title: Coordinate In and Value Out: Training Flow Transformers in Ambient Space

Title: Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration

Title: A large language model-type architecture for high-dimensional molecular potential energy surfaces

Title: LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model

Title: Automated LaTeX Code Generation from Handwritten Math Expressions Using Vision Transformer

Title: CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation

Title: Safeguarding Text-to-Image Generation via Inference-Time Prompt-Noise Optimization

Title: DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism

Title: A Noise is Worth Diffusion Guidance

Title: Multi-View Pose-Agnostic Change Localization with Zero Labels

Title: Privacy-Preserving in Medical Image Analysis: A Review of Methods and Applications

Title: InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models

Title: AIpparel: A Large Multimodal Generative Model for Digital Garments

Title: A Framework For Image Synthesis Using Supervised Contrastive Learning

Title: Local Curvature Smoothing with Stein's Identity for Efficient Score Matching

Title: Blind Underwater Image Restoration using Co-Operational Regressor Networks

Title: IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation

Title: PriorMotion: Generative Class-Agnostic Motion Prediction with Raster-Vector Motion Field Priors

Title: INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations

Title: ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

Title: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning

Title: BodyMetric: Evaluating the Realism of HumanBodies in Text-to-Image Generation

Title: LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents

Title: MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Title: Deep priors for satellite image restoration with accurate uncertainties

Title: Compositional Generative Multiphysics and Multi-component Simulation

Title: Understanding Memorization in Generative Models via Sharpness in Probability Landscapes

Title: AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models

Title: An In-Depth Examination of Risk Assessment in Multi-Class Classification Algorithms

Title: Instructional Video Generation

Title: Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image

Title: VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction

Title: LMDM:Latent Molecular Diffusion Model For 3D Molecule Generation

Title: SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction

Title: SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model

Title: T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts

Title: LocalSR: Image Super-Resolution in Local Region

Title: Liquid: Language Models are Scalable Multi-modal Generators

Title: RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse

Title: Discriminative Fine-tuning of LVLMs

Title: Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Title: Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Title: Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Title: GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Title: DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models

Title: MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Title: Four-Plane Factorized Video Autoencoders

Title: HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery

Title: LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors

Title: Turbo3D: Ultra-fast Text-to-3D Generation

Title: PaintScene4D: Consistent 4D Scene Generation from Text Prompts