2024-11-28

Title: UVCG: Leveraging Temporal Consistency for Universal Video Protection

Title: MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding

Title: Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation

Title: DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams

Title: MVBoost: Boost 3D Reconstruction with Multi-View Refinement

Title: Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space

Title: DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching

Title: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Title: Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors

Title: Signs as Tokens: An Autoregressive Multilingual Sign Language Generator

Title: From memorization to generalization: a theoretical framework for diffusion-based generative models

Title: Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation

Title: SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation

Title: Generative Image Layer Decomposition with Visual Effects

Title: Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey

Title: ROICtrl: Boosting Instance Control for Visual Generation

Title: Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

Title: Vision Mamba Distillation for Low-resolution Fine-grained Image Classification

Title: Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

Title: HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation

Title: PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion

Title: Training Data Synthesis with Difficulty Controlled Diffusion Model

Title: When Large Vision-Language Models Meet Person Re-Identification

Title: ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts

Title: Type-R: Automatically Retouching Typos for Text-to-Image Generation

Title: DistinctAD: Distinctive Audio Description Generation in Contexts

Title: Semantic Edge Computing and Semantic Communications in 6G Networks: A Unifying Survey and Research Challenges

Title: SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation

Title: Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management

Title: TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution

Title: MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation

Title: HiFiVFS: High Fidelity Video Face Swapping

Title: Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation

Title: InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

Title: MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement

Title: TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models

Title: Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models

Title: Adaptive Blind All-in-One Image Restoration

Title: Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation

Title: Advancements in Myocardial Infarction Detection and Classification Using Wearable Devices: A Comprehensive Review

Title: Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification

Title: Complexity Experts are Task-Discriminative Learners for Any Image Restoration

Title: GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Title: Enhancing weed detection performance by means of GenAI-based image augmentation

Title: PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image

Title: FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Title: Hierarchical Information Flow for Generalized Efficient Image Restoration

Title: CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Title: Diffusion Self-Distillation for Zero-Shot Customized Image Generation