2024-12-02

Title: OOD-HOI: Text-Driven 3D Whole-Body Human-Object Interactions Generation Beyond Training Domains

Title: HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior

Title: Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

Title: SpotLight: Shadow-Guided Object Relighting via Diffusion

Title: Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting

Title: Towards Chunk-Wise Generation for Long Videos

Title: SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality

Title: AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Title: Active Data Curation Effectively Distills Large-Scale Multimodal Models

Title: MatchDiffusion: Training-free Generation of Match-cuts

Title: Random Walks with Tweedie: A Unified Framework for Diffusion Models

Title: Generative Visual Communication in the Era of Vision-Language Models

Title: Foundation Models in Radiology: What, How, When, Why and Why Not

Title: DiffMVR: Diffusion-based Automated Multi-Guidance Video Restoration

Title: Lifting Motion to the 3D World via 2D Diffusion

Title: Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds

Title: FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution

Title: EzSQL: An SQL intermediate representation for improving SQL-to-text Generation

Title: Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough?

Title: VIPaint: Image Inpainting with Pre-Trained Diffusion Models via Variational Inference

Title: Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects

Title: ICLERB: In-Context Learning Embedding and Reranker Benchmark

Title: Random Sampling for Diffusion-based Adversarial Purification

Title: Perception of Visual Content: Differences Between Humans and Foundation Models

Title: SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing

Title: Locally-Focused Face Representation for Sketch-to-Image Generation Using Noise-Induced Refinement

Title: PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors

Title: 3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes

Title: I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting

Title: LADDER: Multi-objective Backdoor Attack via Evolutionary Algorithm

Title: ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos

Title: Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Title: Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models

Title: MSG score: A Comprehensive Evaluation for Multi-Scene Video Generation

Title: SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation

Title: Video Depth without Video Models

Title: Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection

Title: Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG

Title: Z-STAR+: A Zero-shot Style Transfer Method via Adjusting Style Distribution

Title: Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes

Title: SmartLLMSentry: A Comprehensive LLM Based Smart Contract Vulnerability Detection Framework

Title: Face2QR: A Unified Framework for Aesthetic, Face-Preserving, and Scannable QR Code Generation

Title: Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention

Title: GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model

Title: Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation

Title: Trajectory Attention for Fine-grained Video Motion Control

Title: Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

Title: Towards a Mechanistic Explanation of Diffusion Model Generalization

Title: CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections

Title: DENIAHL: In-Context Features Influence LLM Needle-In-A-Haystack Abilities

Title: Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints

Title: DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models

Title: AMO Sampler: Enhancing Text Rendering with Overshooting

Title: Any-Resolution AI-Generated Image Detection by Spectral Learning

Title: Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning

Title: Effective Fine-Tuning of Vision-Language Models for Accurate Galaxy Morphology Analysis

Title: Graph-Enhanced EEG Foundation Model

Title: Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Title: DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

Title: RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation

Title: QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain

Title: Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook

Title: KV Shifting Attention Enhances Language Modeling

Title: In-Context Learning with Noisy Labels

Title: LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification

Title: Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing

Title: TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting

Title: MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications

Title: JetFormer: An Autoregressive Generative Model of Raw Images and Text

Title: Real-Time Anomaly Detection in Video Streams

Title: A Note on Small Percolating Sets on Hypercubes via Generative AI

Title: HVAC-DPT: A Decision Pretrained Transformer for HVAC Control

Title: Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models

Title: Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy

Title: MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks

Title: INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Title: DeMo: Decoupled Momentum Optimization

Title: Open source Differentiable ODE Solving Infrastructure

Title: FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation

Title: $C^{3}$-NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields