2024-12-13

Title: In-Context Learning with Topological Information for Knowledge Graph Completion

Title: Generative Modeling with Explicit Memory

Title: Large Concept Models: Language Modeling in a Sentence Representation Space

Title: Inference-Time Diffusion Model Distillation

Title: Federated Foundation Models on Heterogeneous Time Series

Title: Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression

Title: Mojito: Motion Trajectory and Intensity Control for Video Generation

Title: Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation

Title: Align, Generate, Learn: A Novel Closed-Loop Framework for Cross-Lingual In-Context Learning

Title: Deep Learning Model Security: Threats and Defenses

Title: Reasoning-Aware Query-Focused Summarization over Multi-Table Data

Title: Elevating Flow-Guided Video Inpainting with Reference Generation

Title: MS2Mesh-XR: Multi-modal Sketch-to-Mesh Generation in XR Environments

Title: Arbitrary-steps Image Super-resolution via Diffusion Inversion

Title: Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Title: Dialogue Language Model with Large-Scale Persona Data Engineering

Title: Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning

Title: Go With the Flow: Fast Diffusion for Gaussian Mixture Models

Title: An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques

Title: Cross-View Completion Models are Zero-shot Correspondence Estimators

Title: DomCLP: Domain-wise Contrastive Learning with Prototype Mixup for Unsupervised Domain Generalization

Title: ResFlow: Fine-tuning Residual Optical Flow for Event-based High Temporal Resolution Motion Estimation

Title: LVMark: Robust Watermark for latent video diffusion models

Title: Pinpoint Counterfactuals: Reducing social bias in foundation models via localized counterfactual generation

Title: When Text Embedding Meets Large Language Model: A Comprehensive Survey

Title: DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization

Title: RAD: Region-Aware Diffusion Models for Image Inpainting

Title: ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring

Title: CleanComedy: Creating Friendly Humor through Generative Techniques

Title: Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering

Title: Make Satire Boring Again: Reducing Stylistic Bias of Satirical Corpus by Utilizing Generative LLMs

Title: LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync

Title: Transfer Learning of RSSI to Improve Indoor Localisation Performance

Title: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression

Title: Are Conditional Latent Diffusion Models Effective for Image Restoration?

Title: Auto-Regressive Moving Diffusion Models for Time Series Forecasting

Title: MaskTerial: A Foundation Model for Automated 2D Material Flake Detection

Title: DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Title: Causal Graphical Models for Vision-Language Compositional Understanding

Title: Diffusion Model with Representation Alignment for Protein Inverse Folding

Title: UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer

Title: Towards Robust and Fair Vision Learning in Open-World Environments

Title: The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective

Title: OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs

Title: Capturing the Temporal Dependence of Training Data Influence

Title: SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing

Title: Video Creation by Demonstration

Title: JuStRank: Benchmarking LLM Judges for System Ranking

Title: Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Title: InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Title: LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Title: Feat2GS: Probing Visual Foundation Models with Gaussian Splatting

Title: Olympus: A Universal Task Router for Computer Vision Tasks

Title: Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Title: EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Title: SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Title: LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Title: OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation

Title: GenEx: Generating an Explorable World

Title: Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors

Title: FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion