2024-11-26

Title: Adaptively Controllable Diffusion Model for Efficient Conditional Image Generation

Title: Self-Supervised Conditional Distribution Learning on Graphs

Title: Quantized symbolic time series approximation

Title: S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning

Title: Sampling with Adaptive Variance for Multimodal Distributions

Title: IterIS: Iterative Inference-Solving Alignment for LoRA Merging

Title: BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models

Title: Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps

Title: Faithful Label-free Knowledge Distillation

Title: Is Attention All You Need For Actigraphy? Foundation Models of Wearable Accelerometer Data for Mental Health Research

Title: Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward

Title: TPLogAD: Unsupervised Log Anomaly Detection Based on Event Templates and Key Parameters

Title: LocRef-Diffusion: Tuning-Free Layout and Appearance-Guided Generation

Title: VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Title: MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation

Title: Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI

Title: EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registration

Title: Foundation Cures Personalization: Recovering Facial Personalized Models' Prompt Consistency

Title: There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks

Title: PPLqa: An Unsupervised Information-Theoretic Quality Metric for Comparing Generative Large Language Models

Title: Zero-Shot Coreset Selection: Efficient Pruning for Unlabeled Data

Title: Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage

Title: Gradient dynamics for low-rank fine-tuning beyond kernels

Title: From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set

Title: Gradient-Free Classifier Guidance for Diffusion Model Sampling

Title: LDM-Morph: Latent diffusion model guided deformable image registration

Title: ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance

Title: Twin Trigger Generative Networks for Backdoor Attacks against Object Detection

Title: Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning

Title: SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving

Title: Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark

Title: Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation

Title: AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation

Title: Interactive Visual Assessment for Text-to-Image Generation Models

Title: MUNBa: Machine Unlearning via Nash Bargaining

Title: Optical-Flow Guided Prompt Optimization for Coherent Video Generation

Title: NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation

Title: From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars

Title: TKG-DM: Training-free Chroma Key Content Generation Diffusion Model

Title: EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting

Title: FLD+: Data-efficient Evaluation Metric for Generative Models

Title: An adversarial feature learning based semantic communication method for Human 3D Reconstruction

Title: Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation

Title: Multi-label Sequential Sentence Classification via Large Language Model

Title: Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection

Title: Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Title: Can a Large Language Model Learn Matrix Functions In Context?

Title: Fixing the Perspective: A Critical Examination of Zero-1-to-3

Title: ROOT: VLM based System for Indoor Scene Understanding and Beyond

Title: AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Title: Beyond Data Scarcity: A Frequency-Driven Framework for Zero-Shot Forecasting

Title: ZeroGS: Training 3D Gaussian Splatting from Unposed Images

Title: Multi-Token Enhancing for Vision Representation Learning

Title: Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Title: Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching

Title: PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs

Title: Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation

Title: A Tunable Despeckling Neural Network Stabilized via Diffusion Equation

Title: Making Images from Images: Interleaving Denoising and Transformation

Title: Generative Context Distillation

Title: Improving Pre-Trained Self-Supervised Embeddings Through Effective Entropy Maximization

Title: Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors

Title: ROADS: Robust Prompt-driven Multi-Class Anomaly Detection under Domain Shift

Title: VICON: Vision In-Context Operator Networks for Multi-Physics Fluid Dynamics Prediction

Title: Geometry Distributions

Title: Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

Title: Boosting 3D Object Generation through PBR Materials

Title: AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity

Title: FUN-AD: Fully Unsupervised Learning for Anomaly Detection with Noisy Training Data

Title: Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain

Title: CIA: Controllable Image Augmentation Framework Based on Stable Diffusion

Title: DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders

Title: Graph Adapter of EEG Foundation Models for Parameter Efficient Fine Tuning

Title: MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

Title: Text-to-Image Synthesis: A Decade Survey

Title: BadSFL: Backdoor Attack against Scaffold Federated Learning

Title: Image Generation Diversity Issues and How to Tame Them

Title: U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields

Title: Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking

Title: Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation

Title: Learn from Foundation Model: Fruit Detection Model without Manual Annotation

Title: Interpreting Object-level Foundation Models via Visual Precision Search

Title: VIRES: Video Instance Repainting with Sketch and Text Guidance

Title: SMGDiff: Soccer Motion Generation using diffusion probabilistic models

Title: Weakly supervised image segmentation for defect-based grading of fresh produce

Title: DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings

Title: Transparent Neighborhood Approximation for Text Classifier Explanation

Title: BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Title: DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation

Title: An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models

Title: One Diffusion to Generate Them All

Title: Cluster-based human-in-the-loop strategy for improving machine learning-based circulating tumor cell detection in liquid biopsy

Title: Can AI grade your essays? A comparative analysis of large language models and teacher ratings in multidimensional essay scoring

Title: Towards Foundation Models for Critical Care Time Series

Title: Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines

Title: Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

Title: Human-Calibrated Automated Testing and Validation of Generative Language Models

Title: Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN

Title: Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks

Title: Unsupervised Event Outlier Detection in Continuous Time

Title: Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack

Title: Multi-Resolution Generative Modeling of Human Motion from Limited Data

Title: Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis

Title: LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation

Title: Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency

Title: Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation

Title: Transformers are Deep Optimizers: Provable In-Context Learning for Deep Model Training

Title: Representation Collapsing Problems in Vector Quantization

Title: Enhancing Few-Shot Learning with Integrated Data and GAN Model Approaches

Title: Rethinking Diffusion for Text-Driven Human Motion Generation

Title: Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification

Title: Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

Title: Exploring Discrete Flow Matching for 3D De Novo Molecule Generation

Title: Diffusion Features for Zero-Shot 6DoF Object Pose Estimation

Title: Generative Omnimatte: Learning to Decompose Video into Layers