2024-09-17

Title: Neural Message Passing Induced by Energy-Constrained Diffusion

Title: PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage

Title: Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss

Title: Informative Subgraphs Aware Masked Auto-Encoder in Dynamic Graphs

Title: Language Models "Grok" to Copy

Title: Turbo your multi-modal classification with contrastive learning

Title: Matrix Profile for Anomaly Detection on Multidimensional Time Series

Title: ManiDext: Hand-Object Manipulation Synthesis via Continuous Correspondence Embeddings and Residual-Guided Diffusion

Title: Schr\"odinger Bridge Flow for Unpaired Data Translation

Title: Beta-Sigma VAE: Separating beta and decoder variance in Gaussian variational autoencoder

Title: Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology

Title: BM$^2$: Coupled Schr\"{o}dinger Bridge Matching

Title: Towards Diverse and Efficient Audio Captioning via Diffusion Models

Title: Real-world Adversarial Defense against Patch Attacks based on Diffusion Model

Title: Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval

Title: Detecting Looted Archaeological Sites from Satellite Image Time Series

Title: On the Generalizability of Foundation Models for Crop Type Mapping

Title: Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision

Title: Keeping Humans in the Loop: Human-Centered Automated Annotation with Generative AI

Title: Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM Empowerment

Title: COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare

Title: Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Title: DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Title: TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer

Title: HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models

Title: A Simple HMM with Self-Supervised Representations for Phone Segmentation

Title: Leveraging Open-Source Large Language Models for Native Language Identification

Title: EditBoard: Towards A Comprehensive Evaluation Benchmark for Text-based Video Editing Models

Title: E-Commerce Inpainting with Mask Guidance in Controlnet for Reducing Overcompletion

Title: GFlowNet Pretraining with Inexpensive Rewards

Title: AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs

Title: MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection

Title: OML-AD: Online Machine Learning for Anomaly Detection in Time Series Data

Title: Towards Multi-view Graph Anomaly Detection with Similarity-Guided Contrastive Clustering

Title: DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving

Title: Large Language Model Based Generative Error Correction: A Challenge and Baselines forSpeech Recognition, Speaker Tagging, and Emotion Recognition

Title: BEnDEM:A Boltzmann Sampler Based on Bootstrapped Denoising Energy Matching

Title: Abnormal Event Detection In Videos Using Deep Embedding

Title: PROSE-FD: A Multimodal PDE Foundation Model for Learning Multiple Operators for Forecasting Fluid Dynamics

Title: Latent Diffusion Models for Controllable RNA Sequence Generation

Title: Towards Kinetic Manipulation of the Latent Space

Title: Flexible Diffusion Scopes with Parameterized Laplacian for Heterophilic Graph Learning

Title: Estimating Wage Disparities Using Foundation Models

Title: GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion

Title: Rapid Adaptation of Earth Observation Foundation Models for Segmentation

Title: SFR-RAG: Towards Contextually Faithful LLMs

Title: Deep Graph Anomaly Detection: A Survey and New Perspectives

Title: 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction

Title: SelECT-SQL: Self-correcting ensemble Chain-of-Thought for Text-to-SQL

Title: AttnMod: Attention-Based New Art Styles

Title: Enhancing Anomaly Detection via Generating Diversified and Hard-to-distinguish Synthetic Anomalies

Title: MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior

Title: DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection

Title: Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT

Title: PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Title: RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models

Title: From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs

Title: Anatomical Positional Embeddings

Title: On Synthetic Texture Datasets: Challenges, Creation, and Curation

Title: Taming Diffusion Models for Image Restoration: A Review

Title: 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?

Title: Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

Title: Mamba-ST: State Space Model for Efficient Style Transfer

Title: Signed Graph Autoencoder for Explainable and Polarization-Aware Network Embeddings

Title: MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Title: SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing

Title: Do Pre-trained Vision-Language Models Encode Object States?