2024-08-13

Title: Large Model Strategic Thinking, Small Model Efficiency: Transferring Theory of Mind in Large Language Models

Title: The Role and Applications of Airport Digital Twin in Cyberattack Protection during the Generative AI Era

Title: The impact of internal variability on benchmarking deep learning climate emulators

Title: Hybrid Efficient Unsupervised Anomaly Detection for Early Pandemic Case Identification

Title: PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identificat

Title: LaiDA: Linguistics-aware In-context Learning with Data Augmentation for Metaphor Components Identification

Title: Style-Preserving Lip Sync via Audio-Aware Style Reference

Title: High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model

Title: Multimodal generative semantic communication based on latent diffusion model

Title: Path-LLM: A Shortest-Path-based LLM Learning for Unified Graph Representation

Title: ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack

Title: ZePo: Zero-Shot Portrait Stylization with Faster Sampling

Title: SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning

Title: Large Language Model-based Role-Playing for Personalized Medical Jargon Extraction

Title: What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon

Title: Sequential Representation Learning via Static-Dynamic Conditional Disentanglement

Title: UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling

Title: Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

Title: StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model

Title: SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction

Title: Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Title: Contrastive masked auto-encoders based self-supervised hashing for 2D image and 3D point cloud cross-modal retrieval

Title: SSL: A Self-similarity Loss for Improving Generative Image Super-resolution

Title: MTSCI: A Conditional Diffusion Model for Multivariate Time Series Consistent Imputation

Title: An analysis of HOI: using a training-free method with multimodal visual foundation models when only the test set is available, without the training set

Title: Efficient Test-Time Prompt Tuning for Vision-Language Models

Title: Egocentric Vision Language Planning

Title: HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training

Title: LaWa: Using Latent Space for In-Generation Image Watermarking

Title: LLM-Based Robust Product Classification in Commerce and Compliance

Title: GFlowNet Training by Policy Gradients

Title: Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information

Title: HcNet: Image Modeling with Heat Conduction Equation

Title: Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts

Title: A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models

Title: Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation

Title: UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization

Title: Freehand Sketch Generation from Mechanical Components

Title: Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection

Title: Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models

Title: An Analysis for Image-to-Image Translation and Style Transfer

Title: BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training

Title: What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

Title: ControlNeXt: Powerful and Efficient Control for Image and Video Generation

Title: CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Title: Building Decision Making Models Through Language Model Regime

Title: A Methodological Report on Anomaly Detection on Dynamic Knowledge Graphs

Title: Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models

Title: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

Title: FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework

Title: Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images

Title: 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs)

Title: Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning

Title: Open-Source Molecular Processing Pipeline for Generating Molecules

Title: DUNE: A Machine Learning Deep UNet++ based Ensemble Approach to Monthly, Seasonal and Annual Climate Forecasting