
Title: Enhancing Question Answering for Enterprise Knowledge Bases using Large Language Models

Title: Can Contrastive Learning Refine Embeddings

Title: FastLogAD: Log Anomaly Detection with Mask-Guided Pseudo Anomaly Generation and Discrimination

Title: Towards Sim-to-Real Industrial Parts Classification with Synthetic Dataset

Title: Detecting AI-Generated Images via CLIP

Title: Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes

Title: Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation

Title: E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data

Title: Single-image driven 3d viewpoint training data augmentation for effective wine label recognition

Title: LLM In-Context Recall is Prompt Dependent

Title: EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM

Title: ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model

Title: PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification

Title: Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives

Title: Label-free Anomaly Detection in Aerial Agricultural Images with Masked Image Modeling

Title: Enforcing Paraphrase Generation via Controllable Latent Diffusion

Title: Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles

Title: Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery

Title: MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wild

Title: Theoretical research on generative diffusion models: an overview

Title: Adapting Mental Health Prediction Tasks for Cross-lingual Learning via Meta-Training and In-context Learning with Large Language Model

Title: Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective

Title: Probabilistic Directed Distance Fields for Ray-Based Shape Representations

Title: Exploring Generative AI for Sim2Real in Driving Data Synthesis

Title: GCC: Generative Calibration Clustering

Title: Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

Title: From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation

Title: RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion

Title: ToNER: Type-oriented Named Entity Recognition with Generative Language Model

Title: GeMQuAD: Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning

Title: LoopAnimate: Loopable Salient Object Animation

Title: FaceCat: Enhancing Face Recognition Security with a Unified Generative Model Framework

Title: DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

Title: DEGNN: Dual Experts Graph Neural Network Handling Both Edge and Node Feature Noise

Title: DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

Title: DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling

Title: Fault Detection in Mobile Networks Using Diffusion Models

Title: RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion

Title: Reap the Wild Wind: Detecting Media Storms in Large-Scale News Corpora

Title: Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

Title: Counteracting Concept Drift by Learning with Future Malware Predictions

Title: RankCLIP: Ranking-Consistent Language-Image Pretraining

Title: Masked and Shuffled Blind Spot Denoising for Real-World Images

Title: Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models

Title: Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes

Title: Human-in-the-Loop Segmentation of Multi-species Coral Imagery

Title: VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection

Title: Exploring Text-to-Motion Generation with Human Preference

Title: PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI

Title: Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning

Title: Magic Clothing: Controllable Garment-Driven Image Synthesis

Title: Deep image learning of quantitative structure-property relationships of copper alloys via feature augmentation on Geodesic curve in shape space

Title: TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models

Title: Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement

Title: AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception

Title: In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation

Title: All-in-one simulation-based inference

Title: Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection

Title: VFLGAN: Vertical Federated Learning-based Generative Adversarial Network for Vertically Partitioned Data Publication

Title: Convergence Analysis of Probability Flow ODE for Score-based Generative Models

Title: Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models

Title: Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement

Title: FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features

Title: Can We Break Free from Strong Data Augmentations in Self-Supervised Learning?

Title: Personalized Collaborative Fine-Tuning for On-Device Large Language Models

Title: The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning

Title: Impact of Preference Noise on the Alignment Performance of Generative Language Models

Title: Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency

Title: Digging into contrastive learning for robust depth estimation with diffusion models

Title: A Diffusion-based Data Generator for Training Object Recognition Models in Ultra-Range Distance

Title: Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Title: Explainable Online Unsupervised Anomaly Detection for Cyber-Physical Systems via Causal Discovery from Time Series

Title: EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting

Title: Evolving Interpretable Visual Classifiers with Large Language Models

Title: How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model

Title: Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Title: Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers

Title: MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Title: Memory Sharing for Large Language Model based Agents

Title: in2IN: Leveraging individual Information to Generate Human INteractions

Title: HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

Title: Taming Latent Diffusion Model for Neural Radiance Field Inpainting