2024-06-11

Title: The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better

Title: DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Title: Retrieval & Fine-Tuning for In-Context Tabular Models

Title: TabPFGen -- Tabular Data Generation with TabPFN

Title: Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers

Title: Efficient Differentially Private Fine-Tuning of Diffusion Models

Title: USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Title: VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed Tomography

Title: Beyond Efficiency: Scaling AI Sustainably

Title: Weakly Supervised Set-Consistency Learning Improves Morphological Profiling of Single-Cell Images

Title: MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Title: RAPID: Robust APT Detection and Investigation Using Context-Aware Deep Learning

Title: Mean-field Chaos Diffusion Models

Title: Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models

Title: Novel Approach to Intrusion Detection: Introducing GAN-MSCNN-BILSTM with LIME Predictions

Title: A Novel Generative AI-Based Framework for Anomaly Detection in Multicast Messages in Smart Grid Communications

Title: Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Title: Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities

Title: Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability

Title: VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification

Title: ThatiAR: Subjectivity Detection in Arabic News Sentences

Title: Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

Title: Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models

Title: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language

Title: PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction

Title: Anomaly Multi-classification in Industrial Scenarios: Transferring Few-shot Learning to a New Task

Title: MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations

Title: Provable Optimization for Adversarial Fair Self-supervised Contrastive Learning

Title: Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Title: ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition

Title: Binarized Diffusion Model for Image Super-Resolution

Title: Region of Interest Loss for Anonymizing Learned Image Compression

Title: Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment

Title: MLCM: Multistep Consistency Distillation of Latent Diffusion Model

Title: Utilizing Grounded SAM for self-supervised frugal camouflaged human detection

Title: ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations

Title: SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention

Title: Unified Text-to-Image Generation and Retrieval

Title: Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks

Title: STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models

Title: LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning

Title: Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Title: FRAG: Frequency Adapting Group for Diffusion Video Editing

Title: Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training

Title: Robust Latent Representation Tuning for Image-text Classification

Title: Generalizable Human Gaussians from Single-View Image

Title: ProcessPainter: Learn Painting Process from Sequence Data

Title: Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks

Title: ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models

Title: DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection

Title: Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

Title: LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages

Title: Data Augmentation in Earth Observation: A Diffusion Model Approach

Title: Compute Better Spent: Replacing Dense Layers with Structured Matrices

Title: Tuning-Free Visual Customization via View Iterative Self-Attention Control

Title: Unveiling the Safety of GPT-4o: An Empirical Study using Jailbreak Attacks

Title: NeuroMoCo: A Neuromorphic Momentum Contrast Learning Method for Spiking Neural Networks

Title: Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching

Title: Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Title: MVGamba: Unify 3D Content Generation as State Space Sequence Modeling

Title: UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving

Title: Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models

Title: Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization

Title: Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue

Title: Controlling Emotion in Text-to-Speech with Natural Language Prompts

Title: Hybrid Video Anomaly Detection for Anomalous Scenarios in Autonomous Driving

Title: Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Title: Cometh: A continuous-time discrete-state graph diffusion model

Title: AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction

Title: Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data

Title: Parallelizing Linear Transformers with the Delta Rule over Sequence Length

Title: Scaling Continuous Latent Variable Models as Probabilistic Integral Circuits

Title: Direct Preference Optimization for Suppressing Hallucinated Prior Exams in Radiology Report Generation

Title: Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer

Title: Merlin: A Vision Language Foundation Model for 3D Computed Tomography

Title: NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing

Title: Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Title: GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation

Title: IllumiNeRF: 3D Relighting without Inverse Rendering