2024-03-26

Title: Distilling Named Entity Recognition Models for Endangered Species from Large Language Models

Title: Loops On Retrieval Augmented Generation (LoRAG)

Title: Unveiling the Anomalies in an Ever-Changing World: A Benchmark for Pixel-Level Anomaly Detection in Continual Learning

Title: Learning to Infer Generative Template Programs for Visual Concepts

Title: Integrating Supervised Extractive and Generative Language Models for Suicide Risk Evidence Summarization

Title: RakutenAI-7B: Extending Large Language Models for Japanese

Title: Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives

Title: GTC: GNN-Transformer Co-contrastive Learning for Self-supervised Heterogeneous Graph Representation

Title: An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models

Title: MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

Title: EAGLE: A Domain Generalization Framework for AI-generated Text Detection

Title: Technical Report: Masked Skeleton Sequence Modeling for Learning Larval Zebrafish Behavior Latent Embeddings

Title: SceneX:Procedural Controllable Large-scale Scene Generation via Large-language Models

Title: Convection-Diffusion Equation: A Theoretically Certified Framework for Neural Networks

Title: BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion

Title: In-Context Matting

Title: Boarding for ISS: Imbalanced Self-Supervised: Discovery of a Scaled Autoencoder for Mixed Tabular Datasets

Title: Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance

Title: X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention

Title: Feature Manipulation for DDPM based Change Detection

Title: IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

Title: Fill in the ____ (a Diffusion-based Image Inpainting Pipeline)

Title: A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA

Title: Robust Diffusion Models for Adversarial Purification

Title: EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing

Title: Self-Supervised Multi-Frame Neural Scene Flow

Title: A Survey on Self-Supervised Pre-Training of Graph Foundation Models: A Knowledge-Based Perspective

Title: One Masked Model is All You Need for Sensor Fault Detection, Isolation and Accommodation

Title: Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method

Title: Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery

Title: Diffusion Model is a Good Pose Estimator from 3D RF-Vision

Title: SQL-Encoder: Improving NL2SQL In-Context Learning Through a Context-Aware Encoder

Title: Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing

Title: Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane

Title: Adversarially Masked Video Consistency for Unsupervised Domain Adaptation

Title: Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs

Title: Constricting Normal Latent Space for Anomaly Detection with Normal-only Training Data

Title: Object Detectors in the Open Environment:Challenges, Solutions, and Outlook

Title: L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression prediction

Title: latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction

Title: AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans

Title: Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion

Title: FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models

Title: Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation

Title: PathoTune: Adapting Visual Foundation Model to Pathological Specialists

Title: Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes

Title: LARA: Linguistic-Adaptive Retrieval-Augmented LLMs for Multi-Turn Intent Classification

Title: Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

Title: LLMs Are Few-Shot In-Context Low-Resource Language Learners

Title: Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models

Title: Visually Guided Generative Text-Layout Pre-training for Document Intelligence

Title: CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification

Title: An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models

Title: SegICL: A Universal In-context Learning Framework for Enhanced Segmentation in Medical Imaging

Title: SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation

Title: SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions

Title: AI-Generated Video Detection via Spatio-Temporal Anomaly Learning

Title: Graph Augmentation for Recommendation

Title: Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise

Title: Multiple-Source Localization from a Single-Snapshot Observation Using Graph Bayesian Optimization

Title: Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm

Title: Multiple Object Tracking as ID Prediction

Title: Encoding of lexical tone in self-supervised models of spoken language

Title: Discrete Latent Graph Generative Modeling with Diffusion Bridges

Title: FLIGAN: Enhancing Federated Learning with Incomplete Data using GAN

Title: SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation

Title: Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance

Title: Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation

Title: Comp4D: LLM-Guided Compositional 4D Scene Generation

Title: Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Title: Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

Title: VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation

Title: SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer

Title: TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models

Title: Invertible Diffusion Models for Compressed Sensing

Title: DreamLIP: Language-Image Pre-training with Long Captions