2024-06-18

Title: Towards Signal Processing In Large Language Models

Title: Unused information in token probability distribution of generative LLM: improving LLM reading comprehension through calculation of expected values

Title: ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets

Title: Hardware-based stack buffer overflow attack detection on RISC-V architectures

Title: Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection

Title: VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning

Title: CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer

Title: What is the best model? Application-driven Evaluation for Large Language Models

Title: Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Title: VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs

Title: SigDiffusions: Score-Based Diffusion Models for Long Time Series via Log-Signature Embeddings

Title: EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems

Title: Towards Neural Scaling Laws for Foundation Models on Temporal Graphs

Title: Consistency-diversity-realism Pareto fronts of conditional image generative models

Title: Enhancing In-Context Learning with Semantic Representations for Relation Extraction

Title: Domain-Specific Shorthand for Generation Based on Context-Free Grammar

Title: The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences

Title: CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

Title: From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

Title: Large Language Models as Event Forecasters

Title: Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition

Title: Learning to Adapt Foundation Model DINOv2 for Capsule Endoscopy Diagnosis

Title: Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space

Title: MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data

Title: Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On

Title: NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

Title: A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing

Title: Enhancing Anomaly Detection Generalization through Knowledge Exposure: The Dual Effects of Augmentation

Title: On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models

Title: Applications of Generative AI in Healthcare: algorithmic, ethical, legal and societal considerations

Title: UniZero: Generalized and Efficient Planning with Scalable Latent World Models

Title: A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection

Title: GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR

Title: Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights

Title: A Comprehensive Survey of Foundation Models in Medicine

Title: FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Title: Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles

Title: STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

Title: Diffusion Model With Optimal Covariance Matching

Title: Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Title: On the Effectiveness of Supervision in Asymmetric Non-Contrastive Learning

Title: Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags

Title: CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

Title: Geometric-informed GFlowNets for Structure-Based Drug Design

Title: Graph Neural Reaction Diffusion Models

Title: Imperceptible Face Forgery Attack via Adversarial Semantic Mask

Title: Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters

Title: Breaking the Attention Bottleneck

Title: MICL: Improving In-Context Learning through Multiple-Label Words in Demonstration

Title: Make Your Home Safe: Time-aware Unsupervised User Behavior Anomaly Detection in Smart Homes via Loss-guided Mask

Title: E-Bench: Towards Evaluating the Ease-of-Use of Large Language Models

Title: ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

Title: ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models

Title: Data Shapley in One Training Run

Title: Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry

Title: Boosting Medical Image Classification with Segmentation Foundation Model

Title: garak: A Framework for Security Probing Large Language Models

Title: Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality

Title: RAEmoLLM: Retrieval Augmented LLMs for Cross-Domain Misinformation Detection Using In-Context Learning based on Emotional Information

Title: An Analysis on Quantizing Diffusion Transformers

Title: Exploiting Diffusion Prior for Out-of-Distribution Detection

Title: Dynamic Order Template Prediction for Generative Aspect-Based Sentiment Analysis

Title: Diffusion Models in Low-Level Vision: A Survey

Title: Learning Iterative Reasoning through Energy Diffusion

Title: A Survey on Human Preference Learning for Large Language Models

Title: In-Context Editing: Learning Knowledge from Self-Induced Distributions

Title: Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion

Title: Consistency^2: Consistent and Fast 3D Painting with Latent Consistency Models

Title: Probing the Decision Boundaries of In-context Learning in Large Language Models

Title: Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion

Title: FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation

Title: Generative Visual Instruction Tuning

Title: Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs

Title: Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding

Title: VideoVista: A Versatile Benchmark for Video Understanding and Reasoning

Title: Hallucination Mitigation Prompts Long-term Video Understanding

Title: Fine-grained Controllable Text Generation through In-context Learning with Feedback

Title: A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences

Title: P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models

Title: Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report

Title: Cross-domain Open-world Discovery

Title: AnyTrans: Translate AnyText in the Image with Large Scale Models

Title: MedThink: Inducing Medical Large-scale Visual Language Models to Hallucinate Less by Thinking More

Title: Automating Easy Read Text Segmentation

Title: Promises, Outlooks and Challenges of Diffusion Language Modeling

Title: How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment

Title: Prior Normality Prompt Transformer for Multi-class Industrial Image Anomaly Detection

Title: HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model

Title: Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment

Title: Quaternion Generative Adversarial Neural Networks and Applications to Color Image Inpainting

Title: ChildDiffusion: Unlocking the Potential of Generative AI and Controllable Augmentations for Child Facial Data using Stable Diffusion and Large Language Models

Title: Standardizing Structural Causal Models

Title: Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

Title: AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection

Title: HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

Title: Lightweight Model Pre-training via Language Guided Knowledge Distillation

Title: Meta Reasoning for Large Language Models

Title: Latent Denoising Diffusion GAN: Faster sampling, Higher image quality

Title: Transcendence: Generative Models Can Outperform The Experts That Train Them

Title: CELL your Model: Contrastive Explanation Methods for Large Language Models

Title: MegaScenes: Scene-Level View Synthesis at Scale

Title: Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models

Title: OoDIS: Anomaly Instance Segmentation Benchmark

Title: Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%

Title: Autoregressive Image Generation without Vector Quantization