2024-09-05

Title: MMLU-Pro+: Evaluating Higher-Order Reasoning and Shortcut Learning in LLMs

Title: Arctic-SnowCoder: Demystifying High-Quality Data in Code Pretraining

Title: Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering

Title: Do Large Language Models Possess Sensitive to Sentiment?

Title: How Privacy-Savvy Are Large Language Models? A Case Study on Compliance and Privacy Technical Review

Title: STAB: Speech Tokenizer Assessment Benchmark

Title: Abstractive Text Summarization: State of the Art, Challenges, and Improvements

Title: DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels

Title: Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts

Title: More is More: Addition Bias in Large Language Models

Title: PUB: Plot Understanding Benchmark and Dataset for Evaluating Large Language Models on Synthetic Visual Data Interpretation

Title: Creating Domain-Specific Translation Memories for Machine Translation Fine-tuning: The TRENCARD Bilingual Cardiology Corpus

Title: Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs

Title: Pre-training data selection for biomedical domain adaptation using journal impact metrics

Title: Pooling And Attention: What Are Effective Designs For LLm-Based Embedding Models?

Title: Towards a Unified View of Preference Learning for Large Language Models: A Survey

Title: MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Title: CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models

Title: Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models

Title: Historical German Text Normalization Using Type- and Token-Based Language Modeling

Title: Visually Grounded Speech Models for Low-resource Languages and Cognitive Modelling

Title: LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Title: LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA