2024-07-11

Title: Nash CoT: Multi-Path Inference with Preference Equilibrium

Title: Identification of emotions on Twitter during the 2022 electoral process in Colombia

Title: Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models

Title: ESM+: Modern Insights into Perspective on Text-to-SQL Evaluation in the Age of Large Language Models

Title: RAG vs. Long Context: Examining Frontier Large Language Models for Environmental Review Document Comprehension

Title: Probability of Differentiation Reveals Brittleness of Homogeneity Bias in Large Language Models

Title: Interpretable Differential Diagnosis with Dual-Inference Large Language Models

Title: MixSumm: Topic-based Data Augmentation using LLMs for Low-resource Extractive Text Summarization

Title: Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture

Title: LokiLM: Technical Report

Title: KpopMT: Translation Dataset with Terminology for Kpop Fandom

Title: Review-LLM: Harnessing Large Language Models for Personalized Review Generation

Title: Bucket Pre-training is All You Need

Title: Beyond Benchmarking: A New Paradigm for Evaluation and Assessment of Large Language Models

Title: Arabic Automatic Story Generation with Large Language Models

Title: On Leakage of Code Generation Evaluation Datasets

Title: A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training

Title: A Proposed S.C.O.R.E. Evaluation Framework for Large Language Models : Safety, Consensus, Objectivity, Reproducibility and Explainability

Title: Multi-task Prompt Words Learning for Social Media Content Generation

Title: WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment

Title: Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Title: Attribute or Abstain: Large Language Models as Long Document Assistants

Title: Training on the Test Task Confounds Evaluation and Emergence