2024-03-07

Title: Mad Libs Are All You Need: Augmenting Cross-Domain Document-Level Event Argument Data

Title: Book2Dial: Generating Teacher-Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots

Title: Guardrail Baselines for Unlearning in LLMs

Title: DIVERSE: Deciphering Internet Views on the U.S. Military Through Video Comment Stance Analysis, A Novel Benchmark Dataset for Stance Classification

Title: Scope of Large Language Models for Mining Emerging Opinions in Online Health Discourse

Title: Learning to Maximize Mutual Information for Chain-of-Thought Distillation

Title: Japanese-English Sentence Translation Exercises Dataset for Automatic Grading

Title: Negating Negatives: Alignment without Human Positive Samples via Distributional Dispreference Optimization

Title: Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models

Title: Magic Markup: Maintaining Document-External Markup with an LLM

Title: A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation

Title: CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

Title: Unsupervised Multilingual Dense Retrieval via Generative Pseudo Labeling

Title: Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem

Title: Multimodal Large Language Models to Support Real-World Fact-Checking

Title: GPTopic: Dynamic and Interactive Topic Representations

Title: Apollo: Lightweight Multilingual Medical LLMs towards Democratizing Medical AI to 6B People

Title: General2Specialized LLMs Translation for E-commerce

Title: Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese

Title: German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset

Title: PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

Title: Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ

Title: ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Title: Emojinize : Enriching Any Text with Emoji Translations

Title: Designing Informative Metrics for Few-Shot Example Selection

Title: X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification

Title: KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions

Title: On the Origins of Linear Representations in Large Language Models

Title: Learning to Decode Collaboratively with Multiple Language Models

Title: SaulLM-7B: A pioneering Large Language Model for Law

Title: FaaF: Facts as a Function for the evaluation of RAG systems

Title: From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models

Title: Did Translation Models Get More Robust Without Anyone Even Noticing?

Title: The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models