2024-06-28

Title: Evaluating Copyright Takedown Methods for Language Models

Title: Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Title: The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm

Title: Re-Ranking Step by Step: Investigating Pre-Filtering for Re-Ranking with Large Language Models

Title: Categorical Syllogisms Revisited: A Review of the Logical Reasoning Abilities of LLMs for Analyzing Categorical Syllogism

Title: Implicit Discourse Relation Classification For Nigerian Pidgin

Title: Psychological Profiling in Cybersecurity: A Look at LLMs and Psycholinguistic Features

Title: OutlierTune: Efficient Channel-Wise Quantization for Large Language Models

Title: Learning Retrieval Augmentation for Personalized Dialogue Generation

Title: FFN: a Fine-grained Chinese-English Financial Domain Parallel Corpus

Title: Two-Pronged Human Evaluation of ChatGPT Self-Correction in Radiology Report Simplification

Title: Efficacy of Language Model Self-Play in Non-Zero-Sum Games

Title: SSP: Self-Supervised Prompting for Cross-Lingual Transfer to Low-Resource Languages using Large Language Models

Title: Can we teach language models to gloss endangered languages?

Title: Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets

Title: TrustUQA: A Trustful Framework for Unified Structured Data Question Answering

Title: Capturing Minds, Not Just Words: Enhancing Role-Playing Language Models with Personality-Indicative Data

Title: Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding

Title: UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

Title: Improving Weak-to-Strong Generalization with Reliability-Aware Alignment

Title: STBench: Assessing the Ability of Large Language Models in Spatio-Temporal Analysis

Title: EmPO: Theory-Driven Dataset Construction for Empathetic Response Generation through Preference Optimization

Title: AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries

Title: Fairness and Bias in Multimodal AI: A Survey

Title: Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs

Title: CHEW: A Dataset of CHanging Events in Wikipedia

Title: SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Title: T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Title: Simulating Classroom Education with LLM-Empowered Agents

Title: Aligning Teacher with Student Preferences for Tailored Training Data Generation

Title: Tools Fail: Detecting Silent Errors in Faulty Tools

Title: RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs

Title: FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts

Title: Revealing Fine-Grained Values and Opinions in Large Language Models

Title: AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Title: Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Title: AutoPureData: Automated Filtering of Web Data for LLM Fine-tuning

Title: VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation

Title: LiveBench: A Challenging, Contamination-Free LLM Benchmark

Title: IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language

Title: Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?

Title: DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions

Title: The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Title: Suri: Multi-constraint Instruction Following for Long-form Text Generation