2024-07-22

Title: RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring

Title: Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

Title: Learning Goal-Conditioned Representations for Language Reward Models

Title: Crafting Efficient Fine-Tuning Strategies for Large Language Models

Title: BiasDPO: Mitigating Bias in Language Models through Direct Preference Optimization

Title: Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction

Title: FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking

Title: RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering

Title: NeLLCom-X: A Comprehensive Neural-Agent Framework to Simulate Language Learning and Group Communication

Title: HeCiX: Integrating Knowledge Graphs and Large Language Models for Biomedical Research

Title: ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?

Title: Prompted Aspect Key Point Analysis for Quantitative Review Summarization

Title: LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Title: Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation

Title: I Know About "Up"! Enhancing Spatial Reasoning in Visual Language Models Through 3D Reconstruction

Title: Automatic Classification of News Subjects in Broadcast News: Application to a Gender Bias Representation Analysis

Title: LeKUBE: A Legal Knowledge Update BEnchmark

Title: Conditioning Chat-GPT for information retrieval: the Unipa-GPT case study

Title: Voices in a Crowd: Searching for Clusters of Unique Perspectives

Title: Predictive Simultaneous Interpretation: Harnessing Large Language Models for Democratizing Real-Time Multilingual Communication

Title: How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading

Title: Multimodal Misinformation Detection using Large Vision-Language Models

Title: LLMs left, right, and center: Assessing GPT's capabilities to label political bias from web domains

Title: Open Artificial Knowledge

Title: Check-Eval: A Checklist-based Approach for Evaluating Text Quality

Title: ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Title: Evaluating the Reliability of Self-Explanations in Large Language Models

Title: Internal Consistency and Self-Feedback in Large Language Models: A Survey