2024-10-22

Title: Agent Skill Acquisition for Large Language Models via CycleQD

Title: Eliciting Uncertainty in Chain-of-Thought to Mitigate Bias against Forecasting Harmful User Behaviors

Title: SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation

Title: Accounting for Sycophancy in Language Model Uncertainty Estimation

Title: Enabling Scalable Evaluation of Bias Patterns in Medical LLMs

Title: Cross-Document Event-Keyed Summarization

Title: Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus

Title: SPRIG: Improving Large Language Model Performance by System Prompt Optimization

Title: DFlow: Diverse Dialogue Flow Simulation with Large Language Models

Title: Which LLMs are Difficult to Detect? A Detailed Analysis of Potential Factors Contributing to Difficulties in LLM Text Detection

Title: From Test-Taking to Test-Making: Examining LLM Authoring of Commonsense Assessment Items

Title: SemiHVision: Enhancing Medical Multimodal Models with a Semi-Human Annotated Dataset and Fine-Tuned Instruction Generation

Title: ChronoFact: Timeline-based Temporal Fact Verification

Title: CAP: Data Contamination Detection via Consistency Amplification

Title: Transit Pulse: Utilizing Social Media as a Source for Customer Feedback and Information Extraction with Large Language Model

Title: DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Title: A Survey of Ontology Expansion for Conversational Understanding

Title: Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention

Title: Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging

Title: mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code Generation

Title: Are LLMs Good Zero-Shot Fallacy Classifiers?

Title: Toward Robust RALMs: Revealing the Impact of Imperfect Retrieval on Retrieval-Augmented Language Models

Title: Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models

Title: MELT: Materials-aware Continued Pre-training for Language Model Adaptation to Materials Science

Title: Augmenting the Veracity and Explanations of Complex Fact Checking via Iterative Self-Revision with LLMs

Title: Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning

Title: Evaluating Deep Unlearning in Large Language Models

Title: An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making

Title: Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation

Title: Fine-tuning foundational models to code diagnoses from veterinary health records

Title: On the Diversity of Synthetic Data and its Impact on Training Large Language Models

Title: Lossless KV Cache Compression to 2%

Title: Back to School: Translation Using Grammar Books

Title: BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression

Title: Training Language Models to Critique With Multi-agent Feedback

Title: Redefining Proactivity for Information Seeking Dialogue

Title: Does ChatGPT Have a Poetic Style?

Title: LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content

Title: Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

Title: Causality for Large Language Models

Title: A Survey of Uncertainty Estimation in LLMs: Theory Meets Practice

Title: BERTtime Stories: Investigating the Role of Synthetic Story Data in Language pre-training

Title: CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges

Title: A Comprehensive Evaluation of Cognitive Biases in LLMs

Title: Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering

Title: CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts

Title: A Novel Interpretability Metric for Explaining Bias in Language Models: Applications on Multilingual Models from Southeast Asia

Title: Keep Guessing? When Considering Inference Scaling, Mind the Baselines

Title: Hey GPT, Can You be More Racist? Analysis from Crowdsourced Attempts to Elicit Biased Content from Generative AI

Title: "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs

Title: Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?

Title: M-RewardBench: Evaluating Reward Models in Multilingual Settings

Title: Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage

Title: Grammatical Error Correction for Low-Resource Languages: The Case of Zarma

Title: WHoW: A Cross-domain Approach for Analysing Conversation Moderation

Title: Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Title: Stacking Small Language Models for Generalizability

Title: Leveraging Retrieval-Augmented Generation for Culturally Inclusive Hakka Chatbots: Design Insights and User Perceptions

Title: Neural Search Space in Gboard Decoder

Title: A Survey of Conversational Search

Title: AMPLE: Emotion-Aware Multimodal Fusion Prompt Learning for Fake News Detection

Title: Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding

Title: Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Title: Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement

Title: Can Large Language Models Invent Algorithms to Improve Themselves?

Title: SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Title: Resource-Efficient Medical Report Generation using Large Language Models

Title: Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Title: RAC: Efficient LLM Factuality Correction with Retrieval Augmentation

Title: Learning to Generate and Evaluate Fact-checking Explanations with Transformers

Title: Revealing and Mitigating the Local Pattern Shortcuts of Mamba

Title: DomainSum: A Hierarchical Benchmark for Fine-Grained Domain Shift in Abstractive Text Summarization

Title: Efficient Terminology Integration for LLM-based Translation in Specialized Domains

Title: Tokenization as Finite-State Transduction

Title: Mitigating Hallucinations of Large Language Models in Medical Information Extraction via Contrastive Decoding

Title: Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

Title: Learning-to-Defer for Extractive Question Answering

Title: Improve Dense Passage Retrieval with Entailment Tuning

Title: Using GPT Models for Qualitative and Quantitative News Analytics in the 2024 US Presidental Election Process

Title: Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection

Title: CausalGraph2LLM: Evaluating LLMs for Causal Queries

Title: Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs

Title: Self-Explained Keywords Empower Large Language Models for Code Generation

Title: Large Language Models for Cross-lingual Emotion Detection

Title: Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence

Title: 1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification

Title: Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Title: Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model

Title: ComPO: Community Preferences for Language Model Personalization

Title: TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Title: Large Language Models Know What To Say But Not When To Speak

Title: Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse

Title: Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Title: Fine-Tuning LLMs for Reliable Medical Question-Answering Services

Title: Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Title: Do LLMs write like humans? Variation in grammatical and rhetorical styles

Title: A Psycholinguistic Evaluation of Language Models' Sensitivity to Argument Roles

Title: 1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Title: Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Title: A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Title: From Tokens to Materials: Leveraging Language Models for Scientific Discovery

Title: Exploring Pretraining via Active Forgetting for Improving Cross Lingual Transfer for Decoder Language Models

Title: MagicPIG: LSH Sampling for Efficient LLM Generation

Title: RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Title: Contamination Report for Multilingual Benchmarks

Title: Information for Conversation Generation: Proposals Utilising Knowledge Graphs

Title: Pre-training Distillation for Large Language Models: A Design Space Exploration

Title: On Creating an English-Thai Code-switched Machine Translation in Medical Domain

Title: Building A Coding Assistant via the Retrieval-Augmented Language Model

Title: Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping

Title: ToW: Thoughts of Words Improve Reasoning in Large Language Models

Title: Analyzing Context Contributions in LLM-based Machine Translation

Title: Can Knowledge Editing Really Correct Hallucinations?

Title: CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution