2024-06-27

Title: Role of Dependency Distance in Text Simplification: A Human vs ChatGPT Simplification Comparison

Title: Spanish and LLM Benchmarks: is MMLU Lost in Translation?

Title: Understanding the Role of User Profile in the Personalization of Large Language Models

Title: Can LLMs Generate Visualizations with Dataless Prompts?

Title: MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?

Title: Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary

Title: Training-Free Exponential Extension of Sliding Window Context with Cascading KV Cache

Title: Improving Arithmetic Reasoning Ability of Large Language Models through Relation Tuples, Verification and Dynamic Feedback

Title: CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design

Title: PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning

Title: Do they mean 'us'? Interpreting Referring Expressions in Intergroup Bias

Title: NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalization

Title: SimsChat: A Customisable Persona-Driven Role-Playing Agent

Title: Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets

Title: Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective

Title: Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts

Title: Inherent Challenges of Post-Hoc Membership Inference for Large Language Models

Title: EDEN: Empathetic Dialogues for English learning

Title: Multi-step Knowledge Retrieval and Inference over Unstructured Data

Title: Explicit Diversity Conditions for Effective Question Answer Generation with Large Language Models

Title: Catching Chameleons: Detecting Evolving Disinformation Generated using Large Language Models

Title: Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher

Title: Automated Clinical Data Extraction with Knowledge Conditioned LLMs

Title: LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them

Title: PharmGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry

Title: Improving Entity Recognition Using Ensembles of Deep Learning and Fine-tuned Large Language Models: A Case Study on Adverse Event Extraction from Multiple Sources

Title: AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning

Title: Evaluating Quality of Answers for Retrieval-Augmented Generation: A Strong LLM Is All You Need

Title: Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction

Title: Octo-planner: On-device Language Model for Planner-Action Agents

Title: Multilingual Knowledge Graph Completion from Pretrained Language Models with Knowledge Constraints

Title: LLM-Driven Multimodal Opinion Expression Identification

Title: Shimo Lab at "Discharge Me!": Discharge Summarization by Prompt-Driven Concatenation of Electronic Health Record Sections

Title: BADGE: BADminton report Generation and Evaluation with LLM

Title: ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs

Title: Poisoned LangChain: Jailbreak LLMs by LangChain

Title: ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models

Title: ConvoCache: Smart Re-Use of Chatbot Responses

Title: Assessing "Implicit" Retrieval Robustness of Large Language Models

Title: Automatic Speech Recognition for Hindi

Title: LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference

Title: NeBuLa: A discourse aware Minecraft Builder

Title: UIO-LLMs: Unbiased Incremental Optimization for Long-Context LLMs

Title: Selective Prompting Tuning for Personalized Conversations with LLMs

Title: Methodology of Adapting Large English Language Models for Specific Cultural Contexts

Title: SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding

Title: A Closer Look into Mixture-of-Experts in Large Language Models

Title: Enhancing Data Privacy in Large Language Models through Private Association Editing

Title: Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets

Title: LLaMIPa: An Incremental Discourse Parser

Title: Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated

Title: "Vorbe\c{s}ti Rom\^ane\c{s}te?" A Recipe to Train Powerful Romanian LLMs with English Instructions

Title: Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs

Title: FactFinders at CheckThat! 2024: Refining Check-worthy Statement Detection with LLMs through Data Pruning

Title: S3: A Simple Strong Sample-effective Multimodal Dialog System

Title: AI-native Memory: A Pathway from LLMs Towards AGI

Title: MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data

Title: PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models

Title: Themis: Towards Flexible and Interpretable NLG Evaluation

Title: Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers

Title: LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Title: IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons

Title: Cascading Large Language Models for Salient Event Graph Generation

Title: Role-Play Zero-Shot Prompting with Large Language Models for Open-Domain Human-Machine Conversation

Title: WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Title: Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming

Title: WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Title: "Is ChatGPT a Better Explainer than My Professor?": Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline

Title: APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Title: CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Title: PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation

Title: Symbolic Learning Enables Self-Evolving Agents