2024-07-19

Title: GPT Czech Poet: Generation of Czech Poetic Strophes with Language Models

Title: TourLLM: Enhancing LLMs with Tourism Knowledge

Title: Building Understandable Messaging for Policy and Evidence Review (BUMPER) with AI

Title: Data Generation using Large Language Models for Text Classification: An Empirical Case Study

Title: SMLT-MUGC: Small, Medium, and Large Texts -- Machine versus User-Generated Content Detection and Comparison

Title: "I understand why I got this grade": Automatic Short Answer Grading with Feedback

Title: PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Title: AutoFlow: Automated Workflow Generation for Large Language Model Agents

Title: Lightweight Large Language Model for Medication Enquiry: Med-Pal

Title: WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models

Title: Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models

Title: Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights

Title: Why Does New Knowledge Create Messy Ripple Effects in LLMs?

Title: Knowledge-based Consistency Testing of Large Language Models

Title: Truth is Universal: Robust Detection of Lies in LLMs

Title: ESQA: Event Sequences Question Answering

Title: Regurgitative Training: The Value of Real Data in Training Large Language Models

Title: OSPC: Artificial VLM Features for Hateful Meme Detection

Title: Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction

Title: What to do if language models disagree? Black-box model ensembling for textual and visual question answering

Title: NutriBench: A Dataset for Evaluating Large Language Models in Carbohydrate Estimation from Meal Descriptions

Title: $\texttt{metabench}$ -- A Sparse Benchmark to Measure General Ability in Large Language Models

Title: Identifying the Source of Generation for Large Language Models

Title: Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments

Title: Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization

Title: Limits to Predicting Online Speech Using Large Language Models

Title: Scaling Retrieval-Based Language Models with a Trillion-Token Datastore

Title: Large Language Models can impersonate politicians and other public figures

Title: AI AI Bias: Large Language Models Favor Their Own Generated Content

Title: Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis

Title: Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)

Title: Automated Question Generation on Tabular Data for Conversational Data Exploration

Title: STAGE: Simplified Text-Attributed Graph Embeddings Using Pre-trained LLMs

Title: CiteME: Can Language Models Accurately Cite Scientific Claims?

Title: Analyzing Large language models chatbots: An experimental approach using a probability test

Title: Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models

Title: GRAD-SUM: Leveraging Gradient Summarization for Optimal Prompt Engineering

Title: Beyond KV Caching: Shared Attention for Efficient LLMs

Title: Bilingual Adaptation of Monolingual Foundation Models

Title: MetaTool: Facilitating Large Language Models to Master Tools with Meta-task Augmentation

Title: Evaluating Large Language Models with fmeval

Title: Evaluation of RAG Metrics for Question Answering in the Telecom Domain

Title: SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning

Title: Review-Feedback-Reason (ReFeR): A Novel Framework for NLG Evaluation and Reasoning

Title: Do LLMs have Consistent Values?

Title: Large Visual-Language Models Are Also Good Classifiers: A Study of In-Context Multimodal Fake News Detection

Title: InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification

Title: BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Title: Whitening Not Recommended for Classification Tasks in LLMs

Title: Explainable Biomedical Hypothesis Generation via Retrieval Augmented Generation enabled Large Language Models

Title: Halu-J: Critique-Based Hallucination Judge

Title: A Survey of Prompt Engineering Methods in Large Language Models for Different NLP Tasks

Title: Establishing Knowledge Preference in Language Models

Title: Dynamic Sentiment Analysis with Local Large Language Models using Majority Voting: A Study on Factors Affecting Restaurant Evaluation

Title: AlcLaM: Arabic Dialectal Language Model

Title: Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering with an Iterative Approach

Title: Translate-and-Revise: Boosting Large Language Models for Constrained Translation

Title: Retrieval-Augmented Generation for Natural Language Processing: A Survey

Title: Transformer-based Single-Cell Language Model: A Survey

Title: Evaluating Large Language Models for Anxiety and Depression Classification using Counseling and Psychotherapy Transcripts

Title: PM-LLM-Benchmark: Evaluating Large Language Models on Process Mining Tasks

Title: Are Large Language Models Capable of Generating Human-Level Narratives?

Title: SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning

Title: Robust ASR Error Correction with Conservative Data Filtering

Title: CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

Title: Why do you cite? An investigation on citation intents and decision-making classification processes

Title: Learning-From-Mistakes Prompting for Indigenous Language Translation

Title: From Words to Worlds: Compositionality for Cognitive Architectures

Title: End-To-End Clinical Trial Matching with Large Language Models

Title: Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation

Title: Combining Constraint Programming Reasoning with Large Language Model Predictions

Title: Enhancing Biomedical Knowledge Discovery for Diseases: An End-To-End Open-Source Framework

Title: Can Open-Source LLMs Compete with Commercial Models? Exploring the Few-Shot Performance of Current GPT Models in Biomedical Tasks

Title: Research on Tibetan Tourism Viewpoints information generation system based on LLM

Title: dzFinNlp at AraFinNLP: Improving Intent Detection in Financial Conversational Agents

Title: Large Language Models as Reliable Knowledge Bases?

Title: Towards Zero-Shot Multimodal Machine Translation

Title: PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasks

Title: Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Title: Weak-to-Strong Reasoning

Title: FuLG: 150B Romanian Corpus for Language Model Pretraining

Title: DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

Title: Prover-Verifier Games improve legibility of LLM outputs

Title: Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Title: ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection

Title: Understanding Reference Policies in Direct Preference Optimization

Title: Baba Is AI: Break the Rules to Beat the Benchmark

Title: LLMs as Function Approximators: Terminology, Taxonomy, and Questions for Evaluation

Title: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models

Title: Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data