2024-05-14

Title: Levels of AI Agents: from Rules to Large Language Models

Title: Large Language Models as Planning Domain Generators

Title: Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm

Title: Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech

Title: Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling

Title: Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large Language Models

Title: EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD

Title: ATG: Benchmarking Automated Theorem Generation for Generative Language Models

Title: Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning

Title: Leveraging Lecture Content for Improved Feedback: Explorations with GPT-4 and Retrieval Augmented Generation

Title: Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Title: ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization

Title: QuakeBERT: Accurate Classification of Social Media Texts for Rapid Earthquake Impact Assessment

Title: Multigenre AI-powered Story Composition

Title: Word2World: Generating Stories and Worlds through Large Language Models

Title: Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes

Title: Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering

Title: SUTRA: Scalable Multilingual Language Model Architecture

Title: Utilizing Large Language Models to Generate Synthetic Data to Increase the Performance of BERT-Based Neural Networks

Title: Automated Conversion of Static to Dynamic Scheduler via Natural Language

Title: ChatSOS: Vector Database Augmented Generative Question Answering Assistant in Safety Engineering

Title: Interpretable Cross-Examination Technique (ICE-T): Using highly informative features to boost LLM performance

Title: LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

Title: Exploring the Capabilities of Large Multimodal Models on Dense Text

Title: Hypothesis Testing Prompting Improves Deductive Reasoning in Large Language Models

Title: Evaluating the Efficacy of AI Techniques in Textual Anonymization: A Comparative Study

Title: Mobile Sequencers

Title: Digital Diagnostics: The Potential Of Large Language Models In Recognizing Symptoms Of Common Illnesses

Title: Unveiling the Competitive Dynamics: A Comparative Evaluation of American and Chinese LLMs

Title: Towards a path dependent account of category fluency

Title: Enhancing Creativity in Large Language Models through Associative Thinking Strategies

Title: Enhancing Traffic Prediction with Textual Data Using Large Language Models

Title: Opportunities for Persian Digital Humanities Research with Artificial Intelligence Language Models; Case Study: Forough Farrokhzad

Title: LLM-Generated Black-box Explanations Can Be Adversarially Helpful

Title: Tackling Execution-Based Evaluation for NL2Bash

Title: TacoERE: Cluster-aware Compression for Event Relation Extraction

Title: Finding structure in logographic writing with library learning

Title: CoRE: LLM as Interpreter for Natural Language Programming, Pseudo-Code Programming, and Flow Programming of AI Agents

Title: Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT

Title: Evaluating Task-based Effectiveness of MLLMs on Charts

Title: A Turkish Educational Crossword Puzzle

Title: Length-Aware Multi-Kernel Transformer for Long Document Classification

Title: Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models

Title: Do Pretrained Contextual Language Models Distinguish between Hebrew Homograph Analyses?

Title: Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA

Title: Designing and Evaluating Dialogue LLMs for Co-Creative Improvised Theatre

Title: InsightNet: Structured Insight Mining from Customer Feedback

Title: Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis

Title: Human-interpretable clustering of short-text using large language models

Title: Humor Mechanics: Advancing Humor Generation with Multistep Reasoning

Title: Branching Narratives: Character Decision Points Detection

Title: L(u)PIN: LLM-based Political Ideology Nowcasting

Title: MedConceptsQA -- Open Source Medical Concepts QA Benchmark

Title: Evaluation of Retrieval-Augmented Generation: A Survey

Title: MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation

Title: Evaluating large language models in medical applications: a survey

Title: Strategic Data Ordering: Enhancing Large Language Model Performance through Curriculum Learning

Title: MacBehaviour: An R package for behavioural experimentation on large language models

Title: EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models

Title: MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning

Title: NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition

Title: ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source

Title: COBias and Debias: Minimizing Language Model Pairwise Accuracy Bias via Nonlinear Integer Programming

Title: Age-Dependent Analysis and Stochastic Generation of Child-Directed Speech

Title: OpenLLM-Ro -- Technical Report on Open-source Romanian LLMs trained starting from Llama 2

Title: Quantifying and Optimizing Global Faithfulness in Persona-driven Role-playing

Title: LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language

Title: TANQ: An open domain dataset of table answered questions

Title: DEPTH: Discourse Education through Pre-Training Hierarchically

Title: Zero-Shot Tokenizer Transfer

Title: Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers

Title: PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition

Title: EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning

Title: Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots