2024-03-05

Title: PRECISE Framework: GPT-based Text For Improved Readability, Reliability, and Understandability of Radiology Reports For Patient-Centered Care

Title: Getting Serious about Humor: Crafting Humor Datasets with Unfunny Large Language Models

Title: Executing Natural Language-Described Algorithms with Large Language Models: An Investigation

Title: An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning

Title: Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes

Title: Abdelhak at SemEval-2024 Task 9 : Decoding Brainteasers, The Efficacy of Dedicated Models Versus ChatGPT

Title: LoRA Meets Dropout under a Unified Framework

Title: UrbanGPT: Spatio-Temporal Large Language Models

Title: DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models

Title: Information Flow Routes: Automatically Interpreting Language Models at Scale

Title: LLMGuard: Guarding Against Unsafe LLM Behavior

Title: Self-Refinement of Language Models from External Proxy Metrics Feedback

Title: Deep Learning Detection Method for Large Language Models-Generated Scientific Content

Title: CLLMs: Consistency Large Language Models

Title: EyeGPT: Ophthalmic Assistant with Large Language Models

Title: NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications

Title: SoftTiger: A Clinical Foundation Model for Healthcare Workflows

Title: Word Order and World Knowledge

Title: DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models

Title: MediSwift: Efficient Sparse Pre-trained Biomedical Language Models

Title: AutoRD: An Automatic and End-to-End System for Rare Disease Knowledge Graph Construction Based on Ontologies-enhanced Large Language Models

Title: MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection

Title: LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems

Title: Merging Text Transformer Models from Different Initializations

Title: Formulation Comparison for Timeline Construction using LLMs

Title: Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods

Title: Attribute Structuring Improves LLM-Based Evaluation of Clinical Text Summaries

Title: Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks

Title: Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers

Title: FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis

Title: LLMCRIT: Teaching Large Language Models to Use Criteria

Title: LAB: Large-Scale Alignment for ChatBots

Title: Distilling Text Style Transfer With Self-Explanation From LLMs

Title: MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating Chinese and English Computational Language Models

Title: ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies

Title: A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

Title: BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses

Title: STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models

Title: Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding

Title: RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots

Title: DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

Title: API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access

Title: IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

Title: Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

Title: Accelerating Greedy Coordinate Gradient via Probe Sampling

Title: Improving the Validity of Automatically Generated Feedback via Reinforcement Learning

Title: VBART: The Turkish LLM

Title: LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems

Title: Evaluating and Mitigating Number Hallucinations in Large Vision-Language Models: A Consistency Perspective

Title: Automatic Question-Answer Generation for Long-Tail Knowledge

Title: Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering

Title: CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge

Title: What Is Missing in Multilingual Visual Reasoning and How to Fix It

Title: OVEL: Large Language Model as Memory Manager for Online Video Entity Linking

Title: Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge

Title: Controlling Cloze-test Question Item Difficulty with PLM-based Surrogate Models for IRT Assessment

Title: KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations

Title: Infusing Knowledge into Large Language Models with Contextual Prompts

Title: Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics

Title: Revisiting Dynamic Evaluation: Online Adaptation for Large Language Models

Title: In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation

Title: SERVAL: Synergy Learning between Vertical Models and LLMs towards Oracle-Level Zero-shot Medical Prediction

Title: Enhancing Neural Machine Translation of Low-Resource Languages: Corpus Development, Human Evaluation and Explainable AI Architectures

Title: Towards Comprehensive Vietnamese Retrieval-Augmented Generation and Large Language Models

Title: Hypertext Entity Extraction in Webpage

Title: Decode Neural signal as Speech

Title: Differentially Private Synthetic Data via Foundation Model APIs 2: Text

Title: Derivative-Free Optimization for Low-Rank Adaptation in Large Language Models

Title: WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations

Title: NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models

Title: NusaBERT: Teaching IndoBERT to be Multilingual and Multicultural

Title: Making Pre-trained Language Models Great on Tabular Prediction

Title: Rethinking LLM Language Adaptation: A Case Study on Chinese Mixtral

Title: An Improved Traditional Chinese Evaluation Suite for Foundation Model

Title: Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina PT* Family

Title: To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering

Title: IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages

Title: Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?

Title: VariErr NLI: Separating Annotation Error from Human Label Variation

Title: DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory

Title: AS-ES Learning: Towards Efficient CoT Learning in Small Models

Title: Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models

Title: SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis

Title: FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

Title: LLM-Oriented Retrieval Tuner

Title: Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic?

Title: Automated Generation of Multiple-Choice Cloze Questions for Assessing English Vocabulary Using GPT-turbo 3.5

Title: Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models

Title: Using LLMs for the Extraction and Normalization of Product Attribute Values

Title: EEE-QA: Exploring Effective and Efficient Question-Answer Representations

Title: ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context

Title: Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Title: Not all Layers of LLMs are Necessary during Inference

Title: PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

Title: Birbal: An efficient 7B instruct-model fine-tuned with curated datasets

Title: FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction

Title: RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models

Title: Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning