2024-03-20

Title: Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions

Title: The Boy Who Survived: Removing Harry Potter from an LLM is harder than reported

Title: TMU at TREC Clinical Trials Track 2023

Title: Syn-QA2: Evaluating False Assumptions in Long-tail Questions with Synthetic QA Datasets

Title: EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Title: TnT-LLM: Text Mining at Scale with Large Language Models

Title: Reference-based Metrics Disprove Themselves in Question Generation

Title: Zero-Shot Multi-task Hallucination Detection

Title: FinLlama: Financial Sentiment Classification for Algorithmic Trading Applications

Title: A Comparative Investigation of Compositional Syntax and Semantics in DALL-E 2

Title: Leveraging Large Language Models to Extract Information on Substance Use Disorder Severity from Clinical Notes: A Zero-shot Learning Approach

Title: OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety

Title: Characteristic AI Agents via Large Language Models

Title: RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners

Title: Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning

Title: AraPoemBERT: A Pretrained Language Model for Arabic Poetry Analysis

Title: Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering

Title: An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis

Title: Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales

Title: Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator

Title: MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation

Title: Third-Party Language Model Performance Prediction from Instruction

Title: CrossTune: Black-Box Few-Shot Classification with Label Enhancement

Title: Prompt-based Graph Model for Joint Liberal Event Extraction and Event Schema Induction

Title: Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

Title: AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework

Title: Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs

Title: LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models

Title: Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean

Title: Pragmatic Competence Evaluation of Large Language Models for Korean

Title: Empowering Air Travelers: A Chatbot for Canadian Air Passenger Rights

Title: Instructing Large Language Models to Identify and Ignore Irrelevant Conditions

Title: NovelQA: A Benchmark for Long-Range Novel Question Answering

Title: Automated Data Curation for Robust Language Model Fine-Tuning

Title: Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models

Title: Epistemology of Language Models: Do Language Models Have Holistic Knowledge?

Title: Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Title: Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts

Title: Supporting Energy Policy Research with Large Language Models

Title: Automatic Information Extraction From Employment Tribunal Judgements Using Large Language Models

Title: Dated Data: Tracing Knowledge Cutoffs in Large Language Models

Title: LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression