2024-06-07

Title: Ranking Manipulation for Conversational Search Engines

Title: Measuring Retrieval Complexity in Question Answering Systems

Title: Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning

Title: TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools

Title: Is Free Self-Alignment Possible?

Title: What Makes Language Models Good-enough?

Title: Evaluating the World Model Implicit in a Generative Model

Title: M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering

Title: A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

Title: LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification

Title: Efficient Knowledge Infusion via KG-LLM Alignment

Title: NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human

Title: XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags

Title: End-to-End Trainable Soft Retriever for Low-resource Relation Extraction

Title: Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning

Title: ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Title: Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies

Title: Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

Title: Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism

Title: Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As

Title: BLSP-Emo: Towards Empathetic Large Speech-Language Models

Title: Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models

Title: HeSum: a Novel Dataset for Abstractive Text Summarization in Hebrew

Title: UltraMedical: Building Specialized Generalists in Biomedicine

Title: Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech

Title: A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential

Title: On The Persona-based Summarization of Domain-Specific Documents

Title: Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens of Relevance Paraphrasing

Title: Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models

Title: Intention and Face in Dialog

Title: Uncovering Limitations of Large Language Models in Information Seeking from Tables

Title: Are We Done with MMLU?

Title: Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts

Title: Do Language Models Understand Morality? Towards a Robust Detection of Moral Content

Title: Every Answer Matters: Evaluating Commonsense with Probabilistic Measures

Title: Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness

Title: Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness

Title: Confabulation: The Surprising Value of Large Language Model Hallucinations

Title: DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning

Title: Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model

Title: ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models

Title: mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans

Title: What Do Language Models Learn in Context? The Structured Task Hypothesis

Title: Rethinking LLM and Linguistic Steganalysis: An Efficient Detection of Strongly Concealed Stego

Title: BEADs: Bias Evaluation Across Domains

Title: Benchmark Data Contamination of Large Language Models: A Survey

Title: Transformers need glasses! Information over-squashing in language tasks

Title: Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Title: Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People

Title: What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

Title: PaCE: Parsimonious Concept Engineering for Large Language Models