2024-06-24

Title: Can LLMs Learn by Teaching? A Preliminary Study

Title: Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation

Title: Major Entity Identification: A Generalizable Alternative to Coreference Resolution

Title: OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset

Title: Exploring Design Choices for Building Language-Specific LLMs

Title: Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell

Title: Bidirectional Transformer Representations of (Spanish) Ambiguous Words in Context: A New Lexical Resource and Empirical Analysis

Title: Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Title: Factual Dialogue Summarization via Learning from Large Language Models

Title: MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate

Title: 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?

Title: TTQA-RS- A break-down prompting approach for Multi-hop Table-Text Question Answering with Reasoning and Summarization

Title: Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?

Title: Learning to Retrieve Iteratively for In-Context Learning

Title: Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks

Title: An LLM Feature-based Framework for Dialogue Constructiveness Assessment

Title: A Learn-Then-Reason Model Towards Generalization in Knowledge Base Question Answering

Title: Understanding Finetuning for Factual Knowledge Extraction

Title: How Well Do LLMs Represent Values Across Cultures? Empirical Analysis of LLM Responses Based on Hofstede Cultural Dimensions

Title: TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems

Title: Word Matters: What Influences Domain Adaptation in Summarization?

Title: Efficient Continual Pre-training by Mitigating the Stability Gap

Title: ToVo: Toxicity Taxonomy via Voting

Title: Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models

Title: From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking

Title: Direct Multi-Turn Preference Optimization for Language Agents

Title: Sports Intelligence: Assessing the Sports Understanding Capabilities of Language Models through Question Answering from Text to Video

Title: 70B-parameter large language models in Japanese medical question-answering

Title: OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants

Title: FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

Title: InternLM-Law: An Open Source Chinese Legal Large Language Model

Title: Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering

Title: Talking the Talk Does Not Entail Walking the Walk: On the Limits of Large Language Models in Lexical Entailment Recognition

Title: Towards Retrieval Augmented Generation over Large Video Libraries

Title: ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models

Title: ICLEval: Evaluating In-Context Learning Ability of Large Language Models

Title: Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation

Title: A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems

Title: Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation

Title: SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation

Title: Unveiling the Impact of Multi-Modal Interactions on User Engagement: A Comprehensive Evaluation in AI-driven Conversations

Title: MedOdyssey: A Medical Domain Benchmark for Long Context Evaluation Up to 200K Tokens

Title: GiusBERTo: A Legal Language Model for Personal Data De-identification in Italian Court of Auditors Decisions

Title: Harnessing Knowledge Retrieval with Large Language Models for Clinical Report Error Correction

Title: PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data

Title: Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network

Title: On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey

Title: Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks

Title: Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss

Title: Hybrid Alignment Training for Large Language Models

Title: Reward Steering with Evolutionary Heuristics for Decoding-time Alignment

Title: How Effective is GPT-4 Turbo in Generating School-Level Questions from Textbooks Based on Bloom's Revised Taxonomy?

Title: Unsupervised Extraction of Dialogue Policies from Conversations

Title: A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation

Title: Detecting Synthetic Lyrics with Few-Shot Inference

Title: Unsupervised Morphological Tree Tokenizer

Title: Evaluating Diversity in Automatic Poetry Generation

Title: Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model

Title: NLP-KG: A System for Exploratory Search of Scientific Literature in Natural Language Processing

Title: LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Title: A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick