2024-06-21

Title: SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation

Title: Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors

Title: D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Title: Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation

Title: Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG

Title: Exploring and Benchmarking the Planning Capabilities of Large Language Models

Title: Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

Title: Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

Title: Learning to Generate Answers with Citations via Factual Consistency Models

Title: When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models

Title: PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model

Title: Large Language Models are Biased Because They Are Large Language Models

Title: DialSim: A Real-Time Simulator for Evaluating Long-Term Dialogue Understanding of Conversational Agents

Title: Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective

Title: QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism

Title: Locating and Extracting Relational Concepts in Large Language Models

Title: Learnable In-Context Vector for Visual Question Answering

Title: Synthetic Context Generation for Question Generation

Title: Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata

Title: Bridging Law and Data: Augmenting Reasoning via a Semi-Structured Dataset with IRAC methodology

Title: Probing the Emergence of Cross-lingual Alignment during LLM Training

Title: Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding

Title: Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models

Title: Data Contamination Can Cross Language Barriers

Title: GSR-BENCH: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs

Title: R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation

Title: BeHonest: Benchmarking Honesty of Large Language Models

Title: Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective

Title: Improving Zero-shot LLM Re-Ranker with Risk Minimization

Title: SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Title: ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models

Title: Transferable speech-to-text large language model alignment module

Title: ALiiCE: Evaluating Positional Fine-grained Citation Generation

Title: CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration

Title: MoreHopQA: More Than Multi-hop Reasoning

Title: SQLFixAgent: Towards Semantic-Accurate SQL Generation via Multi-Agent Collaboration

Title: Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators

Title: Finding Blind Spots in Evaluator LLMs with Interpretable Checklists

Title: Dual-Phase Accelerated Prompt Optimization

Title: VDebugger: Harnessing Execution Feedback for Debugging Visual Programs

Title: Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks

Title: LLMs Are Zero-Shot Context-Aware Simultaneous Translators

Title: Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Title: Mitigating Social Biases in Language Models through Unlearning

Title: BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation

Title: Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models

Title: Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration

Title: Optimizing Psychological Counseling with Instruction-Tuned Large Language Models

Title: In-Context Former: Lightning-fast Compressing Context for Large Language Model

Title: Improving Visual Commonsense in Language Models via Multiple Image Generation

Title: Fine-Tuning Gemma-7B for Enhanced Sentiment Analysis of Financial News Headlines

Title: InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising

Title: Can Few-shot Work in Long-Context? Recycling the Context to Generate Demonstrations

Title: Towards Minimal Targeted Updates of Language Models with Targeted Negative Training

Title: ObscurePrompt: Jailbreaking Large Language Models via Obscure Input

Title: Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Title: Leveraging Large Language Models to Measure Gender Bias in Gendered Languages

Title: Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation

Title: MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language

Title: Breaking News: Case Studies of Generative AI's Use in Journalism

Title: Benchmarking Open-Source Language Models for Efficient Question Answering in Industrial Applications

Title: Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization

Title: On the Utility of Domain-Adjacent Fine-Tuned Model Ensembles for Few-shot Problems

Title: Every Language Counts: Learn and Unlearn in Multilingual LLMs

Title: Can LLMs Reason in the Wild with Programs?

Title: FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering

Title: Semantic Structure-Mapping in LLM and Human Analogical Reasoning

Title: WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia

Title: Neuro-symbolic Training for Reasoning over Spatial Language

Title: Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning

Title: Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning

Title: Knowledge Graph-Enhanced Large Language Models via Path Selection

Title: Knowledge Tagging System on Math Questions via LLMs with Flexible Demonstration Retriever

Title: ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World

Title: Adaptable Logical Control for Large Language Models

Title: Open Generative Large Language Models for Galician

Title: Generative AI for Enhancing Active Learning in Education: A Comparative Study of GPT-3.5 and GPT-4 in Crafting Customized Test Questions

Title: Persuasiveness of Generated Free-Text Rationales in Subjective Decisions: A Case Study on Pairwise Argument Ranking

Title: GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models

Title: Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination

Title: Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment

Title: AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought

Title: Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas

Title: MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models

Title: Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation

Title: Exploring Changes in Nation Perception with Nationality-Assigned Personas in LLMs

Title: "Global is Good, Local is Bad?": Understanding Brand Bias in LLMs

Title: Information Guided Regularization for Fine-tuning Language Models

Title: Seeing Through AI's Lens: Enhancing Human Skepticism Towards LLM-Generated Fake News

Title: HIGHT: Hierarchical Graph Tokenization for Graph-Language Alignment

Title: Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective

Title: Prompt Injection Attacks in Defended Systems

Title: How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics

Title: Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models

Title: Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models

Title: MACAROON: Training Vision-Language Models To Be Your Engaged Partners

Title: Finding Safety Neurons in Large Language Models

Title: Aligning Large Language Models with Diverse Political Viewpoints

Title: Definition generation for lexical semantic change detection

Title: In Tree Structure Should Sentence Be Generated

Title: Timo: Towards Better Temporal Reasoning for Language Models

Title: On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning

Title: Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing

Title: On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?

Title: Step-Back Profiling: Distilling User History for Personalized Scientific Writing

Title: Augmenting Query and Passage for Retrieval-Augmented Generation using LLMs for Open-Domain Question Answering

Title: Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

Title: VAIYAKARANA : A Benchmark for Automatic Grammar Correction in Bangla

Title: Infusing clinical knowledge into tokenisers for language models

Title: Robust Few-shot Transfer Learning for Knowledge Base Question Answering with Unanswerable Questions

Title: Identifying User Goals from UI Trajectories

Title: Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Title: medIKAL: Integrating Knowledge Graphs as Assistants of LLMs for Enhanced Clinical Diagnosis on EMRs

Title: Self-supervised Interpretable Concept-based Models for Text Classification

Title: Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction

Title: SEC-QA: A Systematic Evaluation Corpus for Financial QA

Title: SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages

Title: Towards Truthful Multilingual Large Language Models: Benchmarking and Alignment Strategies

Title: Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models

Title: Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases

Title: Instruction Pre-Training: Language Models are Supervised Multitask Learners

Title: LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors

Title: Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary

Title: Overview of the CAIL 2023 Argument Mining Track

Title: Translating Across Cultures: LLMs for Intralingual Cultural Adaptation

Title: Evidence of a log scaling law for political persuasion with large language models

Title: Investigating Mysteries of CoT-Augmented Distillation

Title: Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems

Title: Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Title: GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models

Title: How to Compute the Probability of a Word

Title: Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Title: Model Merging and Safety Alignment: One Bad Model Spoils the Bunch