2024-10-18

Title: A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions

Title: Capturing Bias Diversity in LLMs

Title: Answering Questions in Stages: Prompt Chaining for Contract QA

Title: UniAutoML: A Human-Centered Framework for Unified Discriminative and Generative AutoML with Large Language Models

Title: Exploring Prompt Engineering: A Systematic Review with SWOT Analysis

Title: TextLap: Customizing Language Models for Text-to-Layout Planning

Title: Toward Relieving Clinician Burden by Automatically Generating Progress Notes using Interim Hospital Data

Title: Accurate and Regret-aware Numerical Problem Solver for Tabular Question Answering

Title: ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning

Title: Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions

Title: RecurFormer: Not All Transformer Heads Need Self-Attention

Title: VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models

Title: The Large Language Model GreekLegalRoBERTa

Title: Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks

Title: TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees

Title: JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework

Title: Optimized Biomedical Question-Answering Services with LLM and Multi-BERT Integration

Title: Enterprise Benchmarks for Large Language Model Evaluation

Title: Large Language Models for Medical OSCE Assessment: A Novel Approach to Transcript Analysis

Title: Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism

Title: LLMD: A Large Language Model for Interpreting Longitudinal Medical Records

Title: Investigating Implicit Bias in Large Language Models: A Large-Scale Study of Over 50 LLMs

Title: ELF-Gym: Evaluating Large Language Models Generated Features for Tabular Prediction

Title: Empowering Dysarthric Speech: Leveraging Advanced LLMs for Accurate Speech Correction and Multimodal Emotion Analysis

Title: Language Model Preference Evaluation with Multiple Weak Evaluators

Title: Skill Learning Using Process Mining for Large Language Model Plan Generation

Title: Beyond Right and Wrong: Mitigating Cold Start in Knowledge Tracing Using Large Language Model and Option Weight

Title: In-context KV-Cache Eviction for LLMs via Attention-Gate

Title: Improving Instruction-Following in Language Models through Activation Steering

Title: Towards More Effective Table-to-Text Generation: Assessing In-Context Learning and Self-Evaluation with Open-Source Models

Title: Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models

Title: Scaling Laws for Multilingual Language Models

Title: AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative Reasoning

Title: REFINE on Scarce Data: Retrieval Enhancement through Fine-Tuning via Model Fusion of Embedding Models

Title: Multi-trait User Simulation with Adaptive Decoding for Conversational Task Assistants

Title: MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation

Title: Large Language Models and the Rationalist Empiricist Debate

Title: A Survey on Data Synthesis and Augmentation for Large Language Models

Title: MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation

Title: Interpreting token compositionality in LLMs: A robustness analysis

Title: Enhancing Mathematical Reasoning in LLMs by Stepwise Correction

Title: Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging

Title: Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning

Title: Self-Pluralising Culture Alignment for Large Language Models

Title: Evaluating the Instruction-following Abilities of Language Models using Knowledge Tasks

Title: BenchmarkCards: Large Language Model and Risk Reporting

Title: Leveraging LLMs for Translating and Classifying Mental Health Data

Title: Qtok: A Comprehensive Framework for Evaluating Multilingual Tokenizer Quality in Large Language Models

Title: "Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities

Title: POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization

Title: LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering

Title: LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Title: When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems

Title: LFOSum: Summarizing Long-form Opinions with Large Language Models

Title: Channel-Wise Mixed-Precision Quantization for Large Language Models

Title: Is Semantic Chunking Worth the Computational Cost?

Title: PromptExp: Multi-granularity Prompt Explanation of Large Language Models

Title: Tuning Language Models by Mixture-of-Depths Ensemble

Title: Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models

Title: Reverse-Engineering the Reader

Title: Learning to Summarize from LLM-generated Feedback

Title: Retrieval-Enhanced Named Entity Recognition

Title: Data Defenses Against Large Language Models

Title: Mapping Bias in Vision Language Models: Signposts, Pitfalls, and the Road Ahead

Title: Better to Ask in English: Evaluation of Large Language Models on English, Low-resource and Cross-Lingual Settings

Title: SLM-Mod: Small Language Models Surpass LLMs at Content Moderation

Title: AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning

Title: aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion

Title: MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback

Title: Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models

Title: The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces

Title: Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration

Title: Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations

Title: BQA: Body Language Question Answering Dataset for Video Large Language Models

Title: FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs

Title: CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy

Title: Proof Flow: Preliminary Study on Generative Flow Network Language Model Tuning for Formal Reasoning

Title: Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Title: SPIN: Self-Supervised Prompt INjection

Title: Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis

Title: Atomic Calibration of LLMs in Long-Form Generations

Title: A Systematic Investigation of Knowledge Retrieval and Selection for Retrieval Augmented Generation

Title: From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition

Title: Roadmap towards Superhuman Speech Understanding using Large Language Models

Title: Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning

Title: SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Title: BANTH: A Multi-label Hate Speech Detection Dataset for Transliterated Bangla

Title: Learning to Route with Confidence Tokens

Title: Advancing Large Language Model Attribution through Self-Improving

Title: Reference-Based Post-OCR Processing with LLM for Diacritic Languages

Title: Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language

Title: Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification

Title: Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems

Title: Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval

Title: Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models

Title: Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement

Title: Representation Learning of Structured Data for Medical Foundation Models

Title: LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights

Title: Metacognitive Monitoring: A Human Ability Beyond Generative Artificial Intelligence

Title: Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

Title: Linguistically Grounded Analysis of Language Models using Shapley Head Values

Title: Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Title: MedINST: Meta Dataset of Biomedical Instructions

Title: Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling

Title: IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

Title: Repetition Neurons: How Do Language Models Produce Repetitions?

Title: Enhancing Text Generation in Joint NLG/NLU Learning Through Curriculum Learning, Semi-Supervised Training, and Advanced Optimization Techniques

Title: RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

Title: GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models

Title: Bias in the Mirror : Are LLMs opinions robust to their own adversarial attacks ?

Title: Integrating Temporal Representations for Dynamic Memory Retrieval and Management in Large Language Models

Title: Enhancing Fact Retrieval in PLMs through Truthfulness

Title: A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Title: Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

Title: An Active Learning Framework for Inclusive Generation by Large Language Models

Title: SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

Title: ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization

Title: HEALTH-PARIKSHA: Assessing RAG Models for Health Chatbots in Real-World Multilingual Settings

Title: Unconstrained Model Merging for Enhanced LLM Reasoning

Title: On the Role of Attention Heads in Large Language Model Safety

Title: MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems

Title: LLM-Human Pipeline for Cultural Context Grounding of Conversations

Title: Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval

Title: Aggregation Artifacts in Subjective Tasks Collapse Large Language Models' Posteriors

Title: The Mystery of the Pathological Path-star Task for Language Models

Title: PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment

Title: Looking Inward: Language Models Can Learn About Themselves by Introspection

Title: Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions

Title: BenTo: Benchmark Task Reduction with In-Context Transferability

Title: A Watermark for Order-Agnostic Language Models

Title: De-mark: Watermark Removal in Large Language Models

Title: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

Title: Retrospective Learning from Interactions

Title: Can MLLMs Understand the Deep Implication Behind Chinese Images?