2024-06-18

Title: QCQA: Quality and Capacity-aware grouped Query Attention

Title: On the Worst Prompt Performance of Large Language Models

Title: The Impact of Quantization on Retrieval-Augmented Generation: An Analysis of Small LLMs

Title: Towards Signal Processing In Large Language Models

Title: Explicit Word Density Estimation for Language Modelling

Title: Flextron: Many-in-One Flexible Large Language Model

Title: FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

Title: Improving Language Models for Emotion Analysis: Insights from Cognitive Science

Title: COVID-19 Twitter Sentiment Classification Using Hybrid Deep Learning Model Based on Grid Search Methodology

Title: Unused information in token probability distribution of generative LLM: improving LLM reading comprehension through calculation of expected values

Title: Markov Constraint as Large Language Model Surrogate

Title: Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis

Title: Prompt-Based Length Controlled Generation with Multiple Control Types

Title: Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models

Title: VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning

Title: MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Title: RelevAI-Reviewer: A Benchmark on AI Reviewers for Survey Paper Relevance

Title: Robustness of Structured Data Extraction from In-plane Rotated Documents using Multi-Modal Large Language Models (LLM)

Title: CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer

Title: SememeLM: A Sememe Knowledge Enhanced Method for Long-tail Relation Representation

Title: A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations

Title: What is the best model? Application-driven Evaluation for Large Language Models

Title: TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs

Title: CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Models

Title: GenQA: Generating Millions of Instructions from a Handful of Prompts

Title: EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems

Title: Self-Reflection Outcome is Sensitive to Prompt Construction

Title: SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading

Title: Enhancing In-Context Learning with Semantic Representations for Relation Extraction

Title: Domain-Specific Shorthand for Generation Based on Context-Free Grammar

Title: CancerLLM: A Large Language Model in Cancer Domain

Title: Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts

Title: From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

Title: Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?

Title: Large Language Models as Event Forecasters

Title: CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding

Title: Large Language Model Enhanced Clustering for News Event Detection

Title: Facts-and-Feelings: Capturing both Objectivity and Subjectivity in Table-to-Text Generation

Title: We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation

Title: Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models

Title: BlockPruner: Fine-grained Pruning for Large Language Models

Title: Multilingual Large Language Models and Curse of Multilinguality

Title: StructBench: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding

Title: On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models

Title: Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models

Title: DIEKAE: Difference Injection for Efficient Knowledge Augmentation and Editing of Large Language Models

Title: Augmenting Biomedical Named Entity Recognition with General-domain Resources

Title: MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding

Title: SparseCL: Sparse Contrastive Learning for Contradiction Retrieval

Title: GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges

Title: Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles

Title: Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Title: RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning

Title: ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation

Title: Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis

Title: KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs

Title: Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Title: LLMFactor: Extracting Profitable Factors through Prompts for Explainable Stock Movement Prediction

Title: Self-Evolution Fine-Tuning for Policy Optimization

Title: A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery

Title: Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Title: Large Language Models for Automatic Milestone Detection in Group Discussions

Title: Leading Whitespaces of Language Models' Subword Vocabulary Poses a Confound for Calculating Word Probabilities

Title: Step-level Value Preference Optimization for Mathematical Reasoning

Title: Analyzing Key Neurons in Large Language Models

Title: COOL: Comprehensive Knowledge Enhanced Prompt Learning for Domain Adaptive Few-shot Fake News Detection

Title: Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR

Title: Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals

Title: SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking

Title: Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

Title: RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

Title: MICL: Improving In-Context Learning through Multiple-Label Words in Demonstration

Title: Generating Tables from the Parametric Knowledge of Language Models

Title: E-Bench: Towards Evaluating the Ease-of-Use of Large Language Models

Title: Avoiding Copyright Infringement via Machine Unlearning

Title: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Title: DocNet: Semantic Structure in Inductive Bias Detection Models

Title: Toward Optimal LLM Alignments Using Two-Player Games

Title: Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens

Title: Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers

Title: THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation

Title: Not All Bias is Bad: Balancing Rational Deviations and Cognitive Biases in Large Language Model Reasoning

Title: Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game

Title: RUPBench: Benchmarking Reasoning Under Perturbations for Robustness Evaluation in Large Language Models

Title: FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Title: Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars

Title: garak: A Framework for Security Probing Large Language Models

Title: Evaluating the Performance of Large Language Models via Debates

Title: A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners

Title: Can LLMs Understand the Implication of Emphasized Sentences in Dialogue?

Title: Exploring the Limitations of Detecting Machine-Generated Text

Title: Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing

Title: RAEmoLLM: Retrieval Augmented LLMs for Cross-Domain Misinformation Detection Using In-Context Learning based on Emotional Information

Title: The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models

Title: InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models

Title: Grading Massive Open Online Courses Using Large Language Models

Title: From Intentions to Techniques: A Comprehensive Taxonomy and Challenges in Text Watermarking for Large Language Models

Title: Exploring Safety-Utility Trade-Offs in Personalized Language Models

Title: Investigating Annotator Bias in Large Language Models for Hate Speech Detection

Title: Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification

Title: Grammaticality Representation in ChatGPT as Compared to Linguists and Laypeople

Title: Dynamic Order Template Prediction for Generative Aspect-Based Sentiment Analysis

Title: Are Large Language Models a Good Replacement of Taxonomies?

Title: RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents

Title: Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

Title: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory

Title: How Good are LLMs at Relation Extraction under Low-Resource Scenario? Comprehensive Evaluation

Title: Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement

Title: TIFG: Text-Informed Feature Generation with Large Language Models

Title: Aligning Large Language Models from Self-Reference AI Feedback with one General Principle

Title: A Survey on Human Preference Learning for Large Language Models

Title: Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition

Title: MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model

Title: In-Context Editing: Learning Knowledge from Self-Induced Distributions

Title: Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models

Title: Global Data Constraints: Ethical and Effectiveness Challenges in Large Language Model

Title: Building another Spanish dictionary, this time with GPT-4

Title: ComperDial: Commonsense Persona-grounded Dialogue Dataset and Benchmark

Title: MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction

Title: What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling

Title: Evading AI-Generated Content Detectors using Homoglyphs

Title: FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation

Title: Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs

Title: Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization

Title: Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection

Title: The Fall of ROME: Understanding the Collapse of LLMs in Model Editing

Title: Mitigating Large Language Model Hallucination with Faithful Finetuning

Title: Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers

Title: Self-training Large Language Models through Knowledge Detection

Title: Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector

Title: Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs

Title: MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models

Title: A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models

Title: Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams

Title: A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences

Title: Full-ECE: A Metric For Token-level Calibration on Large Language Models

Title: Preserving Knowledge in Large Language Model: A Model-Agnostic Self-Decompression Approach

Title: $\textit{Refiner}$: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities

Title: Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments

Title: Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models?

Title: A Realistic Evaluation of LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3

Title: MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic

Title: Large Language Models and Knowledge Graphs for Astronomical Entity Disambiguation

Title: Evaluating Open Language Models Across Task Types, Application Domains, and Reasoning Types: An In-Depth Experimental Analysis

Title: HARE: HumAn pRiors, a key to small language model Efficiency

Title: BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM

Title: A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression

Title: Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization

Title: Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction

Title: TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation

Title: Automating Easy Read Text Segmentation

Title: Promises, Outlooks and Challenges of Diffusion Language Modeling

Title: How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment

Title: Vocabulary Expansion for Low-resource Cross-lingual Transfer

Title: Analysing zero-shot temporal relation extraction on clinical notes using temporal consistency

Title: CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG

Title: Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs

Title: Input Conditioned Graph Generation for Language Agents

Title: Extrinsic Evaluation of Cultural Competence in Large Language Models

Title: MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation

Title: Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

Title: Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces

Title: Building Knowledge-Guided Lexica to Model Cultural Variation

Title: Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

Title: The Base-Rate Effect on LLM Benchmark Performance: Disambiguating Test-Taking Strategies from Benchmark Performance

Title: A Two-dimensional Zero-shot Dialogue State Tracking Evaluation Method using GPT-4

Title: Ruby Teaming: Improving Quality Diversity Search with Memory for Automated Red Teaming

Title: Can LLM be a Personalized Judge?

Title: Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting

Title: See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding

Title: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jailbreak

Title: Benchmarking of LLM Detection: Comparing Two Competing Approaches

Title: Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference

Title: R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models

Title: Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack

Title: HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

Title: Tokenization Falling Short: The Curse of Tokenization

Title: Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

Title: Meta Reasoning for Large Language Models

Title: Nemotron-4 340B Technical Report

Title: Instruct, Not Assist: LLM-based Multi-Turn Planning and Hierarchical Questioning for Socratic Code Debugging

Title: Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity

Title: Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Title: A Semantic-based Layer Freezing Approach to Efficient Fine-Tuning of Language Models

Title: Improving Multi-Agent Debate with Sparse Communication Topology

Title: MDCR: A Dataset for Multi-Document Conditional Reasoning

Title: CELL your Model: Contrastive Explanation Methods for Large Language Models

Title: Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

Title: RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Title: How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Title: Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Title: WPO: Enhancing RLHF with Weighted Preference Optimization

Title: Language Modeling with Editable External Knowledge