2024-10-04

Title: CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations

Title: SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics

Title: TypedThinker: Typed Thinking Improves Large Language Model Reasoning

Title: Generate then Refine: Data Augmentation for Zero-shot Intent Detection

Title: How Reliable Is Human Feedback For Aligning Large Language Models?

Title: Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Title: Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning

Title: RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Title: Racing Thoughts: Explaining Large Language Model Contextualization Errors

Title: ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

Title: L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?

Title: Controlled Generation of Natural Adversarial Documents for Stealthy Retrieval Poisoning

Title: POSIX: A Prompt Sensitivity Index For Large Language Models

Title: Can Language Models Take A Hint? Prompting for Controllable Contextualized Commonsense Inference

Title: Measuring, Evaluating and Improving Logical Consistency in Large Language Models

Title: Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference

Title: EmbedLLM: Learning Compact Representations of Large Language Models

Title: Morphological evaluation of subwords vocabulary used by BETO language model

Title: Correlation and Navigation in the Vocabulary Key Representation Space of Language Models

Title: Language Models are Graph Learners

Title: Traffic Light or Light Traffic? Investigating Phrasal Semantics in Large Language Models

Title: Post-edits Are Preferences Too

Title: Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection

Title: How Much Can RAG Help the Reasoning of LLM?

Title: Listening to the Wise Few: Select-and-Copy Attention Heads for Multiple-Choice QA

Title: AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Title: From Concrete to Abstract: A Multimodal Generative Approach to Abstract Concept Learning

Title: Towards Comprehensive Detection of Chinese Harmful Memes

Title: Learning the Latent Rules of a Game from Data: A Chess Story

Title: Collective Critics for Creative Story Generation

Title: Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization

Title: Response Tuning: Aligning Large Language Models without Instruction

Title: Defining Knowledge: Bridging Epistemology and Large Language Models

Title: Mixed-Session Conversation with Egocentric Memory

Title: Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions

Title: Agents' Room: Narrative Generation through Multi-step Collaboration

Title: Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning

Title: Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers

Title: Undesirable Memorization in Large Language Models: A Survey

Title: Measuring and Improving Persuasiveness of Generative Models

Title: Hate Personified: Investigating the role of LLMs in content moderation

Title: How to Train Long-Context Language Models (Effectively)

Title: Examining Language Modeling Assumptions Using an Annotated Literary Dialect Corpus

Title: CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Title: Distilling an End-to-End Voice Assistant Without Instruction Training Data

Title: DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Title: HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router

Title: On the Proper Treatment of Tokenization in Psycholinguistics

Title: HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

Title: Selective Attention Improves Transformer

Title: LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Title: UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation

Title: Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization

Title: Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation

Title: Unified Multi-Modal Interleaved Document Representation for Information Retrieval

Title: Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Title: Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

Title: Grounding Large Language Models In Embodied Environment With Imperfect World Models

Title: MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Title: Neutral residues: revisiting adapters for model extension

Title: CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation

Title: SIEVE: General Purpose Data Filtering System Matching GPT-4o Accuracy at 1% the Cost

Title: Erasing Conceptual Knowledge from Language Models