2024-10-17

Title: The Fair Language Model Paradox

Title: DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models

Title: Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data

Title: Impacts of Continued Legal Pre-Training and IFT on LLMs' Latent Representations of Human-Defined Legal Concepts

Title: Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option

Title: Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models

Title: MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router

Title: On Classification with Large Language Models in Cultural Analytics

Title: Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction

Title: Boosting Logical Fallacy Reasoning in LLMs via Logical Structure Tree

Title: Sabi\'a-3 Technical Report

Title: Skill-LLM: Repurposing General-Purpose LLMs for Skill Extraction

Title: Large-scale cloze evaluation reveals that token prediction tasks are neither lexically nor semantically aligned

Title: LegalLens Shared Task 2024: Legal Violation Identification in Unstructured Text

Title: De-jargonizing Science for Journalists with GPT-4: A Pilot Study

Title: OMCAT: Omni Context Aware Transformer

Title: Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning

Title: Layer-of-Thoughts Prompting (LoT): Leveraging LLM-Based Retrieval with Constraint Hierarchies

Title: Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval

Title: Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning

Title: Exploring Large Language Models for Hate Speech Detection in Rioplatense Spanish

Title: Negative-Prompt-driven Alignment for Generative Language Model

Title: On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation

Title: EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference

Title: CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data Diversity

Title: An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation

Title: Kallini et al. (2024) do not compare impossible languages with constituency-based ones

Title: How much do contextualized representations encode long-range context?

Title: Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs

Title: Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors

Title: Open Domain Question Answering with Conflicting Contexts

Title: Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up

Title: Optimizing Low-Resource Language Model Training: Comprehensive Analysis of Multi-Epoch, Multi-Lingual, and Two-Stage Approaches

Title: Neuron-based Personality Trait Induction in Large Language Models

Title: Understanding the Role of LLMs in Multimodal Evaluation Benchmarks

Title: A linguistic analysis of undesirable outcomes in the era of generative AI

Title: HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims

Title: Evaluation of Attribution Bias in Retrieval-Augmented Large Language Models

Title: Prompt Compression for Large Language Models: A Survey

Title: Tracking Universal Features Through Fine-Tuning and Model Merging

Title: ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Title: Conformity in Large Language Models

Title: Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models

Title: Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs

Title: The Best of Both Worlds: Bridging Quality and Diversity in Data Selection with Bipartite Graph

Title: Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention

Title: Learning to Predict Usage Options of Product Reviews with LLM-Generated Labels

Title: Retrieval-Reasoning Large Language Model-based Synthetic Clinical Trial Generation

Title: MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models

Title: KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs

Title: Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL

Title: End-to-end Planner Training for Language Modeling

Title: With a Grain of SALT: Are LLMs Fair Across Social Dimensions?

Title: FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction

Title: MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration

Title: LLM-based Translation Inference with Iterative Bilingual Understanding

Title: A Claim Decomposition Benchmark for Long-form Answer Verification

Title: STRUX: An LLM for Decision-Making with Structured Explanations

Title: Can We Reverse In-Context Knowledge Edits?

Title: On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs

Title: CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Title: Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning

Title: Exploring Model Kinship for Merging Large Language Models

Title: Weak-to-Strong Generalization beyond Accuracy: a Pilot Study in Safety, Toxicity, and Legal Reasoning

Title: Evaluating Morphological Compositional Generalization in Large Language Models

Title: WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Title: WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation

Title: StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

Title: Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information

Title: Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception