2024-10-08

Title: Revisiting the Superficial Alignment Hypothesis

Title: Performance Evaluation of Tokenizers in Large Language Models for the Assamese Language

Title: Thematic Analysis with Open-Source Generative AI and Machine Learning: A New Method for Inductive Qualitative Codebook Development

Title: Realtime, multimodal invasive ventilation risk monitoring using language models and BoXHED

Title: Neurosymbolic AI approach to Attribution in Large Language Models

Title: FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

Title: Progress Report: Towards European LLMs

Title: Unsupervised Human Preference Learning

Title: Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling

Title: ERASMO: Leveraging Large Language Models for Enhanced Clustering Segmentation

Title: Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model

Title: Beyond Scalar Reward Model: Learning Generative Judge from Preference Data

Title: Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging

Title: Khattat: Enhancing Readability and Concept Representation of Semantic Typography

Title: Recent Advances in Speech Language Models: A Survey

Title: Enhancing Retrieval in QA Systems with Derived Feature Association

Title: HiReview: Hierarchical Taxonomy-Driven Automatic Literature Review Generation

Title: Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Title: Reasoning Elicitation in Language Models via Counterfactual Feedback

Title: Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs

Title: SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks

Title: A Two-Stage Proactive Dialogue Generator for Efficient Clinical Information Collection Using Large Language Model

Title: Precision Knowledge Editing: Enhancing Safety in Large Language Models

Title: Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling

Title: Reward-RAG: Enhancing RAG with Reward Driven Supervision

Title: Searching for Best Practices in Medical Transcription with Large Language Model

Title: Self-Powered LLM Modality Expansion for Large Speech-Text Models

Title: Mixture of Attentions For Speculative Decoding

Title: Misinformation with Legal Consequences (MisLC): A New Task Towards Harnessing Societal Harm of Misinformation

Title: ORAssistant: A Custom RAG-based Conversational Assistant for OpenROAD

Title: Using Prompts to Guide Large Language Models in Imitating a Real Person's Language Style

Title: Detecting Machine-Generated Long-Form Content with Latent-Space Variables

Title: You Know What I'm Saying -- Jailbreak Attack via Implicit Reference

Title: SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Title: Can Language Models Reason about Individualistic Human Values and Preferences?

Title: Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step

Title: KidLM: Advancing Language Models for Children -- Early Insights and Future Directions

Title: PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

Title: ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities

Title: Still Not Quite There! Evaluating Large Language Models for Comorbid Mental Health Diagnosis

Title: Structured List-Grounded Question Answering

Title: LLM-TOPLA: Efficient LLM Ensemble by Maximising Diversity

Title: Grounding Language in Multi-Perspective Referential Communication

Title: On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models

Title: Take It Easy: Label-Adaptive Self-Rationalization for Fact Verification and Explanation Generation

Title: A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models

Title: SyllableLM: Learning Coarse Semantic Units for Speech Language Models

Title: Neuron-Level Sequential Editing for Large Language Models

Title: Large Language Models can Achieve Social Balance

Title: Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks

Title: LoRTA: Low Rank Tensor Adaptation of Large Language Models

Title: ECon: On the Detection and Resolution of Evidence Conflicts

Title: PAD: Personalized Alignment at Decoding-Time

Title: On Eliciting Syntax from Language Models via Hashing

Title: GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization

Title: BloomWise: Enhancing Problem-Solving capabilities of Large Language Models using Bloom's-Taxonomy-Inspired Prompts

Title: A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models

Title: Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Title: From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression

Title: Toxic Subword Pruning for Dialogue Response Generation on Large Language Models

Title: DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech

Title: Consistent Autoformalization for Constructing Mathematical Libraries

Title: CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints

Title: LongGenBench: Long-context Generation Benchmark

Title: Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension

Title: Persona Knowledge-Aligned Prompt Tuning Method for Online Debate

Title: Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations

Title: Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia

Title: AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text

Title: RoQLlama: A Lightweight Romanian Adapted Language Model

Title: Evaluating Language Model Character Traits

Title: Mechanistic Behavior Editing of Language Models

Title: Calibrating Expressions of Certainty

Title: ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model

Title: Inference Scaling for Long-Context Retrieval Augmented Generation

Title: Ordinal Preference Optimization: Aligning Human Preferences via NDCG

Title: TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Title: Lens: Rethinking Multilingual Enhancement for Large Language Models

Title: Hyper-multi-step: The Truth Behind Difficult Long-context Tasks

Title: DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Title: MindScope: Exploring cognitive biases in large language models through Multi-Agent Systems

Title: CopyLens: Dynamically Flagging Copyrighted Sub-Dataset Contributions to LLM Outputs

Title: SWEb: A Large Web Dataset for the Scandinavian Languages

Title: Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information

Title: Revisiting In-context Learning Inference Circuit in Large Language Models

Title: Collapsed Language Models Promote Fairness

Title: Fine-Grained Prediction of Reading Comprehension from Eye Movements

Title: Leveraging Large Language Models for Suicide Detection on Social Media with Limited Labels

Title: ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

Title: DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination

Title: RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference

Title: Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-Tuning

Title: FAMMA: A Benchmark for Financial Domain Multilingual Multimodal Question Answering

Title: How Does the Disclosure of AI Assistance Affect the Perceptions of Writing?

Title: Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets

Title: Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval

Title: ProtocoLLM: Automatic Evaluation Framework of LLMs on Domain-Specific Scientific Protocol Formulation Tasks

Title: LRQ-Fact: LLM-Generated Relevant Questions for Multimodal Fact-Checking

Title: Evaluation of Code LLMs on Geospatial Code Generation

Title: Control Large Language Models via Divide and Conquer

Title: Contrastive Learning to Improve Retrieval for Real-world Fact Checking

Title: Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates

Title: MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Title: The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead?

Title: Rule-based Data Selection for Large Language Models

Title: $\textbf{Only-IF}$:Revealing the Decisive Effect of Instruction Diversity on Generalization

Title: Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models

Title: Efficient transformer with reinforced position embedding for language models

Title: TableRAG: Million-Token Table Understanding with Language Models

Title: Document-level Causal Relation Extraction with Knowledge-guided Binary Question Answering

Title: Formality is Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge

Title: GARLIC: LLM-Guided Dynamic Progress Control with Hierarchical Weighted Graph for Long Document QA

Title: Representing the Under-Represented: Cultural and Core Capability Benchmarks for Developing Thai Large Language Models

Title: LPZero: Language Model Zero-cost Proxy Search from Zero

Title: MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Title: As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative Feedback Loss

Title: Rationale-Aware Answer Verification by Pairwise Self-Evaluation

Title: Intent Classification for Bank Chatbots through LLM Fine-Tuning

Title: Activation Scaling for Steering and Interpreting Language Models

Title: SkillMatch: Evaluating Self-supervised Learning of Skill Relatedness

Title: Named Clinical Entity Recognition Benchmark

Title: A test suite of prompt injection attacks for LLM-based machine translation

Title: Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes

Title: ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering

Title: ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Title: Explanation sensitivity to the randomness of large language models: the case of journalistic text classification

Title: Investigating large language models for their competence in extracting grammatically sound sentences from transcribed noisy utterances

Title: SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Title: Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models

Title: ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation

Title: Enhancing Equity in Large Language Models for Medical Applications

Title: RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Title: Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates

Title: SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Title: Causal Micro-Narratives

Title: GLEE: A Unified Framework and Benchmark for Language-based Economic Environments

Title: Differential Transformer

Title: TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles

Title: Grounding Partially-Defined Events in Multimodal Data

Title: Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models