2024-10-25

Title: Analyzing Nobel Prize Literature with Large Language Models

Title: Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation

Title: Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction

Title: Gazelle: An Instruction Dataset for Arabic Writing Assistance

Title: CorrectionLM: Self-Corrections with SLM for Dialogue State Tracking

Title: Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

Title: Generalizations across filler-gap dependencies in neural language models

Title: Multilingual Hallucination Gaps in Large Language Models

Title: LEGO: Language Model Building Blocks

Title: Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

Title: Aggregated Knowledge Model: Enhancing Domain-Specific QA with Fine-Tuned and Retrieval-Augmented Generation Models

Title: AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability

Title: Improving Model Factuality with Fine-grained Critique-based Evaluator

Title: MoMQ: Mixture-of-Experts Enhances Multi-Dialect Query Generation across Relational and Non-Relational Databases

Title: Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains

Title: Large Language Models Reflect the Ideology of their Creators

Title: Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching

Title: ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis

Title: Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

Title: Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction

Title: ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models

Title: CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Title: A Systematic Survey on Instructional Text: From Representation and Downstream NLP Tasks

Title: LOGO -- Long cOntext aliGnment via efficient preference Optimization

Title: Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Title: Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation

Title: Taipan: Efficient and Expressive State Space Language Models with Selective Attention

Title: Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization

Title: Little Giants: Synthesizing High-Quality Embedding Data at Scale

Title: Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model

Title: Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework

Title: Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch

Title: How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs

Title: GrammaMT: Improving Machine Translation with Grammar-Informed In-Context Learning

Title: Why Does the Effective Context Length of LLMs Fall Short?

Title: Does Differential Privacy Impact Bias in Pretrained NLP Models?

Title: Task Calibration: Calibrating Large Language Models on Inference Tasks

Title: Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Title: Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Title: From Imitation to Introspection: Probing Self-Consciousness in Language Models

Title: From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages

Title: DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations

Title: Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance

Title: LLMs for Extremely Low-Resource Finno-Ugric Languages

Title: PRISM: A Methodology for Auditing Biases in Large Language Models

Title: From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems

Title: Dynamic Vocabulary Pruning in Early-Exit LLMs

Title: BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning

Title: Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code

Title: Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions