2026-03-13

Title: Speculative Decoding Scaling Laws (SDSL): Throughput Optimization Made Simple

Title: Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global Attention Reallocation

Title: DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

Title: MDER-DR: Multi-Hop Question Answering with Entity-Centric Summaries

Title: Markovian Generation Chains in Large Language Models

Title: Artificial Intelligence for Sentiment Analysis of Persian Poetry

Title: ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

Title: Temporal Text Classification with Large Language Models

Title: Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning

Title: Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects

Title: MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

Title: BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion

Title: LLM-Assisted Causal Structure Disambiguation and Factor Extraction for Legal Judgment Prediction

Title: Try, Check and Retry: A Divide-and-Conquer Framework for Boosting Long-context Tool-Calling Performance of LLMs

Title: Tiny Aya: Bridging Scale and Multilingual Depth

Title: Can Small Language Models Use What They Retrieve? An Empirical Study of Retrieval Utilization Across Model Scale

Title: One Supervisor, Many Modalities: Adaptive Tool Orchestration for Autonomous Queries

Title: Where Matters More Than What: Decoding-aligned KV Cache Compression via Position-aware Pseudo Queries

Title: UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization

Title: Performance Evaluation of Open-Source Large Language Models for Assisting Pathology Report Writing in Japanese

Title: QChunker: Learning Question-Aware Text Chunking for Domain RAG via Multi-Agent Debate

Title: Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge

Title: In the LLM era, Word Sense Induction remains unsolved

Title: SemBench: A Universal Semantic Framework for LLM Evaluation

Title: Compression Favors Consistency, Not Truth: When and Why Language Models Prefer Correct Information

Title: Legal-DC: Benchmarking Retrieval-Augmented Generation for Legal Documents

Title: Large Language Models for Biomedical Article Classification

Title: DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining

Title: Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language

Title: CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks?

Title: PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents

Title: CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading

Title: BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs

Title: Translationese as a Rational Response to Translation Task Difficulty

Title: To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times

Title: SommBench: Assessing Sommelier Expertise of Language Models

Title: Cross-Context Review: Improving LLM Output Quality by Separating Production and Review Sessions

Title: LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation

Title: QAQ: Bidirectional Semantic Coherence for Selecting High-Quality Synthetic Code Instructions

Title: Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Title: Long-Context Encoder Models for Polish Language Understanding

Title: IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Title: CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks

Title: Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration