2026-03-27

Title: When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews

Title: Demystifying When Pruning Works via Representation Hierarchies

Title: Fine-Tuning A Large Language Model for Systematic Review Screening

Title: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

Title: Synthetic Rewriting as a Quality Multiplier: Evidence from Portuguese Continued Pretraining

Title: Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR

Title: Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

Title: Toward domain-specific machine translation and quality estimation systems

Title: LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems

Title: Exons-Detect: Identifying and Amplifying Exonic Tokens via Hidden-State Discrepancy for Robust AI-Generated Text Detection

Title: Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

Title: Approaches to Analysing Historical Newspapers Using LLMs

Title: Closing the Confidence-Faithfulness Gap in Large Language Models

Title: OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs

Title: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

Title: To Write or to Automate Linguistic Prompts, That Is the Question

Title: Prompt Attack Detection with LLM-as-a-Judge and Mixture-of-Models

Title: Probing the Lack of Stable Internal Beliefs in LLMs

Title: A Decade-Scale Benchmark Evaluating LLMs' Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations

Title: SafeMath: Inference-time Safety improves Math Accuracy

Title: Comparing Natural and Synthetic Structured Data: A Study of the Passive Verb Alternation in French and Italian

Title: MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation

Title: CRAFT: Grounded Multi-Agent Coordination Under Partial Information

Title: When Hate Meets Facts: LLMs-in-the-Loop for Check-worthiness Detection in Hate Speech

Title: Separate Before You Compress: The WWHO Tokenization Architecture

Title: Beyond Detection: Rethinking Education in the Age of AI-writing

Title: Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

Title: Large Language Model as Token Compressor and Decompressor

Title: TAPO: Translation Augmented Policy Optimization for Multilingual Mathematical Reasoning

Title: Navigating the Prompt Space: Improving LLM Classification of Social Science Texts Through Prompt Engineering

Title: Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties

Title: An Experimental Comparison of the Most Popular Approaches to Fake News Detection

Title: Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence

Title: PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency

Title: Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

Title: Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

Title: Self-Improvement of Large Language Models: A Technical Overview and Future Outlook

Title: S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation

Title: Natural-Language Agent Harnesses