2026-02-20

Title: References Improve LLM Alignment in Non-Verifiable Domains

Title: Evaluating Monolingual and Multilingual Large Language Models for Greek Question Answering: The DemosQA Benchmark

Title: One-step Language Modeling via Continuous Denoising

Title: Claim Automation using Large Language Model

Title: BanglaSummEval: Reference-Free Factual Consistency Evaluation for Bangla Summarization

Title: Meenz bleibt Meenz, but Large Language Models Do Not Speak Its Dialect

Title: ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders

Title: Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History

Title: ReIn: Conversational Error Recovery with Reasoning Inception

Title: Large Language Models Persuade Without Planning Theory of Mind

Title: BankMathBench: A Benchmark for Numerical Reasoning in Banking Scenarios

Title: The Emergence of Lab-Driven Alignment Signatures: A Psychometric Framework for Auditing Latent Bias and Compounding Risk in Generative AI

Title: Quantifying and Mitigating Socially Desirable Responding in LLMs: A Desirability-Matched Graded Forced-Choice Psychometric Study

Title: Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective

Title: Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation

Title: RPDR: A Round-trip Prediction-Based Data Augmentation Framework for Long-Tail Question Answering

Title: The Role of the Availability Heuristic in Multiple-Choice Answering Behaviour

Title: Evaluating Extremely Low-Resource Machine Translation: A Comparative Study of ChrF++ and BLEU Metrics

Title: Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study

Title: AIDG: Evaluating Asymmetry Between Information Extraction and Containment in Multi-Turn Dialogue

Title: ABCD: All Biases Come Disguised

Title: Entropy-Based Data Selection for Language Models

Title: PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

Title: Small LLMs for Medical NLP: a Systematic Analysis of Few-Shot, Constraint Decoding, Fine-Tuning and Continual Pre-Training in Italian

Title: Bridging the Domain Divide: Supervised vs. Zero-Shot Clinical Section Segmentation from MIMIC-III to Obstetrics

Title: Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems

Title: Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning

Title: Modeling Distinct Human Interaction in Web Agents

Title: The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR$\rightarrow$LLM Pipelines?

Title: Unmasking the Factual-Conceptual Gap in Persian Language Models

Title: Differences in Typological Alignment in Language Models' Treatment of Differential Argument Marking

Title: What Language is This? Ask Your Tokenizer

Title: Sink-Aware Pruning for Diffusion Language Models