2026-03-18

Title: Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

Title: MedArena: Comparing LLMs for Medicine-in-the-Wild Clinician Preferences

Title: MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Title: Morphemes Without Borders: Evaluating Root-Pattern Morphology in Arabic Tokenizers and LLMs

Title: COGNAC at SemEval-2026 Task 5: LLM Ensembles for Human-Level Word Sense Plausibility Rating in Challenging Narratives

Title: Agent-based imitation dynamics can yield efficiently compressed population-level vocabularies

Title: BANGLASOCIALBENCH: A Benchmark for Evaluating Sociopragmatic and Cultural Alignment of LLMs in Bangladeshi Social Interaction

Title: POLAR:A Per-User Association Test in Embedding Space

Title: A Family of LLMs Liberated from Static Vocabularies

Title: Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning

Title: RadAnnotate: Large Language Models for Efficient and Reliable Radiology Report Annotation

Title: Understanding Moral Reasoning Trajectories in Large Language Models: Toward Probing-Based Explainability

Title: SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia

Title: ClaimFlow: Tracing the Evolution of Scientific Claims in NLP

Title: CounterRefine: Answer-Conditioned Counterevidence Retrieval for Inference-Time Knowledge Repair in Factual Question Answering

Title: Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization

Title: ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning

Title: Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users

Title: Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning

Title: Social Simulacra in the Wild: AI Agent Communities on Moltbook

Title: SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era

Title: SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment

Title: Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Title: Structured Semantic Cloaking for Jailbreak Attacks on Large Language Models

Title: SpecSteer: Synergizing Local Context and Global Reasoning for Efficient Personalized Generation

Title: More Rounds, More Noise: Why Multi-Turn Review Fails to Improve Cross-Context Verification

Title: Attention-guided Evidence Grounding for Spoken Question Answering

Title: Omnilingual MT: Machine Translation for 1,600 Languages

Title: PashtoCorp: A 1.25-Billion-Word Corpus, Evaluation Suite, and Reproducible Pipeline for Low-Resource Language Development

Title: Fanar 2.0: Arabic Generative AI Stack

Title: Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic

Title: PlotTwist: A Creative Plot Generation Framework with Small Language Models

Title: RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery

Title: IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time

Title: EngGPT2: Sovereign, Efficient and Open Intelligence

Title: VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization

Title: DynHD: Hallucination Detection for Diffusion Large Language Models via Denoising Dynamics Deviation Learning

Title: On the Emotion Understanding of Synthesized Speech

Title: AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

Title: How often do Answers Change? Estimating Recency Requirements in Question Answering

Title: DanceHA: A Multi-Agent Framework for Document-Level Aspect-Based Sentiment Analysis

Title: EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models

Title: Characterizing Delusional Spirals through Human-LLM Chat Logs

Title: Diverging Transformer Predictions for Human Sentence Processing: A Comprehensive Analysis of Agreement Attraction Effects

Title: BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization

Title: Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech

Title: Domain Mixture Design via Log-Likelihood Differences for Aligning Language Models with a Target Model

Title: Good Arguments Against the People Pleasers: How Reasoning Mitigates (Yet Masks) LLM Sycophancy

Title: Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models

Title: Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings?

Title: Arabic Morphosyntactic Tagging and Dependency Parsing with Large Language Models

Title: Probing Cultural Signals in Large Language Models through Author Profiling

Title: TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities

Title: SpokenUS: A Spoken User Simulator for Task-Oriented Dialogue

Title: Mediocrity is the key for LLM as a Judge Anchor Selection

Title: Online Experiential Learning for Language Models

Title: Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory