2026-02-12

Title: Reviewing the Reviewer: Elevating Peer Review Quality through LLM-Guided Feedback

Title: Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens

Title: Learning to Evict from Key-Value Cache

Title: On Emergent Social World Models -- Evidence for Functional Integration of Theory of Mind and Pragmatic Reasoning in Language Models

Title: Are More Tokens Rational? Inference-Time Scaling in Language Models as Adaptive Resource Rationality

Title: Geometry-Aware Decoding with Wasserstein-Regularized Truncation and Mass Penalties for Large Language Models

Title: Learning Self-Interpretation from Interpretability Artifacts: Training Lightweight Adapters on Vector-Label Pairs

Title: Physically Interpretable AlphaEarth Foundation Model Embeddings Enable LLM-Based Land Surface Intelligence

Title: Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Title: Triggers Hijack Language Circuits: A Mechanistic Analysis of Backdoor Behaviors in Large Language Models

Title: When Tables Go Crazy: Evaluating Multimodal Models on French Financial Documents

Title: Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Title: LATA: A Tool for LLM-Assisted Translation Annotation

Title: Neuro-Symbolic Synergy for Interactive World Modeling

Title: Canvas-of-Thought: Grounding Reasoning via Mutable Structured States

Title: On the Robustness of Knowledge Editing for Detoxification

Title: LHAW: Controllable Underspecification for Long-Horizon Tasks

Title: When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Title: Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Title: Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Title: How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Title: UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory

Title: Benchmarks Are Not That Out of Distribution: Word Overlap Predicts Performance

Title: Targeted Syntactic Evaluation of Language Models on Georgian Case Alignment

Title: Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents

Title: Macaron: Controlled, Human-Written Benchmark for Multilingual and Multicultural Reasoning via Template-Filling

Title: Reinforced Curriculum Pre-Alignment for Domain-Adaptive VLMs

Title: Deep Learning-based Method for Expressing Knowledge Boundary of Black-Box LLM

Title: Beyond Confidence: The Rhythms of Reasoning in Generative Models

Title: C-MOP: Integrating Momentum and Boundary-Aware Clustering for Enhanced Prompt Evolution

Title: Diagnosing Structural Failures in LLM-Based Evidence Extraction for Meta-Analysis

Title: The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems

Title: Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models

Title: Language Model Inversion through End-to-End Differentiation

Title: Embedding Inversion via Conditional Masked Diffusion Language Models

Title: SteuerLLM: Local specialized large language model for German tax law analysis

Title: DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Title: Can Large Language Models Make Everyone Happy?

Title: Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away

Title: TEGRA: Text Encoding With Graph and Retrieval Augmentation for Misinformation Detection

Title: Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning