2025-11-20

Title: Test-time Scaling of LLMs: A Survey from A Subproblem Structure Perspective

Title: Temporal Predictors of Outcome in Reasoning Language Models

Title: LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs

Title: COMPASS: Context-Modulated PID Attention Steering System for Hallucination Mitigation

Title: Human or LLM as Standardized Patients? A Comparative Study for Medical Education

Title: Hierarchical Token Prepending: Enhancing Information Flow in Decoder-based LLM Embeddings

Title: Mathematical Analysis of Hallucination Dynamics in Large Language Models: Uncertainty Quantification, Advanced Decoding, and Principled Mitigation

Title: Teaching According to Students' Aptitude: Personalized Mathematics Tutoring via Persona-, Memory-, and Forgetting-Aware LLMs

Title: HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples

Title: Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Title: OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition

Title: Context Cascade Compression: Exploring the Upper Limits of Text Compression

Title: IndicGEC: Powerful Models, or a Measurement Mirage?

Title: Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models

Title: HEAD-QA v2: Expanding a Healthcare Benchmark for Reasoning

Title: The Empowerment of Science of Science by Large Language Models: New Tools and Methods

Title: A Compliance-Preserving Retrieval System for Aircraft MRO Task Search

Title: DEPO: Dual-Efficiency Preference Optimization for LLM Agents

Title: NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework

Title: LLM-MemCluster: Empowering Large Language Models with Dynamic Memory for Text Clustering

Title: Standardising the NLP Workflow: A Framework for Reproducible Linguistic Analysis

Title: Multimodal Evaluation of Russian-language Architectures

Title: HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning