2026-02-17

Title: LLM-Powered Automatic Translation and Urgency in Crisis Scenarios

Title: Language Model Memory and Memory Models for Language

Title: From Perceptions To Evidence: Detecting AI-Generated Content In Turkish News Media With A Fine-Tuned Bert Classifier

Title: Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens

Title: On Calibration of Large Language Models: From Response To Capability

Title: Small Reward Models via Backward Inference

Title: DistillLens: Symmetric Knowledge Distillation Through Logit Lens

Title: LLM-Confidence Reranker: A Training-Free Approach for Enhancing Retrieval-Augmented Generation Systems

Title: Elo-Evolve: A Co-evolutionary Framework for Language Model Alignment

Title: On Theoretically-Driven LLM Agents for Multi-Dimensional Discourse Analysis

Title: RMPL: Relation-aware Multi-task Progressive Learning with Stage-wise Training for Multimedia Event Extraction

Title: OMGs: A multi-agent system supporting MDT decision-making across the ovarian tumour care continuum

Title: Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind

Title: Speculative Decoding with a Speculative Vocabulary

Title: PrivAct: Internalizing Contextual Privacy Preservation via Multi-Agent Preference Training

Title: Tutoring Large Language Models to be Domain-adaptive, Precise, and Safe

Title: Bridging the Multilingual Safety Divide: Efficient, Culturally-Aware Alignment for Global South Languages

Title: ADAB: Arabic Dataset for Automated Politeness Benchmarking -- A Large-Scale Resource for Computational Sociopragmatics

Title: Evaluating Prompt Engineering Techniques for RAG in Small Language Models: A Multi-Hop QA Approach

Title: HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

Title: Chain-of-Thought Reasoning with Large Language Models for Clinical Alzheimer's Disease Assessment and Diagnosis

Title: The Sufficiency-Conciseness Trade-off in LLM Self-Explanation from an Information Bottleneck Perspective

Title: GRRM: Group Relative Reward Modeling for Machine Translation

Title: Context Shapes LLMs Retrieval-Augmented Fact-Checking Effectiveness

Title: LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation

Title: LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Title: From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset

Title: Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric

Title: Annotation-Efficient Vision-Language Model Adaptation to the Polish Language Using the LLaVA Framework

Title: Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality

Title: CCiV: A Benchmark for Structure, Rhythm and Quality in LLM-Generated Chinese \textit{Ci} Poetry

Title: A Multi-Agent Framework for Medical AI: Leveraging Fine-Tuned GPT, LLaMA, and DeepSeek R1 for Evidence-Based and Bias-Aware Clinical Query Processing

Title: Index Light, Reason Deep: Deferred Visual Ingestion for Visual-Dense Document Question Answering

Title: GPT-5 vs Other LLMs in Long Short-Context Performance

Title: Knowing When Not to Answer: Abstention-Aware Scientific Reasoning

Title: AD-Bench: A Real-World, Trajectory-Aware Advertising Analytics Benchmark for LLM Agents

Title: Detecting LLM Hallucinations via Embedding Cluster Geometry: A Three-Type Taxonomy with Measurable Signatures

Title: STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts

Title: Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook

Title: InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem

Title: Beyond Token-Level Policy Gradients for Complex Reasoning with Large Language Models

Title: TruthStance: An Annotated Dataset of Conversations on Truth Social

Title: WavePhaseNet: A DFT-Based Method for Constructing Semantic Conceptual Hierarchy Structures (SCHS)

Title: LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning

Title: Robust Bias Evaluation with FilBBQ: A Filipino Bias Benchmark for Question-Answering Language Models

Title: Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation

Title: HyperRAG: Reasoning N-ary Facts over Hypergraphs for Retrieval Augmented Generation

Title: BETA-Labeling for Multilingual Dataset Construction in Low-Resource IR

Title: Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

Title: Beyond Translation: Evaluating Mathematical Reasoning Capabilities of LLMs in Sinhala and Tamil

Title: Explainable Token-level Noise Filtering for LLM Fine-tuning Datasets

Title: Assessing Large Language Models for Medical QA: Zero-Shot and LLM-as-a-Judge Evaluation

Title: The Wikidata Query Logs Dataset

Title: GradMAP: Faster Layer Pruning with Gradient Metric and Projection Compensation

Title: Is Information Density Uniform when Utterances are Grounded on Perception and Discourse?

Title: Crowdsourcing Piedmontese to Test LLMs on Non-Standard Orthography

Title: LLMStructBench: Benchmarking Large Language Model Structured Data Extraction

Title: Rethinking the Role of LLMs in Time Series Forecasting

Title: Cognitive networks reconstruct mindsets about STEM subjects and educational contexts in almost 1000 high-schoolers, University students and LLM-based digital twins

Title: Residual Connections and the Causal Shift: Uncovering a Structural Misalignment in Transformers

Title: Unlocking Reasoning Capability on Machine Translation in Large Language Models

Title: Multi-Agent Comedy Club: Investigating Community Discussion Effects on LLM Humor Generation

Title: Emergently Misaligned Language Models Show Behavioral Self-Awareness That Shifts With Subsequent Realignment

Title: A Geometric Analysis of Small-sized Language Model Hallucinations

Title: Overthinking Loops in Agents: A Structural Risk via MCP Tools

Title: Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

Title: Testimole-Conversational: A 30-Billion-Word Italian Discussion Board Corpus (1996-2024) for Language Modeling and Sociolinguistic Research

Title: Tool-Aware Planning in Contact Center AI: Evaluating LLMs through Lineage-Guided Query Decomposition

Title: Counterfactual Fairness Evaluation of LLM-Based Contact Center Agent Quality Assurance System

Title: Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation

Title: Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation