2026-02-16

Title: A Lightweight LLM Framework for Disaster Humanitarian Information Classification

Title: From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness

Title: Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

Title: Grandes Modelos de Linguagem Multimodais (MLLMs): Da Teoria à Prática [Multimodal Large Language Models (MLLMs): From Theory to Practice]

Title: propella-1: Multi-Property Document Annotation for LLM Data Curation at Scale

Title: RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

Title: RBCorr: Response Bias Correction in Language Models

Title: Unleashing Low-Bit Inference on Ascend NPUs: A Comprehensive Evaluation of HiFloat Formats

Title: CLASE: A Hybrid Method for Chinese Legalese Stylistic Evaluation

Title: Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

Title: Learning Ordinal Probabilistic Reward from Preferences

Title: $\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models

Title: MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

Title: ReFilter: Improving Robustness of Retrieval-Augmented Generation via Gated Filter

Title: RAT-Bench: A Comprehensive Benchmark for Text Anonymization

Title: Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence

Title: AIWizards at MULTIPRIDE: A Hierarchical Approach to Slur Reclamation Detection

Title: MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models

Title: BaziQA-Benchmark: Evaluating Symbolic and Temporally Compositional Reasoning in Large Language Models

Title: When Words Don't Mean What They Say: Figurative Understanding in Bengali Idioms

Title: Curriculum Learning and Pseudo-Labeling Improve the Generalization of Multi-Label Arabic Dialect Identification Models

Title: ProbeLLM: Automating Principled Diagnosis of LLM Failures

Title: SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents

Title: Evaluating the Homogeneity of Keyphrase Prediction Models

Title: Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

Title: TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution

Title: Exploring a New Competency Modeling Process with Large Language Models

Title: SCOPE: Selective Conformal Optimized Pairwise LLM Judging

Title: Semantic Chunking and the Entropy of Natural Language