2026-01-05

Title: RIMRULE: Improving Tool-Using Language Agents via MDL-Guided Rule Learning

Title: Universal Adaptive Constraint Propagation: Scaling Structured Inference for Large Language Models via Meta-Reinforcement Learning

Title: Pat-DEVAL: Chain-of-Legal-Thought Evaluation for Patent Description

Title: Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models

Title: From Evidence-Based Medicine to Knowledge Graph: Retrieval-Augmented Generation for Sports Rehabilitation and a Domain Benchmark

Title: JP-TL-Bench: Anchored Pairwise LLM Evaluation for Bidirectional Japanese-English Translation

Title: Talk Less, Verify More: Improving LLM Assistants with Semantic Checks and Execution Feedback

Title: Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation

Title: Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity

Title: Can Large Language Models Still Explain Themselves? Investigating the Impact of Quantization on Self-Explanations

Title: Robust Uncertainty Quantification for Factual Generation of Large Language Models

Title: The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining

Title: Vision-Language Reasoning for Geolocalization: A Reinforcement Learning Approach

Title: Do LLMs Judge Distantly Supervised Named Entity Labels Well? Constructing the JudgeWEL Dataset

Title: Toward Better Temporal Structures for Geopolitical Events Forecasting

Title: Language as Mathematical Structure: Examining Semantic Field Theory Against Language Games

Title: Defensive M2S: Training Guardrail Models on Compressed Multi-turn Conversations

Title: Rule-Based Approaches to Atomic Sentence Extraction

Title: Retrieval–Reasoning Processes for Multi-hop Question Answering: A Four-Axis Design Framework and Empirical Trends

Title: ECR: Manifold-Guided Semantic Cues for Compact Language Models

Title: InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Title: CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns

Title: Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence

Title: Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs

Title: Physio-DPO: Aligning Large Language Models with the Protein Energy Landscape to Eliminate Structural Hallucinations

Title: Fast-weight Product Key Memory

Title: Sigmoid Head for Quality Estimation under Language Ambiguity

Title: Exploring the Performance of Large Language Models on Subjective Span Identification Tasks