2025-11-06

Title: Cache Mechanism for Agent RAG Systems

Title: LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation

Title: Targeted Error Correction in Knowledge Distillation: Small Language Models Surpass GPT

Title: Data-Efficient Adaptation and a Novel Evaluation Method for Aspect-based Sentiment Analysis

Title: ROBoto2: An Interactive System and Dataset for LLM-assisted Clinical Trial Risk of Bias Assessment

Title: Reading Between the Lines: The One-Sided Conversation Problem

Title: PolyNorm: Few-Shot LLM-Based Text Normalization for Text-to-Speech

Title: CARMA: Comprehensive Automatically-annotated Reddit Mental Health Dataset for Arabic

Title: Control Barrier Function for Aligning Large Language Models

Title: MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Title: Who Sees the Risk? Stakeholder Conflicts and Explanatory Policies in LLM-based Risk Assessment

Title: Measuring Aleatoric and Epistemic Uncertainty in LLMs: Empirical Evaluation on ID and OOD QA Tasks

Title: BengaliMoralBench: A Benchmark for Auditing Moral Reasoning in Large Language Models within Bengali Language and Culture

Title: LGM: Enhancing Large Language Models with Conceptual Meta-Relations and Iterative Retrieval

Title: Hybrid Fact-Checking that Integrates Knowledge Graphs, Large Language Models, and Search-Based Retrieval Agents Improves Interpretable Claim Verification

Title: IndicSuperTokenizer: An Optimized Tokenizer for Indic Multilingual LLMs

Title: Comparing the Performance of LLMs in RAG-based Question-Answering: A Case Study in Computer Science Literature

Title: SCALE: Upscaled Continual Learning of Large Language Models

Title: Benchmarking the Thinking Mode of Multimodal Large Language Models in Clinical Tasks

Title: Silenced Biases: The Dark Side LLMs Learned to Refuse

Title: EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation

Title: LFC-DA: Logical Formula-Controlled Data Augmentation for Enhanced Logical Reasoning

Title: Overcoming the Generalization Limits of SLM Finetuning for Shape-Based Extraction of Datatype and Object Properties

Title: Efficient Reasoning via Thought-Training and Thought-Free Inference

Title: Knowledge-Augmented Question Error Correction for Chinese Question Answer System with QuestionRAG

Title: CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field

Title: Kastor: Fine-tuned Small Language Models for Shape-based Active Relation Extraction

Title: BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation

Title: HaluMem: Evaluating Hallucinations in Memory Systems of Agents

Title: One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework

Title: SOLVE-Med: Specialized Orchestration for Leading Vertical Experts across Medical Specialties

Title: MultiZebraLogic: A Multilingual Logical Reasoning Benchmark

Title: AILA--First Experiments with Localist Language Models

Title: ASVRI-Legal: Fine-Tuning LLMs with Retrieval Augmented Generation for Enhanced Legal Regulation

Title: Step-Audio-EditX Technical Report

Title: Towards Transparent Stance Detection: A Zero-Shot Approach Using Implicit and Explicit Interpretability

Title: Do Androids Dream of Unseen Puppeteers? Probing for a Conspiracy Mindset in Large Language Models

Title: Grounded Misunderstandings in Asymmetric Dialogue: A Perspectivist Annotation Scheme for MapTask