2025-11-25

Title: SCARE: A Benchmark for SQL Correction and Question Answerability Classification for Reliable EHR Question Answering

Title: $A^3$: Attention-Aware Accurate KV Cache Fusion for Fast Large Language Model Serving

Title: LexInstructEval: Lexical Instruction Following Evaluation for Large Language Models

Title: Generative Caching for Structurally Similar Prompts and Responses

Title: Community-Aligned Behavior Under Uncertainty: Evidence of Epistemic Stance Transfer in LLMs

Title: Random Text, Zipf's Law, Critical Length,and Implications for Large Language Models

Title: Computational frame analysis revisited: On LLMs for studying news coverage

Title: PoETa v2: Toward More Robust Evaluation of Large Language Models in Portuguese

Title: Point of Order: Action-Aware LLM Persona Modeling for Realistic Civic Simulation

Title: A superpersuasive autonomous policy debating system

Title: Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction

Title: L2V-CoT: Cross-Modal Transfer of Chain-of-Thought Reasoning via Latent Intervention

Title: Towards Efficient LLM-aware Heterogeneous Graph Learning

Title: SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization

Title: Measuring the Impact of Lexical Training Data Coverage on Hallucination Detection in Large Language Models

Title: Blu-WERP (Web Extraction and Refinement Pipeline): A Scalable Pipeline for Preprocessing Large Language Model Datasets

Title: Vector Arithmetic in Concept and Token Subspaces

Title: Rethinking Retrieval: From Traditional Retrieval Augmented Generation to Agentic and Non-Vector Reasoning Systems in the Financial Domain for Large Language Models

Title: Agent-as-a-Graph: Knowledge Graph-Based Tool and Agent Retrieval for LLM Multi-Agent Systems

Title: From Archives to Decisions: Multi-Agent Pharmaceutical Co-Scientist for Traceable Drug Discovery and Reverse Translation

Title: "AGI" team at SHROOM-CAP: Data-Centric Approach to Multilingual Hallucination Detection using XLM-RoBERTa

Title: Table Comprehension in Building Codes using Vision Language Models and Domain-Specific Fine-Tuning

Title: Path-Constrained Retrieval: A Structural Approach to Reliable LLM Agent Reasoning Through Graph-Scoped Semantic Search

Title: Gradient Masters at BLP-2025 Task 1: Advancing Low-Resource NLP for Bengali using Ensemble-Based Adversarial Training for Hate Speech Detection

Title: OmniStruct: Universal Text-to-Structure Generation across Diverse Schemas

Title: Towards Robust and Fair Next Visit Diagnosis Prediction under Noisy Clinical Notes with Large Language Models

Title: Findings of the BlackboxNLP 2025 Shared Task: Localizing Circuits and Causal Variables in Language Models

Title: Multi-Agent Collaborative Filtering: Orchestrating Users and Items for Agentic Recommendations

Title: General Agentic Memory Via Deep Research

Title: MindEval: Benchmarking Language Models on Multi-turn Mental Health Support

Title: For Those Who May Find Themselves on the Red Team

Title: Toward Trustworthy Difficulty Assessments: Large Language Models as Judges in Programming and Synthetic Tasks

Title: A Benchmark for Zero-Shot Belief Inference in Large Language Models

Title: Prompt Optimization as a State-Space Search Problem

Title: OpenGloss: A Synthetic Encyclopedic Dictionary and Semantic Knowledge Graph

Title: No Free Lunch in Language Model Bias Mitigation? Targeted Bias Reduction Can Exacerbate Unmitigated LLM Biases

Title: Evaluating Large Language Models on the 2026 Korean CSAT Mathematics Exam: Measuring Mathematical Ability in a Zero-Data-Leakage Setting

Title: CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

Title: Empathetic Cascading Networks: A Multi-Stage Prompting Technique for Reducing Social Biases in Large Language Models

Title: RhinoInsight: Improving Deep Research through Control Mechanisms for Model Behavior and Context

Title: Large Language Models Require Curated Context for Reliable Political Fact-Checking -- Even with Reasoning and Web Search

Title: Context-Aware Whisper for Arabic ASR Under Linguistic Varieties

Title: HyperbolicRAG: Enhancing Retrieval-Augmented Generation with Hyperbolic Representations

Title: Concept than Document: Context Compression via AMR-based Conceptual Entropy

Title: Large Language Models for the Summarization of Czech Documents: From History to the Present

Title: Cognitive Alpha Mining via LLM-Driven Code-Based Evolution

Title: FanarGuard: A Culturally-Aware Moderation Filter for Arabic Language Models

Title: Generating Reading Comprehension Exercises with Large Language Models for Educational Applications

Title: Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models

Title: CoreEval: Automatically Building Contamination-Resilient Datasets with Real-World Knowledge toward Reliable LLM Evaluation

Title: Reproducibility Study of Large Language Model Bayesian Optimization

Title: Look It Up: Analysing Internal Web Search Capabilities of Modern LLMs

Title: Skeletons Matter: Dynamic Data Augmentation for Text-to-Query

Title: GraphMind: Theorem Selection and Conclusion Generation Framework with Dynamic GNN for LLM Reasoning

Title: A Multi-Agent LLM Framework for Multi-Domain Low-Resource In-Context NER via Knowledge Retrieval, Disambiguation and Reflective Analysis

Title: DeCoRL: Decoupling Reasoning Chains via Parallel Sub-Step Generation and Cascaded Reinforcement for Interpretable and Scalable RLHF

Title: Emotion-Enhanced Multi-Task Learning with LLMs for Aspect Category Sentiment Analysis

Title: Eliciting Chain-of-Thought in Base LLMs via Gradient-Based Representation Optimization

Title: Representational Stability of Truth in Large Language Models

Title: In Machina N400: Pinpointing Where a Causal Language Model Detects Semantic Violations

Title: Learning to Reason: Training LLMs with GPT-OSS or DeepSeek R1 Reasoning Traces

Title: DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Title: Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration