2025-09-10

Title: MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations

Title: Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models

Title: Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector

Title: DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge

Title: Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation

Title: LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade

Title: Causal Attention with Lookahead Keys

Title: Instance-level Performance Prediction for Long-form Generation Tasks

Title: Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations

Title: Mitigating Attention Localization in Small Scale: Self-Attention Refinement via One-step Belief Propagation

Title: PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions

Title: Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents

Title: The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering

Title: LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Title: AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training

Title: HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention

Title: ALLabel: Three-stage Active Learning for LLM-based Entity Recognition using Demonstration Retrieval

Title: VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents

Title: Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition

Title: BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment

Title: MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs

Title: MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval

Title: M-BRe: Discovering Training Samples for Relation Extraction from Unlabeled Texts with Large Language Models

Title: Factuality Beyond Coherence: Evaluating LLM Watermarking Methods for Medical Texts

Title: Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning

Title: Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Title: Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost

Title: Are Humans as Brittle as Large Language Models?

Title: From Detection to Mitigation: Addressing Gender Bias in Chinese Texts via Efficient Tuning and Voting-Based Rebalancing

Title: Biased Tales: Cultural and Topic Bias in Generating Children's Stories

Title: GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models

Title: SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge

Title: Parallel-R1: Towards Parallel Thinking via Reinforcement Learning