2025-07-24

Title: A Unifying Scheme for Extractive Content Selection Tasks

Title: AI-based Clinical Decision Support for Primary Care: A Real-World Study

Title: Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs

Title: Text-to-SPARQL Goes Beyond English: Multilingual Question Answering Over Knowledge Graphs through Human-Inspired Reasoning

Title: Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain

Title: Obscured but Not Erased: Evaluating Nationality Bias in LLMs via Name-Based Bias Benchmarks

Title: Multi-Label Classification with Generative AI Models in Healthcare: A Case Study of Suicidality and Risk Factors

Title: Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?

Title: CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards

Title: SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

Title: FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance

Title: The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models

Title: Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge

Title: Millions of $\text{GeAR}$-s: Extending GraphRAG to Millions of Documents

Title: Each to Their Own: Exploring the Optimal Embedding in RAG

Title: MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

Title: Synthetic Voice Data for Automatic Speech Recognition in African Languages

Title: A Hybrid Early-Exit Algorithm for Large Language Models Based on Space Alignment Decoding (SPADE)

Title: WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training

Title: Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries

Title: Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Title: From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

Title: AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

Title: Megrez2 Technical Report

Title: Pretraining on the Test Set Is No Longer All You Need: A Debate-Driven Approach to QA Benchmarks