2025-08-20

Title: Fair Play in the Newsroom: Actor-Based Filtering Gender Discrimination in Text Corpora

Title: MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Title: Stands to Reason: Investigating the Effect of Reasoning on Idiomaticity Detection

Title: Datarus-R1: An Adaptive Multi-Step Reasoning LLM for Automated Data Analysis

Title: ALIGN: Word Association Learning for Cross-Cultural Generalization in Large Language Models

Title: ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs

Title: Saudi-Dialect-ALLaM: LoRA Fine-Tuning for Dialectal Arabic Generation

Title: MATA (māta): Mindful Assessment of the Telugu Abilities of Large Language Models

Title: A Comparative Study of Decoding Strategies in Medical Text Generation

Title: Who Gets the Mic? Investigating Gender Bias in the Speaker Assignment of a Speech-LLM

Title: CRISP: Persistent Concept Unlearning via Sparse Autoencoders

Title: ViExam: Are Vision Language Models Better than Humans on Vietnamese Multimodal Exam Questions?

Title: Generics and Default Reasoning in Large Language Models

Title: Prediction is not Explanation: Revisiting the Explanatory Capacity of Mapping Embeddings

Title: EEG-MedRAG: Enhancing EEG-based Clinical Decision-Making via Hierarchical Hypergraph Retrieval-Augmented Generation

Title: Sycophancy under Pressure: Evaluating and Mitigating Sycophantic Bias via Adversarial Dialogues in Scientific QA

Title: MGT-Prism: Enhancing Domain Generalization for Machine-Generated Text Detection via Spectral Alignment

Title: Can Large Language Models (LLMs) Describe Pictures Like Children? A Comparative Corpus Study

Title: TracSum: A New Benchmark for Aspect-Based Summarization with Sentence-Level Traceability in Medical Domain

Title: Beyond Human Judgment: A Bayesian Evaluation of LLMs' Moral Values Understanding

Title: Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs

Title: The illusion of a perfect metric: Why evaluating AI's words is harder than it looks

Title: Extracting Structured Requirements from Unstructured Building Technical Specifications for Building Information Modeling

Title: MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models

Title: ReviewGraph: A Knowledge Graph Embedding Based Framework for Review Rating Prediction with Sentiment Features

Title: Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization

Title: Ask Good Questions for Large Language Models

Title: Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Title: Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation

Title: The Promise of Large Language Models in Digital Health: Evidence from Sentiment Analysis in Online Health Communities