2025-07-03

Title: MALIBU Benchmark: Multi-Agent LLM Implicit Bias Uncovered

Title: Event-based evaluation of abstractive news summarization

Title: GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant

Title: Evaluating Large Language Models for Multimodal Simulated Ophthalmic Decision-Making in Diabetic Retinopathy and Glaucoma Screening

Title: Rethinking All Evidence: Enhancing Trustworthy Retrieval-Augmented Generation via Conflict-Driven Summarization

Title: Frustratingly Simple Retrieval Improves Challenging, Reasoning-Intensive Benchmarks

Title: La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation

Title: Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs

Title: LEDOM: An Open and Fundamental Reverse Language Model

Title: Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Title: LogitSpec: Accelerating Retrieval-based Speculative Decoding via Next Next Token Speculation

Title: Evaluating the Effectiveness of Direct Preference Optimization for Personalizing German Automatic Text Simplifications for Persons with Intellectual Disabilities

Title: Efficient Out-of-Scope Detection in Dialogue Systems via Uncertainty-Driven LLM Routing

Title: Is External Information Useful for Stance Detection with LLMs?

Title: Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation

Title: Chart Question Answering from Real-World Analytical Narratives

Title: Confidence and Stability of Global and Pairwise Scores in NLP Evaluation

Title: Adapting Language Models to Indonesian Local Languages: An Empirical Study of Language Transferability on Zero-Shot Settings

Title: AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Title: Stereotype Detection as a Catalyst for Enhanced Bias Detection: A Multi-Task Learning Approach

Title: LLMs for Legal Subsumption in German Employment Contracts

Title: MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Title: Probing Evaluation Awareness of Language Models

Title: How Do Vision-Language Models Process Conflicting Information Across Modalities?

Title: Evaluating Structured Output Robustness of Small Language Models for Open Attribute-Value Extraction from Clinical Notes

Title: Low-Perplexity LLM-Generated Sequences and Where To Find Them

Title: Eka-Eval : A Comprehensive Evaluation Framework for Large Language Models in Indian Languages

Title: DIY-MKG: An LLM-Based Polyglot Language Learning System

Title: MiCoTA: Bridging the Learnability Gap with Intermediate CoT and Teacher Assistants

Title: High-Layer Attention Pruning with Rescaling

Title: AI4Research: A Survey of Artificial Intelligence for Scientific Research

Title: Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models

Title: Decision-oriented Text Evaluation

Title: The Thin Line Between Comprehension and Persuasion in LLMs