2025-05-08

Title: Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding

Title: A Reasoning-Focused Legal Retrieval Benchmark

Title: Divide, Optimize, Merge: Fine-Grained LLM Agent Optimization at Scale

Title: X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Title: SLOT: Structuring the Output of Large Language Models

Title: Advancing and Benchmarking Personalized Tool Invocation for LLMs

Title: Natural Language Generation in Healthcare: A Review of Methods and Applications

Title: Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model

Title: Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models

Title: Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety

Title: Can Language Models Understand Social Behavior in Clinical Conversations?

Title: LLM-Independent Adaptive RAG: Let the Question Speak for Itself

Title: GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance

Title: The Aloe Family Recipe for Open and Specialized Healthcare LLMs

Title: Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters

Title: YABLoCo: Yet Another Benchmark for Long Context Code Generation

Title: OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models

Title: Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

Title: Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review

Title: ZeroSearch: Incentivize the Search Capability of LLMs without Searching