2025-03-21

Title: Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings

Title: Enhancing Pancreatic Cancer Staging with Large Language Models: The Role of Retrieval-Augmented Generation

Title: Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient's Point of View

Title: KoGNER: A Novel Framework for Knowledge Graph Distillation on Biomedical Named Entity Recognition

Title: Can one size fit all?: Measuring Failure in Multi-Document Summarization Domain Transfer

Title: Grammar and Gameplay-aligned RL for Game Description Generation with LLMs

Title: Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation

Title: Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey

Title: Typed-RAG: Type-aware Multi-Aspect Decomposition for Non-Factoid Question Answering

Title: Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models

Title: From Structured Prompts to Open Narratives: Measuring Gender Bias in LLMs Through Open-Ended Storytelling

Title: Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning

Title: From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models

Title: Adaptive Group Policy Optimization: Towards Stable Training and Token-Efficient Reasoning

Title: InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer

Title: ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph

Title: Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models

Title: The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement

Title: Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content

Title: Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond

Title: Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation

Title: Meta-Learning Neural Mechanisms rather than Bayesian Priors

Title: Tuning LLMs by RAG Principles: Towards LLM-native Memory

Title: Cultural Alignment in Large Language Models Using Soft Prompt Tuning

Title: MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering

Title: Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems

Title: Towards Lighter and Robust Evaluation for Retrieval Augmented Generation

Title: SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

Title: MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion

Title: Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Title: LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates

Title: CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners

Title: Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models