2025-05-13

Title: ScaleMCP: Dynamic and Auto-Synchronizing Model Context Protocol Tools for LLM Agents

Title: Is your multimodal large language model a good science tutor?

Title: xGen-small Technical Report

Title: REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Title: MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG

Title: Evaluating LLM-Generated Q&A Test: a Student-Centered Study

Title: Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation

Title: Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation

Title: Boosting Neural Language Inference via Cascaded Interactive Reasoning

Title: Attention Is Not All You Need: The Importance of Feedforward Networks in Transformer Models

Title: TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models

Title: From Rankings to Insights: Evaluation Should Shift Focus from Leaderboard to Feedback

Title: Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Title: Utilizing LLMs to Investigate the Disputed Role of Evidence in Electronic Cigarette Health Policy Formation in Australia and the UK

Title: IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method

Title: EcoLANG: Efficient and Effective Agent Communication Language Induction for Social Simulation

Title: The Distracting Effect: Understanding Irrelevant Passages in RAG

Title: Convert Language Model into a Value-based Strategic Planner

Title: HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling

Title: Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue

Title: KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification

Title: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs

Title: On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud

Title: Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030

Title: DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

Title: SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models

Title: No Query, No Access

Title: On the Robustness of Reward Models for Language Model Alignment

Title: Semantic Retention and Extreme Compression in LLMs: Can We Have Both?

Title: AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Title: Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study

Title: QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines

Title: Computational Fact-Checking of Online Discourse: Scoring scientific accuracy in climate change related news articles

Title: ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution

Title: SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion

Title: A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

Title: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

Title: Characterizing the Investigative Methods of Fictional Detectives with Large Language Models

Title: MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Title: Concept-Level Explainability for Auditing & Steering LLM Responses

Title: Chronocept: Instilling a Sense of Time in Machines

Title: JobHop: A Large-Scale Dataset of Career Trajectories

Title: Benchmarking Retrieval-Augmented Generation for Chemistry

Title: OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit

Title: Codifying Character Logic in Role-Playing

Title: Spoken Language Understanding on Unseen Tasks With In-Context Learning

Title: Must Read: A Systematic Survey of Computational Persuasion

Title: Domain Regeneration: How well do LLMs match syntactic properties of text domains?

Title: Learning Dynamics in Continual Pre-Training for Large Language Models