2025-06-04

Title: Research on Medical Named Entity Identification Based On Prompt-Biomrc Model and Its Application in Intelligent Consultation System

Title: No Free Lunch in Active Learning: LLM Embedding Quality Dictates Query Strategy Success

Title: NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts

Title: Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data

Title: ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking

Title: FinS-Pilot: A Benchmark for Online Financial System

Title: Enhancing Multimodal Continual Instruction Tuning with BranchLoRA

Title: Evaluating the Unseen Capabilities: How Many Theorems Do LLMs Know?

Title: Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains

Title: Model Internal Sleuthing: Finding Lexical Identity and Inflectional Morphology in Modern Language Models

Title: BabyLM's First Constructions: Causal interventions provide a signal of learning

Title: BehaviorBox: Automated Discovery of Fine-Grained Performance Differences Between Language Models

Title: Leveraging Natural Language Processing to Unravel the Mystery of Life: A Review of NLP Approaches in Genomics, Transcriptomics, and Proteomics

Title: Investigating the Impact of Word Informativeness on Speech Emotion Recognition

Title: CoDial: Interpretable Task-Oriented Dialogue Systems Through Dialogue Flow Alignment

Title: ImpRAG: Retrieval-Augmented Generation with Implicit Queries

Title: LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback

Title: Explain-then-Process: Using Grammar Prompting to Enhance Grammatical Acceptability Judgments

Title: Something Just Like TRuST : Toxicity Recognition of Span and Target

Title: One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

Title: Truth over Tricks: Measuring and Mitigating Shortcut Learning in Misinformation Detection

Title: DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization

Title: AnswerCarefully: A Dataset for Improving the Safety of Japanese LLM Output

Title: Exploring Explanations Improves the Robustness of In-Context Learning

Title: Consultant Decoding: Yet Another Synergistic Mechanism

Title: GraphRAG-Bench: Challenging Domain-Specific Reasoning for Evaluating Graph Retrieval-Augmented Generation

Title: Gender Inequality in English Textbooks Around the World: an NLP Approach

Title: Comparative Analysis of AI Agent Architectures for Entity Relationship Classification

Title: From Anger to Joy: How Nationality Personas Shape Emotion Attribution in Large Language Models

Title: Should LLM Safety Be More Than Refusing Harmful Instructions?

Title: Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework

Title: MidPO: Dual Preference Optimization for Safety and Helpfulness in Large Language Models via a Mixture of Experts Framework

Title: XToM: Exploring the Multilingual Theory of Mind for Large Language Models

Title: FroM: Frobenius Norm-Based Data-Free Adaptive Model Merging

Title: ORPP: Self-Optimizing Role-playing Prompts to Enhance Language Model Capabilities

Title: Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths

Title: Enhancing Large Language Models with Neurosymbolic Reasoning for Multilingual Tasks

Title: Minos: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text

Title: KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG

Title: M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

Title: FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning

Title: Learning Together to Perform Better: Teaching Small-Scale LLMs to Collaborate via Preferential Rationale Tuning

Title: Answer Convergence as a Signal for Early Stopping in Reasoning

Title: CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG

Title: Pruning General Large Language Models into Customized Expert Models

Title: IndoSafety: Culturally Grounded Safety for LLMs in Indonesian Languages

Title: Evaluating Named Entity Recognition Models for Russian Cultural News Texts: From BERT to LLM

Title: On Generalization across Measurement Systems: LLMs Entail More Test-Time Compute for Underrepresented Cultures

Title: Beyond the Surface: Measuring Self-Preference in LLM Judgments

Title: EssayBench: Evaluating Large Language Models in Multi-Genre Chinese Essay Writing

Title: Are Economists Always More Introverted? Analyzing Consistency in Persona-Assigned LLMs

Title: EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving

Title: TL;DR: Too Long, Do Re-weighting for Effcient LLM Reasoning Compression

Title: Decompose, Plan in Parallel, and Merge: A Novel Paradigm for Large Language Models based Planning with Multiple Constraints

Title: MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching

Title: On Entity Identification in Language Models

Title: RACE-Align: Retrieval-Augmented and Chain-of-Thought Enhanced Preference Alignment for Large Language Models

Title: Exploiting the English Vocabulary Profile for L2 word-level vocabulary assessment with LLMs

Title: SemVink: Advancing VLMs' Semantic Understanding of Optical Illusions via Visual Global Thinking

Title: ProcrustesGPT: Compressing LLMs with Structured Matrices and Orthogonal Transformations

Title: TO-GATE: Clarifying Questions and Summarizing Responses with Trajectory Optimization for Eliciting Human Preference

Title: Token and Span Classification for Entity Recognition in French Historical Encyclopedias

Title: CoT is Not True Reasoning, It Is Just a Tight Constraint to Imitate: A Theory Perspective

Title: IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator

Title: Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning

Title: A Controllable Examination for Long-Context Language Models

Title: INESC-ID @ eRisk 2025: Exploring Fine-Tuned, Similarity-Based, and Prompt-Based Approaches to Depression Symptom Identification

Title: Quantitative LLM Judges

Title: Adaptive Graph Pruning for Multi-Agent Communication

Title: HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring

Title: FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models

Title: Expanding before Inferring: Enhancing Factuality in Large Language Models through Premature Layers Interpolation

Title: Performance of leading large language models in May 2025 in Membership of the Royal College of General Practitioners-style examination questions: a cross-sectional analysis

Title: It's Not a Walk in the Park! Challenges of Idiom Translation in Speech-to-text Systems

Title: A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems

Title: Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech

Title: Coding Agents with Multimodal Browsing are Generalist Problem Solvers

Title: Leveraging Information Retrieval to Enhance Spoken Language Understanding Prompts in Few-Shot Learning

Title: Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective

Title: Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs

Title: Literary Evidence Retrieval via Long-Context Language Models

Title: Beyond Text Compression: Evaluating Tokenizers Across Scales

Title: Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback

Title: AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation

Title: Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning

Title: GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Title: Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM

Title: Causal Estimation of Tokenisation Bias