2025-07-31

Title: IndoPref: A Multi-Domain Pairwise Preference Dataset for Indonesian

Title: Persona-Augmented Benchmarking: Evaluating LLMs Across Diverse Writing Styles

Title: A Scalable Pipeline for Estimating Verb Frame Frequencies Using Large Language Models

Title: How Well Does First-Token Entropy Approximate Word Entropy as a Psycholinguistic Predictor?

Title: RL from Teacher-Model Refinement: Gradual Imitation Learning for Machine Translation

Title: Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs

Title: Intent Recognition and Out-of-Scope Detection using LLMs in Multi-party Conversations

Title: A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers

Title: Traits Run Deep: Enhancing Personality Assessment via Psychology-Guided LLM Representations and Multimodal Apparent Behaviors

Title: PATENTWRITER: A Benchmarking Study for Patent Drafting with LLMs

Title: Question Generation for Assessing Early Literacy Reading Comprehension

Title: NeedleChain: Measuring Intact Long-Context Reasoning Capability of Large Language Models

Title: AI-generated stories favour stability over change: homogeneity and cultural stereotyping in narratives generated by gpt-4o-mini

Title: Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Title: What is an "Abstract Reasoner"? Revisiting Experiments and Arguments about Large Language Models

Title: IFEvalCode: Controlled Code Generation

Title: SLM-SQL: An Exploration of Small Language Models for Text-to-SQL

Title: CliCARE: Grounding Large Language Models in Clinical Guidelines for Decision Support over Longitudinal Cancer Electronic Health Records

Title: A Benchmark Dataset and Evaluation Framework for Vietnamese Large Language Models in Customer Support

Title: ControlMed: Adding Reasoning Control to Medical Language Model

Title: Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMs

Title: Unveiling the Influence of Amplifying Language-Specific Neurons

Title: BALSAM: A Platform for Benchmarking Arabic Large Language Models

Title: Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

Title: Multilingual Political Views of Large Language Models: Identification and Steering

Title: From Sufficiency to Reflection: Reinforcement-Guided Thinking Quality in Retrieval-Augmented Reasoning for LLMs

Title: Investigating Hallucination in Conversations for Low Resource Languages

Title: Resource-Efficient Adaptation of Large Language Models for Text Embeddings via Prompt Engineering and Contrastive Fine-tuning

Title: Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index

Title: CUS-QA: Local-Knowledge-Oriented Open-Ended Question Answering Dataset

Title: Opportunities and Challenges of LLMs in Education: An NLP Perspective

Title: MASCA: LLM based-Multi Agents System for Credit Assessment

Title: DBLPLink 2.0 -- An Entity Linker for the DBLP Scholarly Knowledge Graph

Title: Beyond Natural Language Plans: Structure-Aware Planning for Query-Focused Table Summarization

Title: Where to show Demos in Your Prompt: A Positional Bias of In-Context Learning