2025-04-14

Title: Metamorphic Testing for Fairness Evaluation in Large Language Models: Identifying Intersectional Bias in LLaMA and GPT

Title: Psychological Health Knowledge-Enhanced LLM-based Social Network Crisis Intervention Text Transfer Recognition Method

Title: SEAL: Steerable Reasoning Calibration of Large Language Models for Free

Title: Regional Tiny Stories: Using Small Models to Compare Language Learning and Tokenizer Performance

Title: 'Neural howlround' in large language models: a self-reinforcing bias phenomenon, and a dynamic attenuation solution

Title: SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness

Title: BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models

Title: Linguistic Interpretability of Transformer-based Language Models: a systematic review

Title: More diverse more adaptive: Comprehensive Multi-task Learning for Improved LLM Domain Adaptation in E-commerce

Title: Can Reasoning LLMs Enhance Clinical Document Classification?

Title: DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

Title: Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

Title: Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models

Title: LLM for Comparative Narrative Analysis

Title: Out of Style: RAG's Fragility to Linguistic Variation

Title: Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare

Title: ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation

Title: Large language models could be rote learners

Title: Scholar Inbox: Personalized Paper Recommendations for Scientists

Title: Beyond Self-Reports: Multi-Observer Agents for Personality Assessment in Large Language Models

Title: Integrated ensemble of BERT- and features-based models for authorship attribution in Japanese literary works

Title: On The Landscape of Spoken Language Models: A Comprehensive Survey

Title: Lexical Bundle Frequency as a Construct-Relevant Candidate Feature in Automated Scoring of L2 Academic Writing

Title: UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection

Title: Playpen: An Environment for Exploring Learning Through Conversational Interaction

Title: MedHal: An Evaluation Dataset for Medical Hallucination Detection

Title: A Survey of Machine Learning Models and Datasets for the Multi-label Classification of Textual Hate Speech in English

Title: Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Title: Fast-Slow-Thinking: Complex Task Solving with Large Language Models

Title: TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning

Title: Large Language Models as Span Annotators

Title: SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling