2025-09-04

Title: Clustering Discourses: Racial Biases in Short Stories about Women Generated by Large Language Models

Title: IDEAlign: Comparing Large Language Models to Human Experts in Open-ended Interpretive Annotations

Title: A-SEA3L-QA: A Fully Automated Self-Evolving, Adversarial Workflow for Arabic Long-Context Question-Answer Generation

Title: English Pronunciation Evaluation without Complex Joint Training: LoRA Fine-tuned Speech Multimodal LLM

Title: ProMQA-Assembly: Multimodal Procedural QA Dataset on Assembly

Title: DiaCBT: A Long-Periodic Dialogue Corpus Guided by Cognitive Conceptualization Diagram for CBT-based Psychological Counseling

Title: Training LLMs to be Better Text Embedders through Bidirectional Reconstruction

Title: Structure-Learnable Adapter Fine-Tuning for Parameter-Efficient Large Language Models

Title: Measuring Scalar Constructs in Social Science with LLMs

Title: From Evaluation to Defense: Constructing Persistent Edit-Based Fingerprints for Large Language Models

Title: Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader

Title: Domain Adaptation of LLMs for Process Data

Title: SinhalaMMLU: A Comprehensive Benchmark for Evaluating Multitask Language Understanding in Sinhala

Title: AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?

Title: LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Title: Curse of Knowledge: When Complex Evaluation Context Benefits yet Biases LLM Judges

Title: Design and Optimization of Reinforcement Learning-Based Agents in Text-Based Games