2025-04-30

Title: It's the same but not the same: Do LLMs distinguish Spanish varieties?

Title: Evaluating Large Language Models on Multiword Expressions in Multilingual and Code-Switched Contexts

Title: Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Title: MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools

Title: A Multimodal Pipeline for Clinical Data Extraction: Applying Vision-Language Models to Scans of Transfusion Reaction Reports

Title: Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi

Title: Local Prompt Optimization

Title: What Causes Knowledge Loss in Multilingual Language Models?

Title: DMDTEval: An Evaluation and Analysis of LLMs on Disambiguation in Multi-domain Translation

Title: On Psychology of AI -- Does Primacy Effect Affect ChatGPT and Other LLMs?

Title: Team ACK at SemEval-2025 Task 2: Beyond Word-for-Word Machine Translation for English-Korean Pairs

Title: Fane at SemEval-2025 Task 10: Zero-Shot Entity Framing with Large Language Models

Title: Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training

Title: UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation

Title: Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records

Title: BrAIcht, a theatrical agent that speaks like Bertolt Brecht's characters

Title: TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

Title: WenyanGPT: A Large Language Model for Classical Chinese Tasks

Title: Cooking Up Creativity: A Cognitively-Inspired Approach for Enhancing LLM Creativity through Structured Representations

Title: A Generative-AI-Driven Claim Retrieval System Capable of Detecting and Retrieving Claims from Social Media Platforms in Multiple Languages

Title: Are Information Retrieval Approaches Good at Harmonising Longitudinal Survey Questions in Social Science?

Title: Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?

Title: Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think

Title: UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Title: Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Title: Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption

Title: Turing Machine Evaluation for Large Language Model

Title: Universal language model with the intervention of quantum theory

Title: JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry

Title: DYNAMAX: Dynamic computing for Transformers and Mamba based architectures

Title: Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models

Title: Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models

Title: OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification

Title: SetKE: Knowledge Editing for Knowledge Elements Overlap