2025-04-03

Title: Repetitions are not all alike: distinct mechanisms sustain repetition in language models

Title: Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench

Title: Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Title: Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models

Title: $μ$KE: Matryoshka Unstructured Knowledge Editing of Large Language Models

Title: Medical large language models are easily distracted

Title: Detecting PTSD in Clinical Interviews: A Comparative Analysis of NLP Methods and Large Language Models

Title: Catastrophic Forgetting in LLMs: A Comparative Analysis Across Language Tasks

Title: Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models

Title: Grade Guard: A Smart System for Short Answer Automated Grading

Title: Prompt-Reverse Inconsistency: LLM Self-Inconsistency Beyond Generative Randomness and Prompt Paraphrasing

Title: ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning

Title: Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph

Title: Adaptive Rectification Sampling for Test-Time Compute Scaling

Title: GTR: Graph-Table-RAG for Cross-Table Question Answering

Title: LITE: LLM-Impelled efficient Taxonomy Evaluation

Title: ToolACE-R: Tool Learning with Adaptive Self-Refinement

Title: FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations

Title: Refining Interactions: Enhancing Anisotropy in Graph Neural Networks with Language Semantics

Title: PROPHET: An Inferable Future Forecasting Benchmark with Causal Intervened Likelihood Estimation

Title: Chain of Correction for Full-text Speech Recognition with Large Language Models

Title: Context-Aware Toxicity Detection in Multiplayer Games: Integrating Domain-Adaptive Pretraining and Match Metadata

Title: From Smør-re-brød to Subwords: Training LLMs on Danish, One Morpheme at a Time

Title: Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation

Title: Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish

Title: ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs

Title: InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation

Title: Style over Substance: Distilled Language Models Reason Via Stylistic Replication

Title: OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models

Title: Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Title: YourBench: Easy Custom Evaluation Sets for Everyone

Title: LARGE: Legal Retrieval Augmented Generation Evaluation Tool

Title: Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models

Title: TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables

Title: STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Title: Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation

Title: Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure

Title: A thorough benchmark of automatic text classification: From traditional approaches to large language models

Title: Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Title: OpenCodeReasoning: Advancing Data Distillation for Competitive Coding