2025-04-04

Title: Increasing happiness through conversations with artificial intelligence

Title: ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation

Title: Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive ziji

Title: Overcoming Vocabulary Constraints with Pixel-level Fallback

Title: One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image

Title: LL4G: Self-Supervised Dynamic Optimization for Graph-Based Personality Detection

Title: Subasa -- Adapting Language Models for Low-resourced Offensive Language Detection in Sinhala

Title: LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks

Title: State-of-the-Art Translation of Text-to-Gloss using mBART: A case study of Bangla

Title: Measurement of LLM's Philosophies of Human Nature

Title: Improving Harmful Text Detection with Joint Retrieval and External Knowledge

Title: CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring

Title: LearNAT: Learning NL2SQL with AST-guided Task Decomposition for Large Language Models

Title: The quasi-semantic competence of LLMs: a case study on the part-whole relation

Title: Scaling Analysis of Interleaved Speech-Text Language Models

Title: DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers

Title: AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology

Title: Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation

Title: Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation

Title: Cognitive Memory in Large Language Models

Title: Inference-Time Scaling for Generalist Reward Modeling

Title: UNDO: Understanding Distillation as Optimization

Title: Leveraging LLM For Synchronizing Information Across Multilingual Tables

Title: Language Models reach higher Agreement than Humans in Historical Interpretation

Title: LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning

Title: LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems

Title: The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

Title: ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

Title: Why do LLMs attend to the first token?

Title: Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

Title: MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs

Title: A Framework for Robust Cognitive Evaluation of LLMs

Title: A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

Title: MegaMath: Pushing the Limits of Open Math Corpora

Title: Generative Evaluation of Complex Reasoning in Large Language Models