2025-06-12

Title: LLM-as-a-qualitative-judge: automating error analysis in natural language generation

Title: PHRASED: Phrase Dictionary Biasing for Speech Translation

Title: Extrapolation by Association: Length Generalization Transfer in Transformers

Title: Self-Anchored Attention Model for Sample-Efficient Classification of Prosocial Text Chat

Title: Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models

Title: $(RSA)^2$: A Rhetorical-Strategy-Aware Rational Speech Act Framework for Figurative Language Understanding

Title: Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models

Title: Towards Efficient and Effective Alignment of Large Language Models

Title: Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation

Title: RePO: Replay-Enhanced Policy Optimization

Title: Latent Multi-Head Attention for Small Language Models

Title: OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment

Title: DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts

Title: Taming SQL Complexity: LLM-Based Equivalence Evaluation for Text-to-SQL

Title: COGENT: A Curriculum-oriented Framework for Generating Grade-appropriate Educational Content

Title: CoLMbo: Speaker Language Model for Descriptive Profiling

Title: Comparing human and LLM politeness strategies in free production

Title: Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models

Title: PGDA-KGQA: A Prompt-Guided Generative Framework with Multiple Data Augmentation Strategies for Knowledge Graph Question Answering

Title: Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings

Title: Improved Supervised Fine-Tuning for Large Language Models to Mitigate Catastrophic Forgetting

Title: GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture

Title: UniToMBench: Integrating Perspective-Taking to Improve Theory of Mind in LLMs

Title: Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms

Title: Bridging Online Behavior and Clinical Insight: A Longitudinal LLM-based Study of Suicidality on YouTube Reveals Novel Digital Markers

Title: Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Title: TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding

Title: ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Title: KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs

Title: Gender Bias in English-to-Greek Machine Translation

Title: Towards Open Foundation Language Model and Corpus for Macedonian: A Low-Resource Language

Title: From Symbolic to Neural and Back: Exploring Knowledge Graph-Large Language Model Synergies

Title: Memorization in Language Models through the Lens of Intrinsic Dimension

Title: Benchmarking Debiasing Methods for LLM-based Parameter Estimates

Title: Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering

Title: Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA

Title: Query-Level Uncertainty in Large Language Models

Title: Is Fine-Tuning an Effective Solution? Reassessing Knowledge Editing for Unstructured Data

Title: Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models

Title: ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Title: Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?

Title: CoRT: Code-integrated Reasoning within Thinking

Title: Dataset of News Articles with Provenance Metadata for Media Relevance Assessment

Title: Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning

Title: Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs

Title: The Emergence of Abstract Thought in Large Language Models Beyond Any Language

Title: PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants

Title: Aspect-Based Opinion Summarization with Argumentation Schemes

Title: VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Title: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking

Title: Resa: Transparent Reasoning Models via SAEs

Title: When Detection Fails: The Power of Fine-Tuned Models to Generate Human-Like Social Media Text

Title: Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs

Title: Large Language Models for Toxic Language Detection in Low-Resource Balkan Languages

Title: From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring