2025-06-16

Title: TeleEval-OS: Performance evaluations of large language models for operations scheduling

Title: Who is in the Spotlight: The Hidden Bias Undermining Multimodal Retrieval-Augmented Generation

Title: Smotrom tvoja pa ander drogoj verden! Resurrecting Dead Pidgin with Generative Models: Russenorsk Case Study

Title: A Large Language Model Based Pipeline for Review of Systems Entity Recognition from Clinical Notes

Title: Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models

Title: Targeted control of fast prototyping through domain-specific interface

Title: CLAIM: Mitigating Multilingual Object Hallucination in Large Vision-Language Models with Cross-Lingual Attention Intervention

Title: CyclicReflex: Improving Large Reasoning Models via Cyclical Reflection Token Scheduling

Title: RoE-FND: A Case-Based Reasoning Approach with Dual Verification for Fake News Detection via LLMs

Title: MANBench: Is Your Multimodal Model Smarter than Human?

Title: SAGE:Specification-Aware Grammar Extraction for Automated Test Case Generation with LLMs

Title: PRISM: A Transformer-based Language Model of Structured Clinical Event Data

Title: RedDebate: Safer Responses through Multi-Agent Red Teaming Debates

Title: Two Birds with One Stone: Improving Factuality and Faithfulness of LLMs via Dynamic Interactive Subspace Editing

Title: Customizing Speech Recognition Model with Large Language Model Feedback

Title: Dynamic Context Tuning for Retrieval-Augmented Generation: Enhancing Multi-Turn Planning and Tool Adaptation

Title: The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs

Title: C-SEO Bench: Does Conversational SEO Work?

Title: Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey

Title: You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Model

Title: DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration

Title: Enabling On-Device Medical AI Assistants via Input-Driven Saliency Adaptation

Title: Graph-based RAG Enhancement via Global Query Disambiguation and Dependency-Aware Reranking

Title: History-Aware Cross-Attention Reinforcement: Self-Supervised Multi Turn and Chain-of-Thought Fine-Tuning with vLLM

Title: Enhancing Large Language Models for Mobility Analytics with Semantic Location Tokenization

Title: AssertBench: A Benchmark for Evaluating Self-Assertion in Large Language Models

Title: Evaluating and Improving Robustness in Large Language Models: A Survey and Future Directions

Title: Manifesto from Dagstuhl Perspectives Workshop 24352 -- Conversational Agents: A Framework for Evaluation (CAFE)

Title: Breaking the Reviewer: Assessing the Vulnerability of Large Language Models in Automated Peer Review Under Textual Adversarial Attacks

Title: KokushiMD-10: Benchmark for Evaluating Large Language Models on Ten Japanese National Healthcare Licensing Examinations

Title: Incorporating Domain Knowledge into Materials Tokenization

Title: Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models

Title: ScIRGen: Synthesize Realistic and Large-Scale RAG Dataset for Scientific Research

Title: Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech

Title: SDMPrune: Self-Distillation MLP Pruning for Efficient Large Language Models

Title: SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR

Title: ASRJam: Human-Friendly AI Speech Jamming to Prevent Automated Phone Scams

Title: GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech Instructions

Title: Stronger Language Models Produce More Human-Like Errors

Title: Trustworthy AI for Medicine: Continuous Hallucination Detection and Elimination with CHECK

Title: Large Language Models and Emergence: A Complex Systems Perspective

Title: Scalable Medication Extraction and Discontinuation Identification from Electronic Health Records Using Large Language Models

Title: Iterative Multilingual Spectral Attribute Erasure

Title: No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning

Title: Learning a Continue-Thinking Token for Enhanced Test-Time Scaling

Title: Beyond Random Sampling: Efficient Language Model Pretraining via Curriculum Learning

Title: Don't Pay Attention

Title: Surprisal from Larger Transformer-based Language Models Predicts fMRI Data More Poorly

Title: From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review

Title: The Biased Samaritan: LLM biases in Perceived Kindness

Title: Curriculum-Guided Layer Scaling for Language Model Pretraining

Title: Predicting Early-Onset Colorectal Cancer with Large Language Models

Title: Efficient Long-Context LLM Inference via KV Cache Clustering

Title: Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards

Title: KoGEC : Korean Grammatical Error Correction with Pre-trained Translation Models

Title: AbsenceBench: Language Models Can't Tell What's Missing

Title: A Gamified Evaluation and Recruitment Platform for Low Resource Language Machine Translation Systems

Title: Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

Title: ImmunoFOMO: Are Language Models missing what oncologists see?

Title: Relational Schemata in BERT Are Inducible, Not Emergent: A Study of Performance vs. Competence in Language Models

Title: Lag-Relative Sparse Attention In Long Context Training

Title: On the Effectiveness of Integration Methods for Multimodal Dialogue Response Retrieval

Title: From Persona to Person: Enhancing the Naturalness with Multiple Discourse Relations Graph Learning in Personalized Dialogue Generation

Title: Are LLMs Good Text Diacritizers? An Arabic and Yorùbá Case Study

Title: SceneGram: Conceptualizing and Describing Tangrams in Scene Context

Title: LoRA-Gen: Specializing Large Language Model via Online LoRA Generation

Title: Converting Annotated Clinical Cases into Structured Case Report Forms

Title: LLMs for Sentence Simplification: A Hybrid Multi-Agent prompting Approach

Title: Configurable Preference Tuning with Rubric-Guided Synthetic Data

Title: DART: Distilling Autoregressive Reasoning to Silent Thought

Title: DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Title: Long-Short Alignment for Effective Long-Context Modeling in LLMs

Title: Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models

Title: Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?

Title: Post Persona Alignment for Multi-Session Dialogue Generation

Title: Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Title: GeistBERT: Breathing Life into German NLP

Title: Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Title: Improving Large Language Model Safety with Contrastive Representation Learning

Title: code_transformed: The Influence of Large Language Models on Code