2025-09-25

Title: Automated Item Neutralization for Non-Cognitive Scales: A Large Language Model Approach to Reducing Social-Desirability Bias

Title: FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering

Title: Readme_AI: Dynamic Context Construction for Large Language Models

Title: How Much of Your Data Can Suck? Thresholds for Domain Performance and Emergent Misalignment in LLMs

Title: Unveiling the Merits and Defects of LLMs in Automatic Review Generation for Scientific Papers

Title: A systematic review of trial-matching pipelines using large language models

Title: How Model Size, Temperature, and Prompt Style Affect LLM-Human Assessment Score Alignment

Title: Quantifying Compositionality of Classic and State-of-the-Art Embeddings

Title: Pluralistic Off-policy Evaluation and Alignment

Title: Cognitive-Level Adaptive Generation via Capability-Aware Retrieval and Style Adaptation

Title: Performance of Large Language Models in Answering Critical Care Medicine Questions

Title: SCORE: A Semantic Evaluation Framework for Generative Document Parsing

Title: Benchmarking ChatGPT and DeepSeek in April 2025: A Novel Dual Perspective Sentiment Analysis Using Lexicon-Based and Deep Learning Approaches

Title: Characterizing Knowledge Graph Tasks in LLM Benchmarks Using Cognitive Complexity Frameworks

Title: ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

Title: TriSPrompt: A Hierarchical Soft Prompt Model for Multimodal Rumor Detection with Incomplete Modalities

Title: RoadMind: Towards a Geospatial AI Expert for Disaster Response

Title: Benchmarking and Improving LLM Robustness for Personalized Generation

Title: Semantic Representation Attack against Aligned Large Language Models

Title: The Inadequacy of Offline LLM Evaluations: A Need to Account for Personalization in Model Behavior

Title: LLM-Assisted Topic Reduction for BERTopic on Social Media Data

Title: Pipeline Parallelism is All You Need for Optimized Early-Exit Based Self-Speculative Decoding

Title: SLM-Based Agentic AI with P-C-G: Optimized for Korean Tool Use

Title: Meow: End-to-End Outline Writing for Automatic Academic Survey

Title: How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models

Title: A Pipeline to Assess Merging Methods via Behavior and Internals

Title: Do LLMs Encode Frame Semantics? Evidence from Frame Identification

Title: Confidence Calibration in Large Language Model-Based Entity Matching

Title: Uncertainty in Semantic Language Modeling with PIXELS

Title: Retrieval Augmented Generation based context discovery for ASR

Title: ExPe: Exact Positional Encodings for Generative Transformer Models with Extrapolating Capabilities

Title: LLMs4All: A Review on Large Language Models for Research and Applications in Academic Disciplines

Title: GuessingGame: Measuring the Informativeness of Open-Ended Questions in Large Language Models

Title: Anatomy of a Feeling: Narrating Embodied Emotions via Large Vision-Language Models

Title: Evaluating Language Translation Models by Playing Telephone

Title: AutoSpec: An Agentic Framework for Automatically Drafting Patent Specification

Title: Large Language Models for Pedestrian Safety: An Application to Predicting Driver Yielding Behavior at Unsignalized Intersections

Title: Personality Vector: Modulating Personality of Large Language Models by Model Merging

Title: HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST

Title: PART: Progressive Alignment Representation Training for Multilingual Speech-To-Text with LLMs

Title: CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition

Title: EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation

Title: bi-GRPO: Bidirectional Optimization for Jailbreak Backdoor Injection on LLMs

Title: Polarity Detection of Sustainable Detection Goals in News Text

Title: TianHui: A Domain-Specific Large Language Model for Diverse Traditional Chinese Medicine Scenarios

Title: Benchmarking Gaslighting Attacks Against Speech Large Language Models

Title: SINAI at eRisk@CLEF 2025: Transformer-Based and Conversational Strategies for Depression Detection

Title: Do Before You Judge: Self-Reference as a Pathway to Better LLM Evaluation

Title: Future Policy Aware Preference Learning for Mathematical Reasoning

Title: WEST: LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Title: The Knowledge-Behaviour Disconnect in LLM-based Chatbots

Title: DiffNator: Generating Structured Explanations of Time-Series Differences

Title: Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks

Title: From Input Perception to Predictive Insight: Modeling Model Blind Spots Before They Become Errors

Title: From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training

Title: OLaPh: Optimal Language Phonemizer

Title: Causal Understanding by LLMs: The Role of Uncertainty

Title: Integrated Framework for LLM Evaluation with Answer Generation

Title: Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation

Title: Probing Gender Bias in Multilingual LLMs: A Case Study of Stereotypes in Persian

Title: Thinking Augmented Pre-training

Title: Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs

Title: Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models

Title: Instruction Boundary: Quantifying Biases in LLM Reasoning under Various Coverage

Title: SIM-CoT: Supervised Implicit Chain-of-Thought

Title: Z-Scores: A Metric for Linguistically Assessing Disfluency Removal

Title: DRES: Benchmarking LLMs for Disfluency Removal

Title: EmbeddingGemma: Powerful and Lightweight Text Representations

Title: Language Models that Think, Chat Better