2025-09-16

Title: Uncovering the Vulnerability of Large Language Models in the Financial Domain via Risk Concealment

Title: No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

Title: Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts

Title: Pluralistic Alignment for Healthcare: A Role-Driven Framework

Title: A Survey on Retrieval And Structuring Augmented Generation with Large Language Models

Title: SearchInstruct: Enhancing Domain Adaptation via Retrieval-Based Instruction Dataset Creation

Title: PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models

Title: Reasoning Under Uncertainty: Exploring Probabilistic Reasoning Capabilities of LLMs

Title: Automated MCQA Benchmarking at Scale: Evaluating Reasoning Traces as Retrieval Sources for Domain Adaptation of Small Language Models

Title: RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems

Title: Judge Q: Trainable Queries for Optimized Information Retention in KV Cache Eviction

Title: Towards Automated Error Discovery: A Study in Conversational AI

Title: Evaluating Large Language Models for Evidence-Based Clinical Question Answering

Title: GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings

Title: Quantifier Scope Interpretation in Language Learners and LLMs

Title: CultureSynth: A Hierarchical Taxonomy-Guided and Retrieval-Augmented Framework for Cultural Question-Answer Synthesis

Title: Aligning ESG Controversy Data with International Guidelines through Semi-Automatic Ontology Construction

Title: Introducing Spotlight: A Novel Approach for Generating Captivating Key Information from Documents

Title: An Interpretable Benchmark for Clickbait Detection and Tactic Attribution

Title: EmoBench-Reddit: A Hierarchical Benchmark for Evaluating the Emotional Intelligence of Multimodal Large Language Models

Title: Fluid Language Model Benchmarking

Title: We Argue to Agree: Towards Personality-Driven Argumentation-Based Negotiation Dialogue Systems for Tourism

Title: Joint Effects of Argumentation Theory, Audio Modality and Data Enrichment on LLM-Based Fallacy Classification

Title: When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity

Title: Text2Mem: A Unified Memory Operation Language for Memory Operating System

Title: Differentially-private text generation degrades output language quality

Title: Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs

Title: RanAT4BIE: Random Adversarial Training for Biomedical Information Extraction

Title: The Prompt Engineering Report Distilled: Quick Start Guide for Life Sciences

Title: Ko-PIQA: A Korean Physical Commonsense Reasoning Dataset with Cultural Context

Title: !MSA at AraHealthQA 2025 Shared Task: Enhancing LLM Performance for Arabic Clinical Question Answering through Prompt Engineering and Ensemble Learning

Title: Transformer Enhanced Relation Classification: A Comparative Analysis of Contextuality, Data Efficiency and Sequence Complexity

Title: Continually Adding New Languages to Multilingual Language Models

Title: CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media

Title: CEMTM: Contextual Embedding-based Multimodal Topic Modeling

Title: Improving LLMs' Learning for Coreference Resolution

Title: ClaimIQ at CheckThat! 2025: Comparing Prompted and Fine-Tuned Language Models for Verifying Numerical Claims

Title: AKCIT-FN at CheckThat! 2025: Switching Fine-Tuned SLMs and LLM Prompting for Multilingual Claim Normalization

Title: LVLMs are Bad at Overhearing Human Referential Communication

Title: PeruMedQA: Benchmarking Large Language Models (LLMs) on Peruvian Medical Exams - Dataset Construction and Evaluation

Title: HARP: Hallucination Detection via Reasoning Subspace Projection

Title: HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking

Title: D$^2$HScore: Reasoning-Aware Hallucination Detection via Semantic Breadth and Depth Analysis in LLMs

Title: Bhaasha, Bhasa, Zaban: A Survey for Low-Resourced Languages in South Asia - Current Stage and Challenges

Title: Analyzing Information-Seeking Behaviors in a Hakka AI Chatbot: A Cognitive-Pragmatic Study

Title: HalluDetect: Detecting, Mitigating, and Benchmarking Hallucinations in Conversational Systems

Title: AesBiasBench: Evaluating Bias and Alignment in Multimodal Language Models for Personalized Image Aesthetic Assessment

Title: EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI

Title: A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News Detection

Title: CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model

Title: An Agentic Toolkit for Adaptive Information Extraction from Regulatory Documents

Title: User eXperience Perception Insights Dataset (UXPID): Synthetic User Feedback from Public Industrial Forums

Title: When Curiosity Signals Danger: Predicting Health Crises Through Online Medication Inquiries

Title: From Fuzzy Speech to Medical Insight: Benchmarking LLMs on Noisy Patient Narratives

Title: MOOM: Maintenance, Organization and Optimization of Memory in Ultra-Long Role-Playing Dialogues

Title: Growing Perspectives: Modelling Embodied Perspective Taking and Inner Narrative Development Using Large Language Models

Title: Uncertainty in Authorship: Why Perfect AI Detection Is Mathematically Impossible

Title: Designing LLMs for cultural sensitivity: Evidence from English-Japanese translation

Title: Spec-LLaVA: Accelerating Vision-Language Models with Dynamic Tree-Based Speculative Decoding

Title: ToolRM: Outcome Reward Models for Tool-Calling Large Language Models

Title: Text Adaptation to Plain Language and Easy Read via Automatic Post-Editing Cycles

Title: Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect

Title: Is 'Hope' a person or an idea? A pilot benchmark for NER: comparing traditional NLP tools and large language models on ambiguous entities

Title: GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models

Title: CBP-Tuning: Efficient Local Customization for Black-box Large Language Models

Title: XplaiNLP at CheckThat! 2025: Multilingual Subjectivity Detection with Finetuned Transformers and Prompt-Based Inference with Large Language Models

Title: Pun Unintended: LLMs and the Illusion of Humor Understanding

Title: RAGs to Riches: RAG-like Few-shot Learning for Large Language Model Role-playing

Title: Preservation of Language Understanding Capabilities in Speech-aware Large Language Models