2025-11-07

Title: Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs

Title: TextualVerifier: Verify TextGrad Step-by-Step

Title: GRDD+: An Extended Greek Dialectal Dataset with Cross-Architecture Fine-tuning Evaluation

Title: PLLuM: A Family of Polish Large Language Models

Title: STARS: Segment-level Token Alignment with Rejection Sampling in Large Language Models

Title: Divide, Cache, Conquer: Dichotomic Prompting for Efficient Multi-Label LLM-Based Classification

Title: Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens

Title: GRAD: Graph-Retrieved Adaptive Decoding for Hallucination Mitigation

Title: Context informs pragmatic interpretation in vision-language models

Title: The Human Flourishing Geographic Index: A County-Level Dataset for the United States, 2013–2023

Title: Direct Semantic Communication Between Large Language Models via Vector Translation

Title: Abductive Inference in Retrieval-Augmented Language Models: Generating and Validating Missing Premises

Title: T-FIX: Text-Based Explanations with Features Interpretable to eXperts

Title: Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering

Title: The truth is no diaper: Human and AI-generated associations to emotional words

Title: Batch Prompting Suppresses Overthinking Reasoning Under Constraint: How Batch Prompting Suppresses Overthinking in Reasoning Models

Title: RIDE: Difficulty Evolving Perturbation with Item Response Theory for Mathematical Reasoning

Title: CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese

Title: BAPPA: Benchmarking Agents, Plans, and Pipelines for Automated Text-to-SQL Generation

Title: Trustworthy LLM-Mediated Communication: Evaluating Information Fidelity in LLM as a Communicator (LAAC) Framework in Multiple Application Domains

Title: Computational Turing Test Reveals Systematic Differences Between Human and AI Language

Title: LLM-as-a-Judge is Bad, Based on AI Attempting the Exam Qualifying for the Member of the Polish National Board of Appeal

Title: REMIND: Input Loss Landscapes Reveal Residual Memorization in Post-Unlearning LLMs

Title: Reusing Pre-Training Data at Test Time is a Compute Multiplier

Title: Efficient Topic Extraction via Graph-Based Labeling: A Lightweight Alternative to Deep Models

Title: SSPO: Subsentence-level Policy Optimization

Title: If I Could Turn Back Time: Temporal Reframing as a Historical Reasoning Task for LLMs

Title: ThaiOCRBench: A Task-Diverse Benchmark for Vision-Language Understanding in Thai

Title: RUST-BENCH: Benchmarking LLM Reasoning on Unstructured Text within Structured Tables

Title: OUNLP at TSAR 2025 Shared Task: Multi-Round Text Simplifier via Code Generation

Title: Decoding Emergent Big Five Traits in Large Language Models: Temperature-Dependent Expression and Architectural Clustering

Title: RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG

Title: Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways

Title: Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics

Title: IntelliProof: An Argumentation Network-based Conversational Helper for Organized Reflection

Title: From Model to Breach: Towards Actionable LLM-Generated Vulnerabilities Reporting

Title: BanglaMedQA and BanglaMMedBench: Evaluating Retrieval-Augmented Generation Strategies for Bangla Biomedical Question Answering

Title: When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection

Title: Logit-Entropy Adaptive Stopping Heuristic for Efficient Chain-of-Thought Reasoning