2025-09-15

Title: Cross-Layer Attention Probing for Fine-Grained Hallucination Detection

Title: Creativity Benchmark: A benchmark for marketing creativity for LLM models

Title: CTCC: A Robust and Stealthy Fingerprinting Framework for Large Language Models via Cross-Turn Contextual Correlation Backdoor

Title: Temporal Preferences in Language Models for Long-Horizon Assistance

Title: The Non-Determinism of Small LLMs: Evidence of Low Answer Consistency in Repetition Trials of Standard Multiple-Choice Benchmarks

Title: Beyond I'm Sorry, I Can't: Dissecting Large Language Model Refusal

Title: Assisting Research Proposal Writing with Large Language Models: Evaluation and Refinement

Title: Generating Individual Travel Diaries Using Large Language Models Informed by Census and Land-Use Data

Title: Psychiatry-Bench: A Multi-Task Benchmark for LLMs in Psychiatry

Title: The Thinking Therapist: Training Large Language Models to Deliver Acceptance and Commitment Therapy using Supervised Fine-Tuning and Odds Ratio Policy Optimization

Title: HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering

Title: How Small Transformation Expose the Weakness of Semantic Similarity Measures

Title: Investigating Symbolic Triggers of Hallucination in Gemma Models Across HaluEval and TruthfulQA

Title: ALIGNS: Unlocking nomological networks in psychological measurement through a large language model

Title: DiTTO-LLM: Framework for Discovering Topic-based Technology Opportunities via Large Language Model

Title: Natural Language Translation of Formal Proofs through Informalization of Proof Steps and Recursive Summarization along Proof Structure

Title: A Role-Aware Multi-Agent Framework for Financial Education Question Answering with LLMs

Title: A meta-analysis on the performance of machine-learning based language models for sentiment analysis

Title: Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning

Title: MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools

Title: Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation

Title: HEFT: A Coarse-to-Fine Hierarchy for Enhancing the Efficiency and Accuracy of Language Model Reasoning

Title: Topic-Guided Reinforcement Learning with LLMs for Enhancing Multi-Document Summarization

Title: Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case

Title: Large Language Models Meet Legal Artificial Intelligence: A Survey

Title: Unsupervised Hallucination Detection by Inspecting Reasoning Processes

Title: Multi-Intent Recognition in Dialogue Understanding: A Comparison Between Smaller Open-Source LLMs

Title: Established Psychometric vs. Ecologically Valid Questionnaires: Rethinking Psychological Assessments in Large Language Models

Title: Querying Climate Knowledge: Semantic Retrieval for Scientific Discovery

Title: Arabic Large Language Models for Medical Text Generation

Title: Scaling Arabic Medical Chatbots Using Synthetic Data: Enhancing Generative AI with Synthetic Patient Records

Title: Population-Aligned Persona Generation for LLM-based Social Simulation

Title: Towards Reliable and Interpretable Document Question Answering via VLMs

Title: Benchmark of stylistic variation in LLM-generated texts

Title: Incongruent Positivity: When Miscalibrated Positivity Undermines Online Supportive Conversations

Title: Beyond Token Limits: Assessing Language Model Performance on Long Text Classification

Title: SI-FACT: Mitigating Knowledge Conflict via Self-Improving Faithfulness-Aware Contrastive Tuning

Title: Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs

Title: Is In-Context Learning Learning?

Title: Long Context Automated Essay Scoring with Language Models

Title: RefactorCoderQA: Benchmarking LLMs for Multi-Domain Coding Question Solutions in Cloud and Edge Deployment

Title: DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL