2026-02-11

Title: Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-Author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection

Title: Effective Reasoning Chains Reduce Intrinsic Dimensionality

Title: Don't Shoot The Breeze: Topic Continuity Model Using Nonlinear Naive Bayes With Attention

Title: Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization

Title: FM SO.P: A Progressive Task Mixture Framework with Automatic Evaluation for Cross-Domain SOP Understanding

Title: Understanding Risk and Dependency in AI Chatbot Use from User Discourse

Title: Digital Linguistic Bias in Spanish: Evidence from Lexical Variation in LLMs

Title: AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis

Title: BiasScope: Towards Automated Detection of Bias in LLM-as-a-Judge Evaluation

Title: Contractual Deepfakes: Can Large Language Models Generate Contracts?

Title: Effective vocabulary expanding of multilingual language models for extremely low-resource languages

Title: Are Language Models Sensitive to Morally Irrelevant Distractors?

Title: Breaking the Pre-Sampling Barrier: Activation-Informed Difficulty-Aware Self-Consistency

Title: Evaluating Social Bias in RAG Systems: When External Context Helps and Reasoning Hurts

Title: Conceptual Cultural Index: A Metric for Cultural Specificity via Relative Generality

Title: NOWJ @BioCreative IX ToxHabits: An Ensemble Deep Learning Approach for Detecting Substance Use and Contextual Information in Clinical Texts

Title: Listen to the Layers: Mitigating Hallucinations with Inter-Layer Disagreement

Title: Where-to-Unmask: Ground-Truth-Guided Unmasking Order Learning for Masked Diffusion Language Models

Title: EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Title: Knowledge Integration Decay in Search-Augmented Reasoning of Large Language Models

Title: UniARM: Towards a Unified Autoregressive Reward Model for Multi-Objective Test-Time Alignment

Title: Comprehensive Comparison of RAG Methods Across Multi-Domain Conversational QA

Title: Advancing Block Diffusion Language Models for Test-Time Scaling

Title: LEMUR: A Corpus for Robust Fine-Tuning of Multilingual Law Embedding Models for Retrieval

Title: Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs

Title: Context-Aware Counterfactual Data Augmentation for Gender Bias Mitigation in Language Models

Title: On the Optimal Reasoning Length for RL-Trained Language Models

Title: Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning

Title: AlignTune: Modular Toolkit for Post-Training Alignment of Large Language Models

Title: MILE-RefHumEval: A Reference-Free, Multi-Independent LLM Framework for Human-Aligned Evaluation

Title: MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering

Title: Maastricht University at AMIYA: Adapting LLMs for Dialectal Arabic using Fine-tuning and MBR Decoding

Title: TraceMem: Weaving Narrative Memory Schemata from User Conversational Traces

Title: Unsupervised Layer-Wise Dynamic Test Time Adaptation for LLMs

Title: AI-Assisted Scientific Assessment: A Case Study on Climate Change

Title: Improving Interpretability of Lexical Semantic Change with Neurobiological Features

Title: Decomposing Reasoning Efficiency in Large Language Models

Title: AnalyticsGPT: An LLM Workflow for Scientometric Question Answering

Title: Text summarization via global structure awareness

Title: From FusHa to Folk: Exploring Cross-Lingual Transfer in Arabic Language Models

Title: LLM Reasoning Predicts When Models Are Right: Evidence from Coding Classroom Discourse

Title: SinFoS: A Parallel Dataset for Translating Sinhala Figures of Speech

Title: Steer2Edit: From Activation Steering to Component-Level Editing

Title: The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies

Title: AmharicIR+Instr: A Two-Dataset Resource for Neural Retrieval and Instruction Tuning

Title: LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

Title: A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models

Title: SCORE: Specificity, Context Utilization, Robustness, and Relevance for Reference-Free LLM Evaluation

Title: Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference

Title: Anagent For Enhancing Scientific Table & Figure Analysis

Title: Quantum-Audit: Evaluating the Reasoning Limits of LLMs on Quantum Computing