2026-03-31

Title: GeoBlock: Inferring Block Granularity from Dependency Geometry in Diffusion Language Models

Title: AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment

Title: The Cognitive Divergence: AI Context Windows, Human Attention Decline, and the Delegation Feedback Loop

Title: Do Multilingual VLMs Reason Equally? A Cross-Lingual Visual Reasoning Audit for Indian Languages

Title: LogicDiff: Logic-Guided Denoising Improves Reasoning in Masked Diffusion Language Models

Title: Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Document-Routed Retrieval

Title: Arithmetic OOD Failure Unfolds in Stages in Minimal GPTs

Title: Magic Words or Methodical Work? Challenging Conventional Wisdom in LLM-Based Political Text Annotation

Title: The Last Fingerprint: How Markdown Training Shapes LLM Prose

Title: RASPRef: Retrieval-Augmented Self-Supervised Prompt Refinement for Large Reasoning Models

Title: TAPS: Task Aware Proposal Distributions for Speculative Sampling

Title: Debiasing Large Language Models toward Social Factors in Online Behavior Analytics through Prompt Knowledge Tuning

Title: Story2Proposal: A Scaffold for Structured Scientific Paper Writing

Title: Routing Sensitivity Without Controllability: A Diagnostic Study of Fairness in MoE Language Models

Title: Learning to Predict Future-Aligned Research Proposals with Language Models

Title: Rethinking Easy-to-Hard: Limits of Curriculum Learning in Post-Training for Deductive Reasoning

Title: SCOPE: Tree-based Self-Correcting Online Log Parsing via Syntactic-Semantic Collaboration

Title: Mitigating Hallucination on Hallucination in RAG via Ensemble Voting

Title: SACRED: A Faithful Annotated Multimedia Multimodal Multilingual Dataset for Classifying Connectedness Types in Online Spirituality

Title: PubMed Reasoner: Dynamic Reasoning-based Retrieval for Evidence-Grounded Biomedical Question Answering

Title: Culturally Adaptive Explainable LLM Assessment for Multilingual Information Disorder: A Human-in-the-Loop Approach

Title: Improving Attributed Long-form Question Answering with Intent Awareness

Title: Multi-Agent Dialectical Refinement for Enhanced Argument Classification

Title: AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Title: Over-Refusal and Representation Subspaces: A Mechanistic Analysis of Task-Conditioned Refusal in Aligned LLMs

Title: Hidden Ads: Behavior Triggered Semantic Backdoors for Advertisement Injection in Vision Language Models

Title: Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents

Title: PRBench: End-to-end Paper Reproduction in Physics Research

Title: Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs

Title: Can Large Language Models Simulate Human Cognition Beyond Behavioral Imitation?

Title: KAT-Coder-V2 Technical Report

Title: Retromorphic Testing with Hierarchical Verification for Hallucination Detection in RAG

Title: TailNLG: A Multilingual Benchmark Addressing Verbalization of Long-Tail Entities

Title: Understanding Teacher Revisions of Large Language Model-Generated Feedback

Title: Conversational Agents and the Understanding of Human Language: Reflections on AI, LLMs, and Cognitive Science

Title: Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning

Title: ProText: A benchmark dataset for measuring (mis)gendering in long-form texts

Title: Model Capability Dominates: Inference-Time Optimization Lessons from AIMO 3

Title: What can LLMs tell us about the mechanisms behind polarity illusions in humans? Experiments across model scales and training steps

Title: KazByte: Adapting Qwen models to Kazakh via Byte-level Adapter

Title: HumMusQA: A Human-written Music Understanding QA Benchmark Dataset

Title: Article and Comment Frames Shape the Quality of Online Comments

Title: EnsemJudge: Enhancing Reliability in Chinese LLM-Generated Text Detection through Diverse Model Ensembles

Title: On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR

Title: Rethinking Atomic Decomposition for LLM Judges: A Prompt-Controlled Study of Reference-Grounded QA Evaluation

Title: Who Wrote the Book? Detecting and Attributing LLM Ghostwriters

Title: From Reviews to Requirements: Can LLMs Generate Human-Like User Stories?

Title: DongYuan: An LLM-Based Framework for Integrative Chinese and Western Medicine Spleen-Stomach Disorders Diagnosis

Title: Versteasch du mi? Computational and Socio-Linguistic Perspectives on GenAI, LLMs, and Non-Standard Language

Title: Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries

Title: Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights

Title: The Necessity of Setting Temperature in LLM-as-a-Judge

Title: Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization

Title: Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design

Title: Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification

Title: EarlySciRev: A Dataset of Early-Stage Scientific Revisions Extracted from LaTeX Writing Traces

Title: GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum

Title: Compressing Transformer Language Models via Matrix Product Operator Decomposition: A Case Study on PicoGPT

Title: Training data generation for context-dependent rubric-based short answer grading

Title: EpiScreen: Early Epilepsy Detection from Electronic Health Records with Large Language Models

Title: Adaptive Block-Scaled Data Types