2026-03-17

Title: Slang Context-based Inference Enhancement via Greedy Search-Guided Chain-of-Thought Prompting

Title: Steering at the Source: Style Modulation Heads for Robust Persona Control

Title: Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems

Title: How Transformers Reject Wrong Answers: Rotational Dynamics of Factual Constraint Processing

Title: Explain in Your Own Words: Improving Reasoning via Token-Selective Dual Knowledge Distillation

Title: Design and evaluation of an agentic workflow for crisis-related synthetic tweet datasets

Title: Widespread Gender and Pronoun Bias in Moral Judgments Across LLMs

Title: Benchmarking Large Language Models on Reference Extraction and Parsing in the Social Sciences and Humanities

Title: Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation

Title: QuarkMedBench: A Real-World Scenario Driven Benchmark for Evaluating Large Language Models

Title: Repetition Without Exclusivity: Scale Sensitivity of Referential Mechanisms in Child-Scale Language Models

Title: Can We Trust LLMs on Memristors? Diving into Reasoning Ability under Non-Ideality

Title: Knowledge Distillation for Large Language Models

Title: LiveWeb-IE: A Benchmark For Online Web Information Extraction

Title: Generate Then Correct: Single Shot Global Correction for Aspect Sentiment Quad Prediction

Title: Projection-Free Evolution Strategies for Continuous Prompt Search

Title: DeceptGuard: A Constitutional Oversight Framework For Detecting Deception in LLM Agents

Title: PMIScore: An Unsupervised Approach to Quantify Dialogue Engagement

Title: APEX-Searcher: Augmenting LLMs' Search Capabilities through Agentic Planning and Execution

Title: GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent

Title: Large Language Models Reproduce Racial Stereotypes When Used for Text Annotation

Title: OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset

Title: ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering

Title: FLUX: Data Worth Training On

Title: Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs

Title: SemEval-2026 Task 6: CLARITY -- Unmasking Political Question Evasions

Title: CMHL: Contrastive Multi-Head Learning for Emotionally Consistent Text Classification

Title: OasisSimp: An Open-source Asian-English Sentence Simplification Dataset

Title: The GELATO Dataset for Legislative NER

Title: MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Title: Selective Fine-Tuning of GPT Architectures for Parameter-Efficient Clinical Text Classification

Title: Rethinking Evaluation in Retrieval-Augmented Personalized Dialogue: A Cognitive and Linguistic Perspective

Title: QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis

Title: Mitigating Overthinking in Large Reasoning Language Models via Reasoning Path Deviation Monitoring

Title: Automatic Inter-document Multi-hop Scientific QA Generation

Title: MedPriv-Bench: Benchmarking the Privacy-Utility Trade-off of Large Language Models in Medical Open-End Question Answering

Title: Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

Title: Motivation in Large Language Models

Title: Exposing Long-Tail Safety Failures in Large Language Models through Efficient Diverse Response Sampling

Title: Extending Minimal Pairs with Ordinal Surprisal Curves and Entropy Across Applied Domains

Title: BiT-MCTS: A Theme-based Bidirectional MCTS Approach to Chinese Fiction Generation

Title: Creative Convergence or Imitation? Genre-Specific Homogeneity in LLM-Generated Chinese Literature

Title: PARSA-Bench: A Comprehensive Persian Audio-Language Model Benchmark

Title: Distilling Reasoning Without Knowledge: A Framework for Reliable LLMs

Title: An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs

Title: AI Can Learn Scientific Taste

Title: Infinite Problem Generator: Verifiably Scaling Physics Reasoning Data with Agentic Workflows

Title: MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection

Title: Multilingual TinyStories: A Synthetic Combinatorial Corpus of Indic Children's Stories for Training Small Language Models

Title: $PA^3$: $\textbf{P}$olicy-$\textbf{A}$ware $\textbf{A}$gent $\textbf{A}$lignment through Chain-of-Thought

Title: Seamless Deception: Larger Language Models Are Better Knowledge Concealers

Title: Towards Next-Generation LLM Training: From the Data-Centric Perspective

Title: Information Asymmetry across Language Varieties: A Case Study on Cantonese-Mandarin and Bavarian-German QA

Title: The Impact of Ideological Discourses in RAG: A Case Study with COVID-19 Treatments

Title: ContiGuard: A Framework for Continual Toxicity Detection Against Evolving Evasive Perturbations

Title: Shopping Companion: A Memory-Augmented LLM Agent for Real-World E-Commerce Tasks

Title: Decision-Level Ordinal Modeling for Multimodal Essay Scoring with Large Language Models

Title: LLMs as Signal Detectors: Sensitivity, Bias, and the Temperature-Criterion Analogy

Title: ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

Title: Beyond Benchmark Islands: Toward Representative Trustworthiness Evaluation for Agentic AI

Title: OrgForge: A Multi-Agent Simulation Framework for Verifiable Synthetic Corporate Corpora

Title: Attention Residuals

Title: Interpretable Predictability-Based AI Text Detection: A Replication Study

Title: Thinking in Latents: Adaptive Anchor Refinement for Implicit Reasoning in LLMs

Title: Writer-R1: Enhancing Generative Writing in LLMs via Memory-augmented Replay Policy Optimization

Title: Indirect Question Answering in English, German and Bavarian: A Challenging Task for High- and Low-Resource Languages Alike

Title: HindSight: Evaluating Research Idea Generation via Future Impact

Title: The Hrunting of AI: Where and How to Improve English Dialectal Fairness

Title: Efficient Document Parsing via Parallel Token Prediction

Title: Bidirectional Chinese and English Passive Sentences Dataset for Machine Translation

Title: Practicing with Language Models Cultivates Human Empathic Communication

Title: From Documents to Spans: Code-Centric Learning for LLM-based ICD Coding

Title: Datasets for Verb Alternations across Languages: BLM Templates and Data Augmentation Strategies

Title: CCTU: A Benchmark for Tool Use under Complex Constraints

Title: DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models

Title: When Does Sparsity Mitigate the Curse of Depth in LLMs

Title: A Closer Look into LLMs for Table Understanding

Title: Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models

Title: SEA-Vision: A Multilingual Benchmark for Comprehensive Document and Scene Text Understanding in Southeast Asia

Title: CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents

Title: Invisible failures in human-AI interactions

Title: ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models

Title: Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

Title: SlovKE: A Large-Scale Dataset and LLM Evaluation for Slovak Keyphrase Extraction

Title: Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation

Title: Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Title: Mechanistic Origin of Moral Indifference in Language Models

Title: Mixture-of-Depths Attention