2026-03-24

Title: Enhancing Safety of Large Language Models via Embedding Space Separation

Title: RedacBench: Can AI Erase Your Secrets?

Title: Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs

Title: Fast-Slow Thinking RM: Efficient Integration of Scalar and Generative Reward Models

Title: Multi-Agent Debate with Memory Masking

Title: Locally Coherent Parallel Decoding in Diffusion Language Models

Title: Expected Reward Prediction, with Applications to Model Routing

Title: An experimental study of KV cache reuse strategies in chunk-level caching systems

Title: Thinking into the Future: Latent Lookahead Training for Transformers

Title: Beyond Test-Time Compute Strategies: Advocating Energy-per-Token in LLM Inference

Title: Decoding the decoder: Contextual sequence-to-sequence modeling for intracortical speech decoding

Title: FinReflectKG -- HalluBench: GraphRAG Hallucination Benchmark for Financial Question Answering Systems

Title: SciNav: A General Agent Framework for Scientific Coding Tasks

Title: The production of meaning in the processing of natural language

Title: Coding Agents are Effective Long-Context Processors

Title: A Training-Free Regeneration Paradigm: Contrastive Reflection Memory Guided Self-Verification and Self-Improvement

Title: Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Title: Diffutron: A Masked Diffusion Language Model for Turkish Language

Title: PARHAF, a human-authored corpus of clinical reports for fictitious patients in French

Title: Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study

Title: Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

Title: JUBAKU: An Adversarial Benchmark for Exposing Culturally Grounded Stereotypes in Japanese LLMs

Title: A Modular LLM Framework for Explainable Price Outlier Detection

Title: Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention

Title: Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models

Title: PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

Title: Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks

Title: MzansiText and MzansiLM: An Open Corpus and Decoder-Only Language Model for South African Languages

Title: Code-MIE: A Code-style Model for Multimodal Information Extraction with Scene Graph and Entity Attribute Knowledge Enhancement

Title: The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing

Title: RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

Title: BenchBench: Benchmarking Automated Benchmark Generation

Title: HiCI: Hierarchical Construction-Integration for Long-Context Attention

Title: Can ChatGPT Really Understand Modern Chinese Poetry?

Title: SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

Title: NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation

Title: LLM Router: Prefill is All You Need

Title: Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach

Title: The Hidden Puppet Master: A Theoretical and Real-World Account of Emotional Manipulation in LLMs

Title: User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

Title: Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models

Title: DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

Title: Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO

Title: Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models

Title: Evaluating Reasoning-Based Scaffolds for Human-AI Co-Annotation: The ReasonAlign Annotation Protocol

Title: Many Dialects, Many Languages, One Cultural Lens: Evaluating Multilingual VLMs for Bengali Culture Understanding Across Historically Linked Languages and Regional Dialects

Title: Entropy Alone is Insufficient for Safe Selective Prediction in LLMs

Title: Explainable Semantic Textual Similarity via Dissimilar Span Detection

Title: Context Selection for Hypothesis and Statistical Evidence Extraction from Full-Text Scientific Articles

Title: Graph Fusion Across Languages using Large Language Models

Title: Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations

Title: More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection

Title: enhancing reasoning accuracy in large language models during inference time

Title: TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols

Title: Beyond Memorization: Distinguishing between Reductive and Epistemic Reasoning in LLMs using Classic Logic Puzzles

Title: Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF

Title: Conspiracy Frame: a Semiotically-Driven Approach for Conspiracy Theories Detection

Title: Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

Title: Multi-Perspective LLM Annotations for Valid Analyses in Subjective Tasks

Title: Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs

Title: PROMPT2BOX: Uncovering Entailment Structure among LLM Prompts

Title: KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning

Title: Cross-Context Verification: Hierarchical Detection of Benchmark Contamination through Session-Isolated Analysis

Title: DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

Title: TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild

Title: Effective Strategies for Asynchronous Software Engineering Agents

Title: Agentic Automation of BT-RADS Scoring: End-to-End Multi-Agent System for Standardized Brain Tumor Follow-up Assessment

Title: Generalizable Self-Evolving Memory for Automatic Prompt Optimization

Title: CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs

Title: SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification

Title: DATASHI: A Parallel English-Tashlhiyt Corpus for Orthography Normalization and Low-Resource Language Processing

Title: A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures

Title: TAMTRL: Teacher-Aligned Reward Reshaping for Multi-Turn Reinforcement Learning in Long-Context Compression

Title: Optimizing Multi-Agent Weather Captioning via Text Gradient Descent: A Training-Free Approach with Consensus-Aware Gradient Fusion

Title: Probing How Scalable Table Data Enhances General Long-Context Reasoning

Title: SemEval-2026 Task 12: Abductive Event Reasoning: Towards Real-World Event Causal Inference for Large Language Models

Title: Riding Brainwaves in LLM Space: Understanding Activation Patterns Using Individual Neural Signatures

Title: SLURP-TN : Resource for Tunisian Dialect Spoken Language Understanding

Title: Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of Lora, Prompt Tuning, and Full Fine-Tuning

Title: Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch

Title: Autoregressive vs. Masked Diffusion Language Models: A Controlled Comparison

Title: Multiperspectivity as a Resource for Narrative Similarity Prediction

Title: Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation

Title: Gumbel Distillation for Parallel Text Generation

Title: MemDLM: Memory-Enhanced DLM Training

Title: Greater accessibility can amplify discrimination in generative AI

Title: TiCo: Time-Controllable Training for Spoken Dialogue Models