2026-03-04

Title: Characterizing Memorization in Diffusion Language Models: Generalized Extraction and Sampling Effects

Title: Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

Title: CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

Title: How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Title: ExpGuard: LLM Content Moderation in Specialized Domains

Title: GPUTOK: GPU Accelerated Byte Level BPE Tokenization

Title: Think, But Don't Overthink: Reproducing Recursive Language Models

Title: Cross-Family Speculative Prefill: Training-Free Long-Context Compression with Small Draft Models

Title: Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Title: Evaluating Cross-Modal Reasoning Ability and Problem Characteristics with Multimodal Item Response Theory

Title: ITLC at SemEval-2026 Task 11: Normalization and Deterministic Parsing for Formal Reasoning in LLMs

Title: HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

Title: Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

Title: Sensory-Aware Sequential Recommendation via Review-Distilled Representations

Title: Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration

Title: From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench

Title: OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

Title: Faster, Cheaper, More Accurate: Specialised Knowledge Tracing Models Outperform LLMs

Title: Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

Title: LaTeX Compilation: Challenges in the Era of LLMs

Title: Eval4Sim: An Evaluation Framework for Persona Simulation

Title: Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction

Title: ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

Title: MaBERT:A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling

Title: TrustMH-Bench: A Comprehensive Benchmark for Evaluating the Trustworthiness of Large Language Models in Mental Health

Title: PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

Title: TAO-Attack: Toward Advanced Optimization-Based Jailbreak Attacks for Large Language Models

Title: Compact Prompting in Instruction-tuned LLMs for Joint Argumentative Component Detection

Title: Evaluating Performance Drift from Model Switching in Multi-Turn LLM Systems

Title: UniSkill: A Dataset for Matching University Curricula to Professional Competencies

Title: APRES: An Agentic Paper Revision and Evaluation System

Title: BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Title: Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Title: Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use