2026-03-06

Title: CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models

Title: Semantic Containment as a Fundamental Property of Emergent Misalignment

Title: Probing Memes in LLMs: A Paradigm for the Entangled Evaluation World

Title: Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework

Title: SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

Title: One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

Title: Additive Multi-Step Markov Chains and the Curse of Dimensionality in Large Language Models

Title: Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries

Title: The Thinking Boundary: Quantifying Reasoning Suitability of Multimodal Tasks via Dual Tuning

Title: Optimizing What We Trust: Reliability-Guided QUBO Selection of Multi-Agent Weak Framing Signals for Arabic Sentiment Prediction

Title: Same Input, Different Scores: A Multi Model Study on the Inconsistency of LLM Judge

Title: Context-Dependent Affordance Computation in Vision-Language Models

Title: Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

Title: Generating Realistic, Protocol-Compliant Maritime Radio Dialogues using Self-Instruct and Low-Rank Adaptation

Title: What Is Missing: Interpretable Ratings for Large Language Model Outputs

Title: A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

Title: Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

Title: Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

Title: From Static Inference to Dynamic Interaction: Navigating the Landscape of Streaming Large Language Models

Title: Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Title: Coordinated Semantic Alignment and Evidence Constraints for Retrieval-Augmented Generation with Large Language Models

Title: iAgentBench: Benchmarking Sensemaking Capabilities of Information-Seeking Agents on High-Traffic Topics

Title: Stan: An LLM-based thermodynamics course assistant

Title: Optimizing Language Models for Crosslingual Knowledge Consistency

Title: Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

Title: Detection of Illicit Content on Online Marketplaces using Large Language Models

Title: AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments

Title: IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Title: Stacked from One: Multi-Scale Self-Injection for Context Window Extension

Title: TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

Title: Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

Title: Beyond the Context Window: A Cost-Performance Analysis of Fact-Based Memory vs. Long-Context LLMs for Persistent Agents

Title: Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

Title: From Unfamiliar to Familiar: Detecting Pre-training Data via Gradient Deviations in Large Language Models

Title: SinhaLegal: A Benchmark Corpus for Information Extraction and Analysis in Sinhala Legislative Texts

Title: HACHIMI: Scalable and Controllable Student Persona Generation via Orchestrated Agents

Title: FireBench: Evaluating Instruction Following in Enterprise and API-Driven LLM Applications

Title: Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

Title: Can LLMs Capture Expert Uncertainty? A Comparative Analysis of Value Alignment in Ethnographic Qualitative Research

Title: AILS-NTUA at SemEval-2026 Task 10: Agentic LLMs for Psycholinguistic Marker Extraction and Conspiracy Endorsement Detection

Title: AILS-NTUA at SemEval-2026 Task 3: Efficient Dimensional Aspect-Based Sentiment Analysis

Title: Federated Heterogeneous Language Model Optimization for Hybrid Automatic Speech Recognition

Title: LocalSUG: Geography-Aware LLM for Query Suggestion in Local-Life Services

Title: Replaying pre-training data improves fine-tuning

Title: When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

Title: VRM: Teaching Reward Models to Understand Authentic Human Preferences

Title: ThaiSafetyBench: Assessing Language Model Safety in Thai Cultural Contexts

Title: HiFlow: Hierarchical Feedback-Driven Optimization for Constrained Long-Form Text Generation

Title: NeuronMoE: Neuron-Guided Mixture-of-Experts for Efficient Multilingual LLM Extension

Title: Measuring the Redundancy of Decoder Layers in SpeechLLMs

Title: LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting

Title: Feature Resemblance: On the Theoretical Understanding of Analogical Reasoning in Transformers

Title: C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning

Title: Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Title: Transducing Language Models

Title: Diffusion LLMs can think EoS-by-EoS

Title: Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding

Title: VietJobs: A Vietnamese Job Advertisement Dataset

Title: Oral to Web: Digitizing 'Zero Resource'Languages of Bangladesh

Title: Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution

Title: PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration

Title: DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning

Title: Progressive Residual Warmup for Language Model Pretraining

Title: An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs

Title: Ensembling Language Models with Sequential Monte Carlo

Title: FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

Title: Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval

Title: Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought