2026-02-02

Title: In Vino Veritas and Vulnerabilities: Examining LLM Safety via Drunk Language Inducement

Title: Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning

Title: MERMAID: Memory-Enhanced Retrieval and Reasoning with Multi-Agent Iterative Knowledge Grounding for Veracity Assessment

Title: Context Structure Reshapes the Representational Geometry of Language Models

Title: Stability-Aware Prompt Optimization for Clinical Data Abstraction

Title: SPLA: Block Sparse Plus Linear Attention for Long Context Modeling

Title: SP^2DPO: An LLM-assisted Semantic Per-Pair DPO Generalization

Title: Specialists or Generalists? Multi-Agent and Single-Agent LLMs for Essay Grading

Title: Culturally Grounded Personas in Large Language Models: Characterization and Alignment with Socio-Psychological Value Frameworks

Title: Bifocal Attention: Harmonizing Geometric and Spectral Positional Embeddings for Algorithmic Generalization

Title: Word-Centered Semantic Graphs for Interpretable Diachronic Sense Tracking

Title: Large Language Model Agents Are Not Always Faithful Self-Evolvers

Title: Stop Jostling: Adaptive Negative Sampling Reduces the Marginalization of Low-Resource Language Tokens by Cross-Entropy Loss

Title: SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Title: Mock Worlds, Real Skills: Building Small Agentic Language Models with Synthetic Tasks, Simulated Environments, and Rubric-Based Rewards

Title: $ρ$-$\texttt{EOS}$: Training-free Bidirectional Variable-Length Control for Masked Diffusion LLMs

Title: Towards the Holographic Characteristic of LLMs for Efficient Short-text Generation

Title: Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations

Title: SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Title: Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry

Title: Language Model Circuits Are Sparse in the Neuron Basis

Title: Layer-wise Swapping for Generalizable Multilingual Safety

Title: Time-Annealed Perturbation Sampling: Diverse Generation for Diffusion Language Models

Title: DART-ing Through the Drift: Dynamic Tracing of Knowledge Neurons for Adaptive Inference-Time Pruning

Title: NAG: A Unified Native Architecture for Encoder-free Text-Graph Modeling in Language Models

Title: TSLM: Tree-Structured Language Modeling for Divergent Thinking

Title: FNF: Functional Network Fingerprint for Large Language Models

Title: Models Know Models Best: Evaluation via Model-Preferred Formats

Title: MM-THEBench: Do Reasoning MLLMs Think Reasonably?

Title: AR-BENCH: Benchmarking Legal Reasoning with Judgment Error Detection, Classification and Correction

Title: RASST: Fast Cross-modal Retrieval-Augmented Simultaneous Speech Translation

Title: Sparse or Dense? A Mechanistic Estimation of Computation Density in Transformer-based LLMs

Title: When Meanings Meet: Investigating the Emergence and Quality of Shared Concept Spaces during Multilingual Language Model Training

Title: Leveraging LLMs For Turkish Skill Extraction

Title: Should LLMs, $\textit{like}$, Generate How Users Talk? Building Dialect-Accurate Dialog[ue]s Beyond the American Default with MDial

Title: DiffuSpeech: Silent Thought, Spoken Answer via Unified Speech-Text Diffusion

Title: LLMs Explain't: A Post-Mortem on Semantic Interpretability in Transformer Models

Title: Benchmarking Machine Translation on Chinese Social Media Texts

Title: Relaxing Positional Alignment in Masked Diffusion Language Models

Title: Autonomous Chain-of-Thought Distillation for Graph-Based Fraud Detection

Title: Residual Context Diffusion Language Models

Title: A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training

Title: ArabicDialectHub: A Cross-Dialectal Arabic Learning Resource and Platform

Title: Bias Beyond Borders: Political Ideology Evaluation and Steering in Multilingual LLMs

Title: InstructDiff: Domain-Adaptive Data Selection via Differential Entropy for Efficient LLM Fine-Tuning

Title: DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis

Title: Character as a Latent Variable in Large Language Models: A Mechanistic Account of Emergent Misalignment and Conditional Safety Failures

Title: Safer Policy Compliance with Dynamic Epistemic Fallback

Title: Evaluating the Utility of Grounding Documents with Reference-Free LLM-based Metrics

Title: Monotonic Reference-Free Refinement for Autoformalization

Title: FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation

Title: JobResQA: A Benchmark for LLM Machine Reading Comprehension on Multilingual Résumés and JDs

Title: ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought

Title: Deep Search with Hierarchical Meta-Cognitive Monitoring Inspired by Cognitive Neuroscience

Title: Are you going to finish that? A Practical Study of the Tokenization Boundary Problem

Title: Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models

Title: PaperBanana: Automating Academic Illustration for AI Scientists

Title: UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection