2026-02-05

Title: Automatic Classification of Pedagogical Materials against CS Curriculum Guidelines

Title: Likelihood-Based Reward Designs for General LLM Reasoning

Title: Transformers perform adaptive partial pooling

Title: On the Credibility of Evaluating LLMs using Survey Questions

Title: Abstraction Induces the Brain Alignment of Language and Speech Models

Title: Expert Selections In MoE Models Reveal (Almost) As Much As Text

Title: DELTA: Deliberative Multi-Agent Reasoning with Reinforcement Learning for Multimodal Psychological Counseling

Title: The Missing Half: Unveiling Training-time Implicit Safety Risks Beyond Deployment

Title: From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents

Title: Enforcing Monotonic Progress in Legal Cross-Examination: Preventing Long-Horizon Stagnation in LLM-Based Inquiry

Title: Language Models Struggle to Use Representations Learned In-Context

Title: Tokenization and Morphological Fidelity in Uralic NLP: A Cross-Lingual Evaluation

Title: CoLT: Reasoning with Chain of Latent Tool Calls

Title: Scaling Agentic Verifier for Competitive Coding

Title: ECG-R1: Protocol-Guided and Modality-Agnostic MLLM for Reliable ECG Interpretation

Title: Contextual Drag: How Errors in the Context Affect LLM Reasoning

Title: Proxy Compression for Language Modeling

Title: Guided Verifier: Collaborative Multimodal Reasoning via Dynamic Process Supervision

Title: How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks

Title: Revisiting Prompt Sensitivity in Large Language Models for Text Classification: The Role of Prompt Underspecification

Title: DeFrame: Debiasing Large Language Models Against Framing Effects

Title: Can Vision Replace Text in Working Memory? Evidence from Spatial n-Back in Vision-Language Models

Title: Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning

Title: Evaluating the Presence of Sex Bias in Clinical Reasoning by Large Language Models

Title: Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts

Title: Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models

Title: History-Guided Iterative Visual Reasoning with Self-Correction

Title: Fine-Grained Activation Steering: Steering Less, Achieving More

Title: No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

Title: Is Micro Domain-Adaptive Pre-Training Effective for Real-World Operations? Multi-Step Evaluation Reveals Potential and Bottlenecks

Title: Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition

Title: Deconstructing sentence disambiguation by joint latent modeling of reading paradigms: LLM surprisal is not enough

Title: PersoDPO: Scalable Preference Optimization for Instruction-Adherent, Persona-Grounded Dialogue via Multi-LLM Evaluation

Title: Model-Dowser: Data-Free Importance Probing to Mitigate Catastrophic Forgetting in Multimodal Large Language Models

Title: $C$-$ΔΘ$: Circuit-Restricted Weight Arithmetic for Selective Refusal

Title: LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding

Title: Rethinking Weight Tying: Pseudo-Inverse Tying for Stable LM Training and Updates

Title: Textual Planning with Explicit Latent Transitions

Title: Can LLMs capture stable human-generated sentence entropy measures?

Title: Semantic Self-Distillation for Language Model Uncertainty

Title: Trust The Typical

Title: VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration

Title: Beyond Holistic Scores: Automatic Trait-Based Quality Scoring of Argumentative Essays

Title: Focus-LIME: Surgical Interpretation of Long-Context Large Language Models via Proxy-Based Neighborhood Selection

Title: Disentangling meaning from language in LLM-based machine translation

Title: LEAD: Layer-wise Expert-aligned Decoding for Faithful Radiology Report Generation

Title: Mapping the Web of Science, a large-scale graph and text-based dataset with LLM embeddings

Title: Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Title: Approaches to Semantic Textual Similarity in Slovak Language: From Algorithms to Transformers

Title: Investigating Disability Representations in Text-to-Image Models

Title: LinGO: A Linguistic Graph Optimization Framework with LLMs for Interpreting Intents of Online Uncivil Discourse

Title: LiteToken: Removing Intermediate Merge Residues From BPE Tokenizers

Title: "Be My Cheese?": Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs

Title: Less Finetuning, Better Retrieval: Rethinking LLM Adaptation for Biomedical Retrievers via Synthetic Data and Model Merging

Title: Alignment Drift in Multimodal LLMs: A Two-Phase, Longitudinal Evaluation of Harm Across Eight Model Releases

Title: Exploiting contextual information to improve stance detection in informal political discourse with LLMs

Title: When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?

Title: Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation

Title: OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Title: SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

Title: Decomposed Prompting Does Not Fix Knowledge Gaps, But Helps Models Say "I Don't Know"

Title: CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation

Title: Reinforced Attention Learning