2026-02-10

Title: Does Visual Rendering Bypass Tokenization? Investigating Script-Tokenizer Misalignment in Pixel-Based Language Models

Title: BiomechAgent: AI-Assisted Biomechanical Analysis Through Code-Generating Agents

Title: Bridging the Knowledge Void: Inference-time Acquisition of Unfamiliar Programming Languages for Coding Tasks

Title: Anchored Decoding: Provably Reducing Copyright Risk for Any Language Model

Title: Your Language Model Secretly Contains Personality Subnetworks

Title: Open TutorAI: An Open-source Platform for Personalized and Immersive Learning with Generative AI

Title: Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs

Title: Equipping LLM with Directional Multi-Talker Speech Understanding Capabilities

Title: Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice

Title: Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation

Title: ViHERMES: A Graph-Grounded Multihop Question Answering Benchmark and System for Vietnamese Healthcare Regulations

Title: TernaryLM: Memory-Efficient Language Modeling via Native 1-Bit Quantization with Adaptive Layer-wise Scaling

Title: Efficient Post-Training Pruning of Large Language Models with Statistical Correction

Title: Do Large Language Models Reflect Demographic Pluralism in Safety?

Title: When the Model Said 'No Comment', We Knew Helpfulness Was Dead, Honesty Was Alive, and Safety Was Terrified

Title: Advantages of Domain Knowledge Injection for Legal Document Summarization: A Case Study on Summarizing Indian Court Judgments in English and Hindi

Title: DLLM Agent: See Farther, Run Faster

Title: SED-SFT: Selectively Encouraging Diversity in Supervised Fine-Tuning

Title: From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection

Title: Let's Simplify Step by Step: Guiding LLM Towards Multilingual Unsupervised Proficiency-Controlled Sentence Simplification

Title: Improving Variable-Length Generation in Diffusion Language Models via Length Regularization

Title: Learning to Self-Verify Makes Language Models Better Reasoners

Title: SciClaimEval: Cross-modal Claim Verification in Scientific Papers

Title: Letting Tutor Personas "Speak Up" for LLMs: Learning Steering Vectors from Dialogue via Preference Optimization

Title: Blind to the Human Touch: Overlap Bias in LLM-Based Summary Evaluation

Title: SRR-Judge: Step-Level Rating and Refinement for Enhancing Search-Integrated Reasoning in Search Agents

Title: Attn-GS: Attention-Guided Context Compression for Efficient Personalized LLMs

Title: Emergent Structured Representations Support Flexible In-Context Inference in Large Language Models

Title: Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged Agents

Title: Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models

Title: LLMs Know More About Numbers than They Can Say

Title: TodoEvolve: Learning to Architect Agent Planning Systems

Title: Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers

Title: SparseEval: Efficient Evaluation of Large Language Models by Sparse Optimization

Title: Patches of Nonlinearity: Instruction Vectors in Large Language Models

Title: Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation

Title: Lost in Translation? A Comparative Study on the Cross-Lingual Transfer of Composite Harms

Title: Cross-Linguistic Persona-Driven Data Synthesis for Robust Multimodal Cognitive Decline Detection

Title: The Judge Who Never Admits: Hidden Shortcuts in LLM-based Evaluation

Title: DeltaKV: Residual-Based KV Cache Compression via Long-Range Similarity

Title: Diverge to Induce Prompting: Multi-Rationale Induction for Zero-Shot Reasoning

Title: Beyond Raw Detection Scores: Markov-Informed Calibration for Boosting Machine-Generated Text Detection

Title: TDGNet: Hallucination Detection in Diffusion Language Models via Temporal Dynamic Graphs

Title: Emergent Search and Backtracking in Latent Reasoning Models

Title: Gender and Race Bias in Consumer Product Recommendations by Large Language Models

Title: DIAL-SUMMER: A Structured Evaluation Framework of Hierarchical Errors in Dialogue Summaries

Title: LLMs and people both learn to form conventions -- just not with each other

Title: Pretraining with Token-Level Adaptive Latent Chain-of-Thought

Title: CoRect: Context-Aware Logit Contrast for Hidden State Rectification to Resolve Knowledge Conflicts

Title: When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Title: Document Reconstruction Unlocks Scalable Long-Context RLVR

Title: Language Predicts Identity Fusion Across Cultures and Reveals Divergent Pathways to Violence

Title: Language Modeling and Understanding Through Paraphrase Generation and Detection

Title: New Skills or Sharper Primitives? A Probabilistic Perspective on the Emergence of Reasoning in RLVR

Title: When Does Context Help? Error Dynamics of Contextual Information in Large Language Models

Title: Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Title: Latent Reasoning with Supervised Thinking States

Title: UReason: Benchmarking the Reasoning Paradox in Unified Multimodal Models

Title: WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints

Title: ViGoEmotions: A Benchmark Dataset For Fine-grained Emotion Detection on Vietnamese Texts

Title: Dynamic Long Context Reasoning over Compressed Memory via End-to-End Reinforcement Learning

Title: TEAM: Temporal-Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration

Title: Prism: Spectral-Aware Block-Sparse Attention

Title: Large Language Models and Impossible Language Acquisition: "False Promise" or an Overturn of our Current Perspective towards AI

Title: GISA: A Benchmark for General Information-Seeking Assistant

Title: How Do Language Models Understand Tables? A Mechanistic Analysis of Cell Location

Title: Beyond Scalar Scores: Reinforcement Learning for Error-Aware Quality Estimation of Machine Translation

Title: VocalNet-MDM: Accelerating Streaming Speech LLM via Self-Distilled Masked Diffusion Modeling

Title: Do Multilingual LLMs have specialized language heads?

Title: Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models

Title: Learning to Judge: LLMs Designing and Applying Evaluation Rubrics

Title: Old wine in old glasses: Comparing computational and qualitative methods in identifying incivility on Persian Twitter during the #MahsaAmini movement

Title: FactSim: Fact-Checking for Opinion Summarization

Title: PERSPECTRA: A Scalable and Configurable Pluralist Benchmark of Perspectives from Arguments

Title: LakeHopper: Cross Data Lakes Column Type Annotation through Model Adaptation

Title: Affective Flow Language Model for Emotional Support Conversation

Title: WildReward: Learning Reward Models from In-the-Wild Human Interactions

Title: Large Language Models for Geolocation Extraction in Humanitarian Crisis Response

Title: Is Reasoning Capability Enough for Safety in Long-Context Language Models?

Title: Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models

Title: When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents