2026-02-03

Title: PPoGA: Predictive Plan-on-Graph with Action for Knowledge Graph Question Answering

Title: Unlocking Electronic Health Records: A Hybrid Graph RAG Approach to Safe Clinical AI for Patient QA

Title: G-MemLLM: Gated Latent Memory Augmentation for Long-Context Reasoning in Large Language Models

Title: PTCBENCH: Benchmarking Contextual Stability of Personality Traits in LLM Systems

Title: SafeTalkCoach: Diversity-Driven Multi-Agent Simulation for Parent-Teen Health Conversations

Title: Reversible Diffusion Decoding for Diffusion Language Models

Title: DIVERGE: Diversity-Enhanced RAG for Open-Ended Information Seeking

Title: Benchmarking Uncertainty Calibration in Large Language Model Long-Form Question Answering

Title: Faithful-Patchscopes: Understanding and Mitigating Model Bias in Hidden Representations Explanation of Large Language Models

Title: MiNER: A Two-Stage Pipeline for Metadata Extraction from Municipal Meeting Minutes

Title: Detecting AI-Generated Content in Academic Peer Reviews

Title: DETOUR: An Interactive Benchmark for Dual-Agent Search and Reasoning

Title: DecompressionLM: Deterministic, Diagnostic, and Zero-Shot Concept Graph Extraction from Language Models

Title: Clause-Internal or Clause-External? Testing Turkish Reflexive Binding in Adapted versus Chain of Thought Large Language Models

Title: When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

Title: What Matters to an LLM? Behavioral and Computational Evidences from Summarization

Title: Intention-Adaptive LLM Fine-Tuning for Text Revision Generation

Title: From Knowledge to Inference: Scaling Laws of Specialized Reasoning on GlobalHealthAtlas

Title: Culturally-Grounded Governance for Multilingual Language Models: Rights, Data Boundaries, and Accountable AI Design

Title: Reasoning by Commented Code for Table Question Answering

Title: The French Drama Revolution: Political Economy and Literary Production, 1700-1900

Title: Kanade: A Simple Disentangled Tokenizer for Spoken Language Modeling

Title: Hermes the Polyglot: A Unified Framework to Enhance Expressiveness for Multimodal Interlingual Subtitling

Title: Lookahead-then-Verify: Reliable Constrained Decoding for Diffusion LLMs under Context-Free Grammars

Title: Transformer-Based Model for Multilingual Hope Speech Detection

Title: Jailbreaking LLMs via Calibration

Title: Formal Semantic Control over Language Models

Title: LegalOne: A Family of Foundation Models for Reliable Legal Reasoning

Title: Can Small Language Models Handle Context-Summarized Multi-Turn Customer-Service QA? A Synthetic Data-Driven Comparative Evaluation

Title: ExperienceWeaver: Optimizing Small-sample Experience Learning for LLM-based Clinical Text Improvement

Title: CURP: Codebook-based Continuous User Representation for Personalized Generation with LLMs

Title: Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training

Title: Temporal Leakage in Search-Engine Date-Filtered Web Retrieval: A Case Study from Retrospective Forecasting

Title: Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning

Title: WordCraft: Scaffolding the Keyword Method for L2 Vocabulary Learning with Multimodal LLMs

Title: Eliciting Trustworthiness Priors of Large Language Models via Economic Games

Title: Reasoning as State Transition: A Representational Analysis of Reasoning Evolution in Large Language Models

Title: HyLRA: Hybrid Layer Reuse Attention for Efficient Long-Context Inference

Title: Omni-RRM: Advancing Omni Reward Modeling via Automatic Rubric-Grounded Preference Synthesis

Title: Factuality on Demand: Controlling the Factuality-Informativeness Trade-off in Text Generation

Title: Unifying Adversarial Robustness and Training Across Text Scoring Models

Title: ILSIC: Corpora for Identifying Indian Legal Statutes from Queries by Laypeople

Title: EffGen: Enabling Small Language Models as Capable Autonomous Agents

Title: Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? When Hard Gating Hurts

Title: Neural FOXP2 -- Language Specific Neuron Steering for Targeted Language Improvement in LLMs

Title: Verification Required: The Impact of Information Credibility on AI Persuasion

Title: Trust in One Round: Confidence Estimation for Large Language Models via Structural Signals

Title: MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA

Title: DISPO: Enhancing Training Efficiency and Stability in Reinforcement Learning for Large Language Model Mathematical Reasoning

Title: Sparse Reward Subsystem in Large Language Models

Title: DeALOG: Decentralized Multi-Agents Log-Mediated Reasoning Framework

Title: Reliable Use of Lemmas via Eligibility Reasoning and Section$-$Aware Reinforcement Learning

Title: Distilling Token-Trained Models into Byte-Level Models

Title: Large Language Models as Students Who Think Aloud: Overly Coherent, Verbose, and Confident

Title: Bias in the Ear of the Listener: Assessing Sensitivity in Audio Language Models Across Linguistic, Demographic, and Positional Variations

Title: Personality Expression Across Contexts: Linguistic and Behavioral Variation in LLM Agents

Title: Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

Title: From Utterance to Vividity: Training Expressive Subtitle Translation LLM via Adaptive Local Preference Optimization

Title: What If We Allocate Test-Time Compute Adaptively?

Title: Logic-Oriented Retriever Enhancement via Contrastive Learning

Title: Tendem: A Hybrid AI+Human Platform

Title: Long-range Modeling and Processing of Multimodal Event Sequences

Title: Don't Judge a Book by its Cover: Testing LLMs' Robustness Under Logical Obfuscation

Title: Beyond Training for Cultural Awareness: The Role of Dataset Linguistic Structure in Large Language Models

Title: Typologically-Informed Candidate Reranking for LLM-based Translation into Low-Resource Languages

Title: PedagoSense: A Pedology Grounded LLM System for Pedagogical Strategy Detection and Contextual Response Generation in Learning Dialogues

Title: Bridging Lexical Ambiguity and Vision: A Mini Review on Visual Word Sense Disambiguation

Title: Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse

Title: ASTER: Agentic Scaling with Tool-integrated Extended Reasoning

Title: Chronos: Learning Temporal Dynamics of Reasoning Chains for Test-Time Scaling

Title: Inferential Question Answering

Title: Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection

Title: Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments

Title: PARSE: An Open-Domain Reasoning Question Answering Benchmark for Persian

Title: PACER: Blockwise Pre-verification for Speculative Decoding with Adaptive Length

Title: EverMemBench: Benchmarking Long-Term Interactive Memory in Large Language ModelsEverMemBench: Benchmarking Long-Term Interactive Memory in Large Language Models

Title: DreamOn: Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas

Title: CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering

Title: Balancing Understanding and Generation in Discrete Diffusion Models

Title: Context Dependence and Reliability in Autoregressive Language Models

Title: On the Power of (Approximate) Reward Models for Inference-Time Scaling

Title: Rethinking Selective Knowledge Distillation

Title: From Pragmas to Partners: A Symbiotic Evolution of Agentic High-Level Synthesis

Title: Understanding QA generation: Extracting Parametric and Contextual Knowledge with CQA for Low Resource Bangla Language

Title: ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure

Title: Ebisu: Benchmarking Large Language Models in Japanese Finance

Title: Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Title: Argument Rarity-based Originality Assessment for AI-Assisted Writing

Title: FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents

Title: LLM-based Embeddings: Attention Values Encode Sentence Semantics Better Than Hidden States

Title: Provable Defense Framework for LLM Jailbreaks via Noise-Augumented Alignment

Title: Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles

Title: The Art of Socratic Inquiry: A Framework for Proactive Template-Guided Therapeutic Conversation Generation

Title: SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia

Title: A2Eval: Agentic and Automated Evaluation for Embodied Brain

Title: Steering Vector Fields for Context-Aware Inference-Time Control in Large Language Models

Title: Scaling Search-Augmented LLM Reasoning via Adaptive Information Control

Title: Counting Hypothesis: Potential Mechanism of In-Context Learning

Title: Game of Thought: Robust Information Seeking with Large Language Models Using Game Theory

Title: ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation

Title: MedAraBench: Large-Scale Arabic Medical Question Answering Dataset and Benchmark

Title: Mechanistic Indicators of Steering Effectiveness in Large Language Models

Title: COMI: Coarse-to-fine Context Compression via Marginal Information Gain

Title: SafePred: A Predictive Guardrail for Computer-Using Agents via World Models

Title: Enhancing Automated Essay Scoring with Three Techniques: Two-Stage Fine-Tuning, Score Alignment, and Self-Training

Title: WorldCup Sampling for Multi-bit LLM Watermarking

Title: Zero2Text: Zero-Training Cross-Domain Inversion Attacks on Textual Embeddings

Title: : One LLM Token for Explicit Graph Structural Understanding

Title: Data Distribution Matters: A Data-Centric Perspective on Context Compression for Large Language Model

Title: CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Title: Sentence Curve Language Models

Title: AXE: Low-Cost Cross-Domain Web Structured Information Extraction

Title: Read As Human: Compressing Context via Parallelizable Close Reading and Skimming

Title: PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning

Title: ES-MemEval: Benchmarking Conversational Agents on Personalized Long-Term Emotional Support

Title: GuideWeb: A Benchmark for Automatic In-App Guide Generation on Real-World Web UIs

Title: From Code-Centric to Concept-Centric: Teaching NLP with LLM-Assisted "Vibe Coding"

Title: Breaking the Static Graph: Context-Aware Traversal for Robust Retrieval-Augmented Generation

Title: Orthogonal Hierarchical Decomposition for Structure-Aware Table Understanding with Large Language Models

Title: Beyond Local Edits: Embedding-Virtualized Knowledge for Broader Evaluation and Preservation of Model Editing

Title: S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs

Title: From Latent Signals to Reflection Behavior: Tracing Meta-Cognitive Activation Trajectory in R1-Style LLMs

Title: Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation

Title: WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Title: Closing the Loop: Universal Repository Representation with RPG-Encoder

Title: LEC-KG: An LLM-Embedding Collaborative Framework for Domain-Specific Knowledge Graph Construction -- A Case Study on SDGs

Title: Dicta-LM 3.0: Advancing The Frontier of Hebrew Sovereign LLMs

Title: Out of the Memory Barrier: A Highly Memory Efficient Training System for LLMs with Million-Token Contexts

Title: There Is More to Refusal in Large Language Models than a Single Direction

Title: Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

Title: AR-MAP: Are Autoregressive Large Language Models Implicit Teachers for Diffusion Large Language Models?

Title: Evaluating Metalinguistic Knowledge in Large Language Models across the World's Languages

Title: Sinhala Physical Common Sense Reasoning Dataset for Global PIQA

Title: Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study

Title: Am I More Pointwise or Pairwise? Revealing Position Bias in Rubric-Based LLM-as-a-Judge

Title: OpenSeal: Good, Fast, and Cheap Construction of an Open-Source Southeast Asian LLM via Parallel Data

Title: dziribot: rag based intelligent conversational agent for algerian arabic dialect

Title: Kimi K2.5: Visual Agentic Intelligence

Title: Cross-Lingual Stability of LLM Judges Under Controlled Generation: Evidence from Finno-Ugric Languages

Title: Hallucination or Creativity: How to Evaluate AI-Generated Scientific Stories?

Title: Advancing General-Purpose Reasoning Models with Modular Gradient Surgery

Title: The Shape of Beliefs: Geometry, Dynamics, and Interventions along Representation Manifolds of Language Models' Posteriors

Title: A Large-Scale Dataset for Molecular Structure-Language Description via a Rule-Regularized Method

Title: Language Steering for Multilingual In-Context Learning

Title: Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

Title: Automated Multiple Mini Interview (MMI) Scoring

Title: Proof-RM: A Scalable and Generalizable Reward Model for Math Proof

Title: From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making

Title: ROG: Retrieval-Augmented LLM Reasoning for Complex First-Order Queries over Knowledge Graphs

Title: Misconception Diagnosis From Student-Tutor Dialogue: Generate, Retrieve, Rerank

Title: Large Language Models for Mental Health: A Multilingual Evaluation

Title: Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models

Title: From Directions to Regions: Decomposing Activations in Language Models via Local Geometry

Title: Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models

Title: MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Title: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Title: RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Title: Reward-free Alignment for Conflicting Objectives