2026-01-12

Title: Enhancing Foundation Models in Transaction Understanding with LLM-based Sentence Embeddings

Title: Lost in Execution: On the Multilingual Robustness of Tool Calling in Large Language Models

Title: Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection

Title: Glitter: Visualizing Lexical Surprisal for Readability in Administrative Texts

Title: Large Language Models Are Bad Dice Players: LLMs Struggle to Generate Random Numbers from Statistical Distributions

Title: Tracing Moral Foundations in Large Language Models

Title: Do LLMs Need Inherent Reasoning Before Reinforcement Learning? A Study in Korean Self-Correction

Title: Towards Valid Student Simulation with Large Language Models

Title: The Facade of Truth: Uncovering and Mitigating LLM Susceptibility to Deceptive Evidence

Title: MemBuilder: Reinforcing LLMs for Long-Term Memory Construction via Attributed Dense Rewards

Title: FlashMem: Distilling Intrinsic Latent Memory via Computation Reuse

Title: CHisAgent: A Multi-Agent Framework for Event Taxonomy Construction in Ancient Chinese Cultural Systems

Title: Closing the Modality Reasoning Gap for Speech Large Language Models

Title: Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring

Title: ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging

Title: Can large language models interpret unstructured chat data on dynamic group decision-making processes? Evidence on joint destination choice

Title: ACR: Adaptive Context Refactoring via Context Refactoring Operators for Multi-Turn Dialogue

Title: Data Augmented Pipeline for Legal Information Extraction and Reasoning

Title: GIFT: Games as Informal Training for Generalizable LLMs

Title: Multilingual Amnesia: On the Transferability of Unlearning in Multilingual LLMs

Title: A Framework for Personalized Persuasiveness Prediction via Context-Aware User Profiling

Title: Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat

Title: Afri-MCQA: Multimodal Cultural Question Answering for African Languages

Title: Multimodal In-context Learning for ASR of Low-resource Languages

Title: Visualising Information Flow in Word Embeddings with Diffusion Tensor Imaging

Title: Analysing Differences in Persuasive Language in LLM-Generated Text: Uncovering Stereotypical Gender Patterns

Title: AutoMonitor-Bench: Evaluating the Reliability of LLM-Based Misbehavior Monitor

Title: One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models

Title: Simplify-This: A Comparative Analysis of Prompt-Based and Fine-Tuned LLMs

Title: EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Title: LLMs as Science Journalists: Supporting Early-stage Researchers in Communicating Their Science to the Public

Title: Peek2: A Regex-free implementation of pretokenizers for Byte-level BPE

Title: Left, Right, or Center? Evaluating LLM Framing in News Classification and Generation

Title: Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs

Title: CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning

Title: FACTUM: Mechanistic Detection of Citation Hallucination in Long-Form RAG

Title: Continual-learning for Modelling Low-Resource Languages from Large Language Models

Title: iReasoner: Trajectory-Aware Intrinsic Reasoning Supervision for Self-Evolving Large Multimodal Models

Title: Gender Bias in LLMs: Preliminary Evidence from Shared Parenting Scenario in Czech Family Law

Title: An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift

Title: HAPS: Hierarchical LLM Routing with Joint Architecture and Parameter Search

Title: Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency

Title: Pantagruel: Unified Self-Supervised Encoders for French Text and Speech

Title: Can We Predict Before Executing Machine Learning Agents?

Title: Distilling Feedback into Memory-as-a-Tool

Title: The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Title: Don't Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks

Title: Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Title: AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs