2026-01-16

Title: LLM-Driven Preference Data Synthesis for Proactive Prediction of the Next User Utterance in Human-Machine Dialogue

Title: Evaluating Novelty in AI-Generated Research Plans Using Multi-Workflow LLM Pipelines

Title: Introducing Axlerod: An LLM-based Chatbot for Assisting Independent Insurance Agents

Title: SALP-CG: Standard-Aligned LLM Pipeline for Classifying and Grading Large Volumes of Online Conversational Health Data

Title: StatLLaMA: A multi-stage training framework for building a domain-optimized statistical language model

Title: Bounded Hyperbolic Tangent: A Stable and Efficient Alternative to Pre-Layer Normalization in Large Language Models

Title: Cross-Platform Evaluation of Large Language Model Safety in Pediatric Consultations: Evolution of Adversarial Robustness and the Scale Paradox

Title: ADMEDTAGGER: an annotation framework for distillation of expert knowledge for the Polish medical language

Title: SagaScale: A Realistic, Scalable, and High-Quality Long-Context Benchmark Built from Full-Length Novels

Title: Syntactic Framing Fragility: An Audit of Robustness in LLM Ethical Decisions

Title: Assessing and Improving Punctuation Robustness in English-Marathi Machine Translation

Title: Forgetting as a Feature: Cognitive Alignment of Large Language Models

Title: SciNets: Graph-Constrained Multi-Hop Reasoning for Scientific Literature Synthesis

Title: Eliminating Agentic Workflow for Introduction Generation with Parametric Stage Tokens

Title: Enhancing Business Analytics through Hybrid Summarization of Financial Reports

Title: Clinical Document Metadata Extraction: A Scoping Review

Title: Benchmarking Cross-Lingual Semantic Alignment in Multilingual Embeddings

Title: Closing the Data Loop: Using OpenDataArena to Engineer Superior Training Datasets

Title: From Detection to Diagnosis: Advancing Hallucination Analysis with Automated Data Synthesis

Title: Stable and Explainable Personality Trait Evaluation in Large Language Models with Internal Activations

Title: Bears, all bears, and some bears. Language Constraints on Language Models' Inductive Inferences

Title: MedRedFlag: Investigating how LLMs Redirect Misconceptions in Real-World Health Communication

Title: OUTLINEFORGE: Hierarchical Reinforcement Learning with Explicit States for Scientific Writing

Title: Patient-Similarity Cohort Reasoning in Clinical Text-to-SQL

Title: Clozing the Gap: Exploring Why Language Model Surprisal Outperforms Cloze Surprisal

Title: Take Out Your Calculators: Estimating the Real Difficulty of Question Items with LLM Student Simulations

Title: Context Volume Drives Performance: Tackling Domain Shift in Extremely Low-Resource Translation via RAG

Title: SocraticKG: Knowledge Graph Construction via QA-Driven Fact Extraction

Title: EHRNavigator: A Multi-Agent System for Patient-Level Clinical Question Answering over Heterogeneous Electronic Health Records

Title: EmplifAI: a Fine-grained Dataset for Japanese Empathetic Medical Dialogues in 28 Emotion Labels

Title: Long-Chain Reasoning Distillation via Adaptive Prefix Alignment

Title: Deriving Character Logic from Storyline as Codified Decision Trees

Title: CALM-IT: Generating Realistic Long-Form Motivational Interviewing Dialogues with Dual-Actor Conversational Dynamics Tracking

Title: SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature

Title: Role-Playing Agents Driven by Large Language Models: Current Status, Challenges, and Future Trends

Title: ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Title: What Gets Activated: Uncovering Domain and Driver Experts in MoE Language Models

Title: Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment

Title: AWED-FiNER: Agents, Web applications, and Expert Detectors for Fine-grained Named Entity Recognition across 36 Languages for 6.6 Billion Speakers

Title: Credit C-GPT: A Domain-Specialized Large Language Model for Conversational Understanding in Vietnamese Debt Collection

Title: HOMURA: Taming the Sand-Glass for Time-Constrained LLM Translation via Reinforcement Learning

Title: HUMANLLM: Benchmarking and Reinforcing LLM Anthropomorphism via Human Cognitive Patterns

Title: GeoSteer: Faithful Chain-of-Thought Steering via Latent Manifold Gradients

Title: Loop as a Bridge: Can Looped Transformers Truly Link Representation Space and Natural Language Outputs?

Title: coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts

Title: Untangling Input Language from Reasoning Language: A Diagnostic Framework for Cross-Lingual Moral Alignment in LLMs

Title: Measuring Affinity between Attention-Head Weight Subspaces via the Projection Kernel

Title: MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts

Title: The Straight and Narrow: Do LLMs Possess an Internal Moral Path?

Title: Multilinguality as Sense Adaptation

Title: Boundary-Aware NL2SQL: Integrating Reliability through Hybrid Reward and Data Synthesis

Title: An Efficient Long-Context Ranking Architecture With Calibrated LLM Distillation: Application to Person-Job Fit

Title: OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

Title: Training-Trajectory-Aware Token Selection

Title: Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Title: The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Title: INDIC DIALECT: A Multi Task Benchmark to Evaluate and Translate in Indian Language Dialects

Title: TF3-RO-50M: Training Compact Romanian Language Models from Scratch on Synthetic Moral Microfiction

Title: Are Language Models Models?

Title: SurgGoal: Rethinking Surgical Planning Evaluation via Goal-Satisfiability

Title: Contextual StereoSet: Stress-Testing Bias Alignment Robustness in Large Language Models

Title: DR-Arena: an Automated Evaluation Framework for Deep Research Agents

Title: PERM: Psychology-grounded Empathetic Reward Modeling for Large Language Models

Title: Representation-Aware Unlearning via Activation Signatures: From Suppression to Knowledge-Signature Erasure

Title: Form and Meaning in Intrinsic Multilingual Evaluations

Title: Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs

Title: Detecting Winning Arguments with Large Language Models and Persuasion Strategies

Title: LIBERTy: A Causal Framework for Benchmarking Concept-Based Explanations of LLMs with Structural Counterfactuals

Title: Grounding Agent Memory in Contextual Intent

Title: MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching