2026-01-01

Title: Enriching Historical Records: An OCR and AI-Driven Approach for Database Integration

Title: CAT: A Metric-Driven Framework for Analyzing the Consistency-Accuracy Relation of LLMs under Controlled Input Variations

Title: STED and Consistency Scoring: A Framework for Evaluating LLM Structured Output Reliability

Title: PyBangla at BLP-2025 Task 2: Enhancing Bangla-to-Python Code Generation with Iterative Self-Correction and Multilingual Agents

Title: Noise-Driven Persona Formation in Reflexive Neural Language Generation

Title: HarmTransform: Transforming Explicit Harmful Queries into Stealthy via Multi-Agent Debate

Title: Emergent World Beliefs: Exploring Transformers in Stochastic Games

Title: When in Doubt, Deliberate: Confidence-Based Routing to Expert Debate for Sexism Detection

Title: Break Out the Silverware -- Semantic Understanding of Stored Household Items

Title: Entropy-Aware Speculative Decoding Toward Improved LLM Reasoning

Title: MiMo-Audio: Audio Language Models are Few-Shot Learners

Title: StressRoBERTa: Cross-Condition Transfer Learning from Depression, Anxiety, and PTSD to Stress Detection

Title: Retrieval Augmented Question Answering: When Should LLMs Admit Ignorance?

Title: Adversarial Lens: Exploiting Attention Layers to Generate Adversarial Examples for Evaluation

Title: Integrating Domain Knowledge for Financial QA: A Multi-Retriever RAG Approach with LLMs

Title: Disentangling Learning from Judgment: Representation Learning for Open Response Analytics

Title: Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Title: Efficient Context Scaling with LongCat ZigZag Attention

Title: CEC-Zero: Zero-Supervision Character Error Correction with Self-Generated Rewards

Title: Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Title: iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning

Title: Beyond Hallucinations: A Composite Score for Measuring Reliability in Open-Source Large Language Models

Title: Training a Huggingface Model on AWS Sagemaker (Without Tears)

Title: Activation Steering for Masked Diffusion Language Models

Title: Large Emotional World Model

Title: Training Report of TeleChat3-MoE

Title: MedKGI: Iterative Differential Diagnosis with Medical Knowledge Graphs and Information-Guided Inquiring

Title: LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring

Title: Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Title: Automated Analysis of Sustainability Reports: Using Large Language Models for the Extraction and Prediction of EU Taxonomy-Compliant KPIs

Title: Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking

Title: QianfanHuijin Technical Report: A Novel Multi-Stage Training Paradigm for Finance Industrial LLMs

Title: World model inspired sarcasm reasoning with large language model agents

Title: Comparing Approaches to Automatic Summarization in Less-Resourced Languages

Title: Cleaning English Abstracts of Scientific Publications

Title: Paragraph Segmentation Revisited: Towards a Standard Task for Structuring Speech

Title: Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs

Title: HaluNet: Multi-Granular Uncertainty Modeling for Efficient Hallucination Detection in LLM Question Answering

Title: Korean Canonical Legal Benchmark: Toward Knowledge-Independent Evaluation of LLMs' Legal Reasoning Capabilities

Title: Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Title: Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Title: Do Large Language Models Know What They Are Capable Of?

Title: R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory

Title: MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models

Title: BIOME-Bench: A Benchmark for Biomolecular Interaction Inference and Multi-Omics Pathway Mechanism Elucidation from Scientific Literature

Title: Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models

Title: Triangulation as an Acceptance Rule for Multilingual Mechanistic Interpretability

Title: PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI

Title: Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

Title: BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts

Title: Adaptive Dependency-aware Prompt Optimization Framework for Multi-Step LLM Pipeline

Title: MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes

Title: Modeling Language as a Sequence of Thoughts

Title: AdaGReS:Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG