2025-12-17

Title: FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Title: Olmo 3

Title: Structure-Aware Decoding Mechanisms for Complex Entity Extraction with Large-Scale Language Models

Title: What Affects the Effective Depth of Large Language Models?

Title: Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Title: A Unified Sparse Attention via Multi-Granularity Compression

Title: CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models

Title: Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents

Title: A Comparative Analysis of Retrieval-Augmented Generation Techniques for Bengali Standard-to-Dialect Machine Translation Using LLMs

Title: Ladder Up, Memory Down: Low-Cost Fine-Tuning With Side Nets

Title: Two CFG Nahuatl for automatic corpora expansion

Title: From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition

Title: Inflation Attitudes of Large Language Models

Title: Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models

Title: SASQ: Static Activation Scaling for Quantization-Aware Training in Large Language Models

Title: C-ing Clearly: Enhanced Binary Code Explanations using C code

Title: Linguists should learn to love speech-based deep learning models

Title: VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse

Title: Dual Language Models: Balancing Training Efficiency and Overfitting Resilience

Title: VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models

Title: Agreement Between Large Language Models and Human Raters in Essay Scoring: A Research Synthesis

Title: Polypersona: Persona-Grounded LLM for Synthetic Survey Responses

Title: Towards Nepali-language LLMs: Efficient GPT training with a Nepali BPE tokenizer

Title: JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction

Title: TiME: Tiny Monolingual Encoders for Efficient NLP Pipelines

Title: Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Title: Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization

Title: MMGR: Multi-Modal Generative Reasoning