2026-01-30

Title: DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents

Title: UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop

Title: ChunkWise LoRA: Adaptive Sequence Partitioning for Memory-Efficient Low-Rank Adaptation and Accelerated LLM Inference

Title: Multi-task Code LLMs: Data Mix or Model Merge?

Title: Large Language Models Naively Recover Ethnicity from Individual Records

Title: EnsembleLink: Accurate Record Linkage Without Training Data

Title: Output-Space Search: Targeting LLM Generations in a Frozen Encoder-Defined Output Space

Title: From Linear Input to Hierarchical Structure: Function Words as Statistical Cues for Language Learning

Title: Scaling Embeddings Outperforms Scaling Experts in Language Models

Title: Scaling Reasoning Hop Exposes Weaknesses: Demystifying and Improving Hop Generalization in Large Language Models

Title: Parametric Knowledge is Not All You Need: Toward Honest Large Language Models via Retrieval of Pretraining Data

Title: MGSM-Pro: A Simple Strategy for Robust Multilingual Mathematical Reasoning Evaluation

Title: SHARP: Social Harm Analysis via Risk Profiles for Measuring Inequities in Large Language Models

Title: MoCo: A One-Stop Shop for Model Collaboration Research

Title: CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding

Title: Qwen3-ASR Technical Report

Title: Self-Improving Pretraining: using post-trained models to pretrain better models

Title: The Compliance Paradox: Semantic-Instruction Decoupling in Automated Academic Code Evaluation

Title: User-Centric Evidence Ranking for Attribution and Fact Verification

Title: Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

Title: SOUP: Token-level Single-sample Mix-policy Reinforcement Learning for Large Language Models

Title: DimStance: Multilingual Datasets for Dimensional Stance Analysis

Title: inversedMixup: Data Augmentation via Inverting Mixed Embeddings

Title: Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes

Title: ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Title: Language Models as Artificial Learners: Investigating Crosslinguistic Influence

Title: ILRR: Inference-Time Steering Method for Masked Diffusion Language Models

Title: AdaptBPE: From General Purpose to Specialized Tokenizers

Title: Scale-Dependent Semantic Dynamics Revealed by Allan Deviation

Title: FIT: Defying Catastrophic Forgetting in Continual LLM Unlearning

Title: Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling

Title: Can David Beat Goliath? On Multi-Hop Reasoning with Resource-Constrained Agents

Title: Toward Culturally Aligned LLMs through Ontology-Guided Multi-Agent Reasoning

Title: Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis

Title: TACLer: Tailored Curriculum Reinforcement Learning for Efficient Reasoning

Title: Enhancing Language Models for Robust Greenwashing Detection

Title: Procedural Pretraining: Warming Up Language Models with Abstract Data

Title: CE-GOCD: Central Entity-Guided Graph Optimization for Community Detection to Augment LLM Scientific Question Answering

Title: Temporal Guidance for Large Language Models

Title: CoFrGeNet: Continued Fraction Architectures for Language Generation

Title: Evaluating ChatGPT on Medical Information Extraction Tasks: Performance, Explainability and Beyond

Title: Zonkey: A Hierarchical Diffusion Language Model with Differentiable Tokenization and Probabilistic Attention

Title: Enhancing Conversational Agents via Task-Oriented Adversarial Memory Adaptation

Title: RAG-E: Quantifying Retriever-Generator Alignment and Failure Modes

Title: Distribution-Aware Reward Estimation for Test-Time Reinforcement Learning

Title: Mil-SCORE: Benchmarking Long-Context Geospatial Reasoning and Planning in Large Language Models

Title: Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model

Title: Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

Title: SONIC: Segmented Optimized Nexus for Information Compression in Key-Value Caching

Title: From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes

Title: Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding

Title: Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units

Title: When "Better" Prompts Hurt: Evaluation-Driven Iteration for LLM Applications

Title: Causal Autoregressive Diffusion Language Model

Title: Thinking Out of Order: When Output Order Stops Reflecting Reasoning Order in Diffusion Language Models

Title: A Separable Architecture for Continuous Token Representation in Language Models

Title: On the Paradoxical Interference between Instruction-Following and Task Solving

Title: MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs

Title: $G^2$-Reader: Dual Evolving Graphs for Multimodal Document QA

Title: VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

Title: ECO: Quantized Training without Full-Precision Master Weights

Title: A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Title: Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Title: FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

Title: DynaWeb: Model-Based Reinforcement Learning of Web Agents

Title: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts