2026-02-13

Title: HybridRAG: A Practical LLM-based ChatBot Framework based on Pre-Generated Q&A over Raw Unstructured Documents

Title: Response-Based Knowledge Distillation for Multilingual Jailbreak Prevention Unwittingly Compromises Safety

Title: Retrieval Heads are Dynamic

Title: Assessing LLM Reliability on Temporally Recent Open-Domain Questions

Title: Small Updates, Big Doubts: Does Parameter-Efficient Fine-tuning Enhance Hallucination Detection ?

Title: Visualizing and Benchmarking LLM Factual Hallucination Tendencies via Internal State Analysis and Clustering

Title: Disentangling Direction and Magnitude in Transformer Representations: A Double Dissociation Through L2-Matched Perturbation Analysis

Title: PRIME: Policy-Reinforced Iterative Multi-agent Execution for Algorithmic Reasoning in Large Language Models

Title: Efficient Hyper-Parameter Search for LoRA via Language-aided Bayesian Optimization

Title: Synthesizing the Virtual Advocate: A Multi-Persona Speech Generation Framework for Diverse Linguistic Jurisdictions in Indic Languages

Title: Author-in-the-Loop Response Generation and Evaluation: Integrating Author Expertise and Intent in Responses to Peer Review

Title: The Script Tax: Measuring Tokenization-Driven Efficiency and Latency Disparities in Multilingual Language Models

Title: Evaluating Few-Shot Temporal Reasoning of LLMs for Human Activity Prediction in Smart Environments

Title: What Do LLMs Know About Alzheimer's Disease? Fine-Tuning, Probing, and Data Synthesis for AD Detection

Title: From Instruction to Output: The Role of Prompting in Modern NLG

Title: Mechanistic Interpretability for Large Language Model Alignment: Progress, Challenges, and Future Directions

Title: Code Mixologist : A Practitioner's Guide to Building Code-Mixed LLMs

Title: MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization

Title: DDL2PropBank Agent: Benchmarking Multi-Agent Frameworks' Developer Experience Through a Novel Relational Schema Mapping Task

Title: When and What to Ask: AskBench and Rubric-Guided RLVR for LLM Clarification

Title: Mechanistic Evidence for Faithfulness Decay in Chain-of-Thought Reasoning

Title: SurveyLens: A Research Discipline-Aware Benchmark for Automatic Survey Generation

Title: Are Aligned Large Language Models Still Misaligned?

Title: Evaluating Alignment of Behavioral Dispositions in LLMs

Title: When Models Examine Themselves: Vocabulary-Activation Correspondence in Self-Referential Processing

Title: Finding the Cracks: Improving LLMs Reasoning with Paraphrastic Probing and Consistency Verification

Title: The Energy of Falsehood: Detecting Hallucinations via Diffusion Model Likelihoods

Title: Advancing AI Trustworthiness Through Patient Simulation: Risk Assessment of Conversational Agents for Antidepressant Selection

Title: Towards Reliable Machine Translation: Scaling LLMs for Critical Error Detection and Safety

Title: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Title: ADRD-Bench: A Preliminary LLM Benchmark for Alzheimer's Disease and Related Dementias

Title: When Audio-LLMs Don't Listen: A Cross-Linguistic Study of Modality Arbitration

Title: Multimodal Fact-Level Attribution for Verifiable Reasoning

Title: Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm

Title: SIGHT: Reinforcement Learning with Self-Evidence and Information-Gain Diverse Branching for Search Agent

Title: Scene-Aware Memory Discrimination: Deciding Which Personal Knowledge Stays

Title: Which Feedback Works for Whom? Differential Effects of LLM-Generated Feedback Elements Across Learner Profiles

Title: PatientHub: A Unified Framework for Patient Simulation

Title: Finding Sense in Nonsense with Generated Contexts: Perspectives from Humans and Language Models

Title: Thinking with Drafting: Optical Decompression via Logical Reconstruction

Title: MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

Title: DMAP: A Distribution Map for Text

Title: Towards Fair and Comprehensive Evaluation of Routers in Collaborative LLM Systems

Title: LLM-based Triplet Extraction from Financial Reports

Title: Benchmark Illusion: Disagreement among LLMs and Its Scientific Consequences

Title: AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection

Title: Who is the richest club in the championship? Detecting and Rewriting Underspecified Questions Improve QA Performance

Title: Do Large Language Models Adapt to Language Variation across Socioeconomic Status?

Title: Scaling Model and Data for Multilingual Machine Translation with Open Large Language Models

Title: Automatic Simplification of Common Vulnerabilities and Exposures Descriptions

Title: LaCy: What Small Language Models Can and Should Learn is Not Just a Question of Loss

Title: Disentangling Ambiguity from Instability in Large Language Models: A Clinical Text-to-SQL Case Study

Title: Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Title: DeepSight: An All-in-One LM Safety Toolkit

Title: P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling

Title: A Rule-based Computational Model for Gaidhlig Morphology

Title: WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models

Title: dVoting: Fast Voting for dLLMs

Title: Query-focused and Memory-aware Reranker for Long Context Processing

Title: Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education

Title: ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images

Title: Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation

Title: T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization

Title: On-Policy Context Distillation for Language Models