2025-11-14

Title: Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Title: Order Matters: Rethinking Prompt Construction in In-Context Learning

Title: Contextual morphologically-guided tokenization for Latin encoder models

Title: How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation

Title: Predicate-Argument Structure Divergences in Chinese and English Parallel Sentences and their Impact on Language Transfer

Title: TARG: Training-Free Adaptive Retrieval Gating for Efficient RAG

Title: Khmer Spellchecking: A Holistic Approach

Title: Answering Students' Questions on Course Forums Using Multiple Chain-of-Thought Reasoning and Finetuning RAG-Enabled LLM

Title: TermGPT: Multi-Level Contrastive Fine-Tuning for Terminology Adaptation in Legal and Financial Domain

Title: In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback

Title: HierRouter: Coordinated Routing of Specialized Large Language Models via Reinforcement Learning

Title: EnchTable: Unified Safety Alignment Transfer in Fine-tuned Large Language Models

Title: MINDS: A Cross-cultural Dialogue Corpus for Social Norm Classification and Adherence Detection

Title: Leveraging Large Language Models for Identifying Knowledge Components

Title: REAP: Enhancing RAG with Recursive Evaluation and Adaptive Planning for Multi-Hop Question Answering

Title: NumPert: Numerical Perturbations to Probe Language Models for Veracity Prediction

Title: Modeling Uncertainty Trends for Timely Retrieval in Dynamic RAG

Title: Language Drift in Multilingual Retrieval-Augmented Generation: Characterization and Decoding-Time Mitigation

Title: PustakAI: Curriculum-Aligned and Interactive Textbooks Using Large Language Models

Title: Do Language Models Associate Sound with Meaning? A Multimodal Study of Sound Symbolism

Title: GraphIF: Enhancing Multi-Turn Instruction Following for Large Language Models with Relation Graph Prompt

Title: Format Matters: The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts

Title: On the Military Applications of Large Language Models

Title: Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA

Title: Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL

Title: EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models

Title: Persona-Aware Alignment Framework for Personalized Dialogue Generation

Title: LangGPS: Language Separability Guided Data Pre-Selection for Joint Multilingual Instruction Tuning

Title: VocalNet-M2: Advancing Low-Latency Spoken Language Modeling via Integrated Multi-Codebook Tokenization and Multi-Token Prediction

Title: MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models

Title: Rectify Evaluation Preference: Improving LLMs' Critique on Math Reasoning via Perplexity-aware Reinforcement Learning

Title: BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages

Title: Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates

Title: TruthfulRAG: Resolving Factual-level Conflicts in Retrieval-Augmented Generation with Knowledge Graphs

Title: Position: On the Methodological Pitfalls of Evaluating Base LLMs for Reasoning

Title: Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction

Title: Reasoning About Intent for Ambiguous Requests

Title: Exploring State Tracking Capabilities of Large Language Models

Title: LocalBench: Benchmarking LLMs on County-Level Local Knowledge and Reasoning

Title: Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks

Title: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

Title: Say It Differently: Linguistic Styles as Jailbreak Vectors

Title: Convomem Benchmark: Why Your First 150 Conversations Don't Need RAG

Title: URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding

Title: DESS: DeBERTa Enhanced Syntactic-Semantic Aspect Sentiment Triplet Extraction

Title: Evaluating Prompting Strategies with MedGemma for Medical Order Extraction

Title: Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering

Title: Know Your Limits: Entropy Estimation Modeling for Compression and Generalization

Title: SSR: Socratic Self-Refine for Large Language Model Reasoning

Title: Instella: Fully Open Language Models with Stellar Performance

Title: Black-Box On-Policy Distillation of Large Language Models

Title: ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference