2025-10-07

Title: Decomposing Attention To Find Context-Sensitive Neurons

Title: Graph-S3: Enhancing Agentic textual Graph Retrieval with Synthetic Stepwise Supervision

Title: Implicit Values Embedded in How Humans and LLMs Complete Subjective Everyday Tasks

Title: Omni-Embed-Nemotron: A Unified Multimodal Retrieval Model for Text, Image, Audio, and Video

Title: SEER: The Span-based Emotion Evidence Retrieval Benchmark

Title: ALHD: A Large-Scale and Multigenre Benchmark Dataset for Arabic LLM-Generated Text Detection

Title: TS-Reasoner: Aligning Time Series Foundation Models with LLM Reasoning

Title: Identifying Financial Risk Information Using RAG with a Contrastive Insight

Title: Sample, Align, Synthesize: Graph-Based Response Synthesis with ConGrs

Title: Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance

Title: TriMediQ: A Triplet-Structured Approach for Interactive Medical Question Answering

Title: What is a protest anyway? Codebook conceptualization is still a first-order concern in LLM-era classification

Title: CCD-Bench: Probing Cultural Conflict in Large Language Model Decision-Making

Title: Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models

Title: LLM, Reporting In! Medical Information Extraction Across Prompting, Fine-tuning and Post-correction

Title: Decoupling Task-Solving and Output Formatting in LLM Generation

Title: Can an LLM Induce a Graph? Investigating Memory Drift and Context Length

Title: Towards Unsupervised Speech Recognition at the Syllable-Level

Title: UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG

Title: Fine-Tuning Large Language Models with QLoRA for Offensive Language Detection in Roman Urdu-English Code-Mixed Text

Title: MedReflect: Teaching Medical LLMs to Self-Improve via Reflective Correction

Title: TreePrompt: Leveraging Hierarchical Few-Shot Example Selection for Improved English-Persian and English-German Translation

Title: Prompt Balance Matters: Understanding How Imbalanced Few-Shot Learning Affects Multilingual Sense Disambiguation in LLMs

Title: Rezwan: Leveraging Large Language Models for Comprehensive Hadith Text Processing: A 1.2M Corpus Development

Title: Mechanistic Interpretability of Socio-Political Frames in Language Models

Title: Beyond Token Length: Step Pruner for Efficient and Accurate Reasoning in Large Language Models

Title: Annotate Rhetorical Relations with INCEpTION: A Comparison with Automatic Approaches

Title: Read Between the Lines: A Benchmark for Uncovering Political Bias in Bangla News Articles

Title: PsycholexTherapy: Simulating Reasoning in Psychotherapy with Small Language Models in Persian

Title: Mapping Patient-Perceived Physician Traits from Nationwide Online Reviews with LLMs

Title: Simulating and Understanding Deceptive Behaviors in Long-Horizon Interactions

Title: AgriGPT-VL: Agricultural Vision-Language Understanding Suite

Title: LLM Microscope: What Model Internals Reveal About Answer Correctness and Context Utilization

Title: Thai Semantic End-of-Turn Detection for Real-Time Voice Agents

Title: Does Using Counterfactual Help LLMs Explain Textual Importance in Classification?

Title: Small Language Models for Emergency Departments Decision Support: A Benchmark Study

Title: Exploring Chain-of-Thought Reasoning for Steerable Pluralistic Alignment

Title: What Makes Diffusion Language Models Super Data Learners?

Title: PoLi-RL: A Point-to-List Reinforcement Learning Framework for Conditional Semantic Textual Similarity

Title: Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

Title: Unveiling LLMs' Metaphorical Understanding: Exploring Conceptual Irrelevance, Context Leveraging and Syntactic Influence

Title: Fine Tuning Methods for Low-resource Languages

Title: Self Speculative Decoding for Diffusion Large Language Models

Title: Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization

Title: Teaching LLM to be Persuasive: Reward-Enhanced Policy Optimization for Alignment frm Heterogeneous Rewards

Title: Epistemic Diversity and Knowledge Collapse in Large Language Models

Title: Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Title: LongTail-Swap: benchmarking language models' abilities on rare words

Title: Probing Geometry of Next Token Prediction Using Cumulant Expansion of the Softmax Entropy

Title: SliceMoE: Routing Embedding Slices Instead of Tokens for Fine-Grained and Balanced Transformer Scaling

Title: Equipping Retrieval-Augmented Large Language Models with Document Structure Awareness

Title: Measuring Language Model Hallucinations Through Distributional Correctness

Title: Read the Scene, Not the Script: Outcome-Aware Safety for LLMs

Title: Evaluation of Clinical Trials Reporting Quality using Large Language Models

Title: Inoculation Prompting: Eliciting traits from LLMs during training can suppress them at test-time

Title: Unmasking Backdoors: An Explainable Defense via Gradient-Attention Anomaly Scoring for Pre-trained Language Models

Title: Improving Consistency in Retrieval-Augmented Systems with Group Similarity Rewards

Title: SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

Title: Large Language Models Preserve Semantic Isotopies in Story Continuations

Title: On the Role of Unobserved Sequences on Sample-based Uncertainty Quantification for LLMs

Title: Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners

Title: Psychological Steering in LLMs: An Evaluation of Effectiveness and Trustworthiness

Title: GenQuest: An LLM-based Text Adventure Game for Language Learners

Title: GRACE: Generative Representation Learning via Contrastive Policy Optimization

Title: Can LLMs Detect Ambiguous Plural Reference? An Analysis of Split-Antecedent and Mereological Reference

Title: Robustness assessment of large audio language models in multiple-choice evaluation

Title: FedSRD: Sparsify-Reconstruct-Decompose for Communication-Efficient Federated Large Language Models Fine-Tuning

Title: Contrastive Learning Using Graph Embeddings for Domain Adaptation of Language Models in the Process Industry

Title: Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study

Title: FocusMed: A Large Language Model-based Framework for Enhancing Medical Question Summarization with Focus Identification

Title: Multi-Agent Tool-Integrated Policy Optimization

Title: TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

Title: Multilingual Routing in Mixture-of-Experts

Title: JSON Whisperer: Efficient JSON Editing with LLMs

Title: ModernBERT + ColBERT: Enhancing biomedical RAG through an advanced re-ranking retriever

Title: Are BabyLMs Deaf to Gricean Maxims? A Pragmatic Evaluation of Sample-efficient Language Models

Title: Hybrid Architectures for Language Models: Systematic Analysis and Design Insights

Title: Instability in Downstream Task Performance During LLM Pretraining

Title: When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Title: Detecting Distillation Data from Reasoning Models

Title: SocialHarmBench: Revealing LLM Vulnerabilities to Socially Harmful Requests

Title: Do LLMs Align with My Task? Evaluating Text-to-SQL via Dataset Alignment

Title: The Geometry of Truth: Layer-wise Semantic Dynamics for Hallucination Detection in Large Language Models

Title: A First Context-Free Grammar Applied to Nawatl Corpora Augmentation

Title: Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)

Title: Resource-Efficient Fine-Tuning of LLaMA-3.2-3B for Medical Chain-of-Thought Reasoning

Title: Imperceptible Jailbreaking against Large Language Models

Title: A Set of Quebec-French Corpus of Regional Expressions and Terms

Title: Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization

Title: COLE: a Comprehensive Benchmark for French Language Understanding Evaluation

Title: SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Title: Slm-mux: Orchestrating small language models for reasoning

Title: TeachLM: Post-Training LLMs for Education Using Authentic Learning Data

Title: Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models