2025-10-06

Title: Hallucination reduction with CASAL: Contrastive Activation Steering For Amortized Learning

Title: Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval

Title: KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI

Title: AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering

Title: SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification

Title: EntropyLong: Effective Long-Context Training via Predictive Uncertainty

Title: Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)

Title: Human Mobility Datasets Enriched With Contextual and Social Dimensions

Title: Where Did It Go Wrong? Attributing Undesirable LLM Behaviors via Representation Gradient Tracing

Title: FormalML: A Benchmark for Evaluating Formal Subgoal Completion in Machine Learning Theory

Title: CRACQ: A Multi-Dimensional Approach To Automated Document Assessment

Title: Optimizing Long-Form Clinical Text Generation with Claim-Based Rewards

Title: Evaluating Uncertainty Quantification Methods in Argumentative Large Language Models

Title: Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs

Title: DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning

Title: $\texttt{BluePrint}$: A Social Media User Dataset for LLM Persona Evaluation and Training

Title: Breaking the MoE LLM Trilemma: Dynamic Expert Clustering with Structured Compression

Title: Small Language Models for Curriculum-based Guidance

Title: LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL

Title: Language, Culture, and Ideology: Personalizing Offensiveness Detection in Political Tweets with Reasoning LLMs

Title: Evaluating Bias in Spoken Dialogue LLMs for Real-World Decisions and Recommendations

Title: A Senegalese Legal Texts Structuration Using LLM-augmented Knowledge Graph

Title: Modeling the language cortex with form-independent and enriched representations of sentence meaning reveals remarkable semantic abstractness

Title: DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding

Title: Emission-GPT: A domain-specific language model agent for knowledge retrieval, emission inventory and data analysis

Title: Spiral of Silence in Large Language Model Agents

Title: ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference

Title: A Cross-Lingual Analysis of Bias in Large Language Models Using Romanian History

Title: Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents

Title: Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models

Title: Pretraining with hierarchical memories: separating long-tail and common knowledge

Title: Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems

Title: Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation

Title: KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning

Title: Retrieval and Augmentation of Domain Knowledge for Text-to-SQL Semantic Parsing

Title: Words That Make Language Models Perceive

Title: CLARITY: Clinical Assistant for Routing, Inference, and Triage

Title: Unraveling Syntax: How Language Models Learn Context-Free Grammars

Title: Hierarchical Semantic Retrieval with Cobweb

Title: Knowledge-Graph Based RAG System Evaluation Framework

Title: Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models

Title: Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models

Title: Mind the Gap: Linguistic Divergence and Adaptation Strategies in Human-LLM Assistant vs. Human-Human Interactions

Title: SoT: Structured-of-Thought Prompting Guides Multilingual Reasoning in Large Language Models

Title: Self-Improvement in Multimodal Large Language Models: A Survey

Title: Uncertainty as Feature Gaps: Epistemic Uncertainty Quantification of LLMs in Contextual Question-Answering

Title: Time-To-Inconsistency: A Survival Analysis of Large Language Model Robustness to Adversarial Attacks

Title: TravelBench: Exploring LLM Performance in Low-Resource Domains

Title: IndiCASA: A Dataset and Bias Evaluation Framework in LLMs Using Contrastive Embedding Similarity in the Indian Context

Title: The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback

Title: StepChain GraphRAG: Reasoning Over Knowledge Graphs for Multi-Hop Question Answering

Title: Evaluating Large Language Models for IUCN Red List Species Information

Title: Self-Reflective Generation at Test Time

Title: Leave No TRACE: Black-box Detection of Copyrighted Dataset Usage in Large Language Models via Watermarking

Title: Grounding Large Language Models in Clinical Evidence: A Retrieval-Augmented Generation System for Querying UK NICE Clinical Guidelines

Title: Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting?

Title: Semantic Similarity in Radiology Reports via LLMs and NER

Title: Listening or Reading? Evaluating Speech Awareness in Chain-of-Thought Speech-to-Text Translation

Title: SurveyBench: How Well Can LLM(-Agents) Write Academic Surveys?

Title: Beyond the Final Layer: Intermediate Representations for Better Multilingual Calibration in Large Language Models

Title: EditLens: Quantifying the Extent of AI Editing in Text

Title: Neural Correlates of Language Models Are Specific to Human Language

Title: Topic Modeling as Long-Form Generation: Can Long-Context LLMs revolutionize NTM via Zero-Shot Prompting?

Title: FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents

Title: Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Title: Self-Anchor: Large Language Model Reasoning via Step-by-step Attention Alignment

Title: Reward Models are Metrics in a Trench Coat