2025-07-08

Title: Loki's Dance of Illusions: A Comprehensive Survey of Hallucination in Large Language Models

Title: ChatGPT is not A Man but Das Man: Representativeness and Structural Consistency of Silicon Samples Generated by Large Language Models

Title: A Unified Speech LLM for Diarization and Speech Recognition in Multilingual Conversations

Title: Mitigating Hidden Confounding by Progressive Confounder Imputation via Large Language Models

Title: Theory of Mind in Action: The Instruction Inference Task

Title: A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis

Title: The Application of Large Language Models on Major Depressive Disorder Support Based on African Natural Products

Title: RADIANT: Retrieval AugmenteD entIty-context AligNmenT -- Introducing RAG-ability and Entity-Context Divergence

Title: Evaluating AI Counseling in Japanese: Counselor, Client, and Evaluator Roles Assessed by Motivational Interviewing Criteria

Title: Advanced Financial Reasoning at Scale: A Comprehensive Evaluation of Large Language Models on CFA Level III

Title: Real-World En Call Center Transcripts Dataset with PII Redaction

Title: RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism

Title: Less Data, More Security: Advancing Cybersecurity LLMs Specialization via Resource-Efficient Domain-Adaptive Continuous Pre-training with Minimal Tokens

Title: PB-LLMs: Privacy- and Bias-aware NLP Models using Named-Entity Recognition

Title: We Need Knowledge Distillation for Solving Math Word Problems

Title: Truth, Trust, and Trouble: Medical AI on the Edge

Title: From Answers to Rationales: Self-Aligning Multimodal Reasoning with Answer-Oriented Chain-of-Thought

Title: GAF-Guard: An Agentic Framework for Risk Management and Governance in Large Language Models

Title: A Comparative Study of Competency Question Elicitation Methods from Ontology Requirements

Title: `For Argument's Sake, Show Me How to Harm Myself!': Jailbreaking LLMs in Suicide and Self-Harm Contexts

Title: Evaluating Hierarchical Clinical Document Classification Using Reasoning-Based LLMs

Title: Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

Title: CLUES: Collaborative High-Quality Data Selection for LLMs via Training Dynamics

Title: PDFMathTranslate: Scientific Document Translation Preserving Layouts

Title: Subversion via Focal Points: Investigating Collusion in LLM Monitoring

Title: Beyond Overcorrection: Evaluating Diversity in T2I Models with DIVBENCH

Title: OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering

Title: The Book of Life approach: Enabling richness and scale for life course research

Title: Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation

Title: Cautious Next Token Prediction

Title: Dynamic Long Short-Term Memory Based Memory Storage For Long Horizon LLM Interaction

Title: K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function

Title: Counterfactual Tuning for Temporal Sensitivity Enhancement in Large Language Model-based Recommendation

Title: Large Language Models for Automating Clinical Data Standardization: HL7 FHIR Use Case

Title: ARF-RLHF: Adaptive Reward-Following for RLHF through Emotion-Driven Self-Supervision and Trace-Biased Dynamic Optimization

Title: RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Title: ReliableMath: Benchmark of Reliable Mathematical Reasoning on Large Language Models

Title: From Measurement to Mitigation: Exploring the Transferability of Debiasing Approaches to Gender Bias in Maltese Language Models

Title: Expert-level validation of AI-generated medical text with scalable language models

Title: Adversarial Manipulation of Reasoning Models using Internal Representations

Title: How Much Content Do LLMs Generate That Induces Cognitive Bias in Users?

Title: KinyaColBERT: A Lexically Grounded Retrieval Model for Low-Resource Retrieval-Augmented Generation

Title: RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Title: GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation

Title: Read Quietly, Think Aloud: Decoupling Comprehension and Reasoning in LLMs

Title: SHNU Multilingual Conversational Speech Recognition System for INTERSPEECH 2025 MLC-SLM Challenge

Title: WETBench: A Benchmark for Detecting Task-Specific Machine-Generated Text on Wikipedia

Title: Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset

Title: Graph Repairs with Large Language Models: An Empirical Study

Title: SMCLM: Semantically Meaningful Causal Language Modeling for Autoregressive Paraphrase Generation

Title: Improving Social Determinants of Health Documentation in French EHRs Using Large Language Models

Title: Beyond Weaponization: NLP Security for Medium and Lower-Resourced Languages in Their Own Right

Title: Four Shades of Life Sciences: A Dataset for Disinformation Detection in the Life Sciences

Title: AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions

Title: H2HTalk: Evaluating Large Language Models as Emotional Companion

Title: TRACE: Training and Inference-Time Interpretability Analysis for Language Models

Title: Recon, Answer, Verify: Agents in Search of Truth

Title: TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection

Title: STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking

Title: Controlling Thinking Speed in Reasoning Models

Title: Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making

Title: MemOS: A Memory OS for AI System

Title: OrthoRank: Token Selection via Sink Token Orthogonality for Efficient LLM inference

Title: Demystifying ChatGPT: How It Masters Genre Recognition

Title: Losing our Tail -- Again: On (Un)Natural Selection And Multilingual Large Language Models

Title: Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Title: Nunchi-Bench: Benchmarking Language Models on Cultural Reasoning with a Focus on Korean Superstition

Title: LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models

Title: Patient-Centered RAG for Oncology Visit Aid Following the Ottawa Decision Guide

Title: Beyond Independent Passages: Adaptive Passage Combination Retrieval for Retrieval Augmented Open-Domain Question Answering

Title: Conversation Forests: The Key to Fine Tuning Large Language Models for Multi-Turn Medical Conversations is Branching

Title: BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering

Title: Token Level Hallucination Detection via Variance in Language Models

Title: Dissecting Clinical Reasoning in Language Models: A Comparative Study of Prompts and Model Adaptation Strategies

Title: Large Language Models for Zero-Shot Multicultural Name Recognition

Title: SymbolicThought: Integrating Language Models and Symbolic Reasoning for Consistent and Interpretable Human Relationship Understanding

Title: Context Tuning for In-Context Optimization

Title: Fairness Evaluation of Large Language Models in Academic Library Reference Services

Title: No Language Data Left Behind: A Comparative Study of CJK Language Datasets in the Hugging Face Ecosystem

Title: Large Language Models' Varying Accuracy in Recognizing Risk-Promoting and Health-Supporting Sentiments in Public Health Discourse: The Cases of HPV Vaccination and Heated Tobacco Products

Title: Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning?

Title: SpiritRAG: A Q&A System for Religion and Spirituality in the United Nations Archive

Title: THM@SimpleText 2025 -- Task 1.1: Revisiting Text Simplification based on Complex Terms for Non-Experts

Title: MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind

Title: RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling

Title: GradOT: Training-free Gradient-preserving Offsite-tuning for Large Language Models

Title: Think Twice Before You Judge: Mixture of Dual Reasoning Experts for Multimodal Sarcasm Detection

Title: Dual Modality-Aware Gated Prompt Tuning for Few-Shot Multimodal Sarcasm Detection

Title: Unveiling the Potential of Diffusion Large Language Model in Controllable Generation

Title: DP-Fusion: Token-Level Differentially Private Inference for Large Language Models

Title: Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts

Title: PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes

Title: Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs

Title: R1-RE: Cross-Domain Relationship Extraction with RLVR

Title: XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL

Title: Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce

Title: LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework

Title: "This Suits You the Best": Query Focused Comparative Explainable Summarization

Title: A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic

Title: LLMs as Architects and Critics for Multi-Source Opinion Summarization

Title: CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

Title: Reason to Rote: Rethinking Memorization in Reasoning

Title: A Survey of Pun Generation: Datasets, Evaluations and Methodologies

Title: Spec-TOD: A Specialized Instruction-Tuned LLM Framework for Efficient Task-Oriented Dialogue Systems

Title: Dialogue-Based Multi-Dimensional Relationship Extraction from Novels

Title: $\textit{Grahak-Nyay:}$ Consumer Grievance Redressal through Large Language Models

Title: Building Open-Retrieval Conversational Question Answering Systems by Generating Synthetic Data and Decontextualizing User Questions

Title: Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations

Title: O_FT@EvalLLM2025 : étude comparative de choix de données et de stratégies d'apprentissage pour l'adaptation de modèles de langue à un domaine

Title: SIGIR 2025 -- LiveRAG Challenge Report

Title: ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Title: Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification

Title: Verified Language Processing with Hybrid Explainability: A Technical Report

Title: An Evaluation of Large Language Models on Text Summarization Tasks Using Prompt Engineering Techniques

Title: SMART: Simulated Students Aligned with Item Response Theory for Question Difficulty Prediction

Title: Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization

Title: AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models

Title: InfoSteer: Steering Information Utility in Language Model Post-Training

Title: OpenS2S: Advancing Open-Source End-to-End Empathetic Large Speech Language Model

Title: From Fragments to Facts: A Curriculum-Driven DPO Approach for Generating Hindi News Veracity Explanations

Title: Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models

Title: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions