2025-11-13

Title: GMTRouter: Personalized LLM Router over Multi-turn User Interactions

Title: The Collective Turing Test: Large Language Models Can Generate Realistic Multi-User Discussions

Title: Knowledge Graph Analysis of Legal Understanding and Violations in LLMs

Title: Diverse Preference Learning for Capabilities and Alignment

Title: Chopping Trees: Semantic Similarity Based Dynamic Pruning for Tree-of-Thought Reasoning

Title: What About the Scene with the Hitler Reference? HAUNT: A Framework to Probe LLMs' Self-consistency Via Adversarial Nudge

Title: Self-HarmLLM: Can Large Language Model Harm Itself?

Title: OKBench: Democratizing LLM Evaluation with Fully Automated, On-Demand, Open Knowledge Benchmarking

Title: Retrieval-Augmented Generation of Pediatric Speech-Language Pathology vignettes: A Proof-of-Concept Study

Title: Mina: A Multilingual LLM-Powered Legal Assistant Agent for Bangladesh for Empowering Access to Justice

Title: A Super-Learner with Large Language Models for Medical Emergency Advising

Title: Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM

Title: Structured Uncertainty guided Clarification for LLM Agents

Title: Toward Automated Cognitive Assessment in Parkinson's Disease Using Pretrained Language Models

Title: Beyond Task-Oriented and Chitchat Dialogues: Proactive and Transition-Aware Conversational Agents

Title: BioVerge: A Comprehensive Benchmark and Study of Self-Evaluating Agents for Biomedical Hypothesis Generation

Title: Hallucinate or Memorize? The Two Sides of Probabilistic Learning in Large Language Models

Title: HalluClean: A Unified Framework to Combat Hallucinations in LLMs

Title: TiDAR: Think in Diffusion, Talk in Autoregression

Title: EVADE: LLM-Based Explanation Generation and Validation for Error Detection in NLI

Title: Detecting Emotional Dynamic Trajectories: An Evaluation Framework for Emotional Support in Language Models

Title: A Neurosymbolic Approach to Natural Language Formalization and Verification

Title: MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

Title: Context-Aware Dynamic Chunking for Streaming Tibetan Speech Recognition

Title: Thinking Forward and Backward: Multi-Objective Reinforcement Learning for Retrieval-Augmented Reasoning

Title: Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation

Title: One-Topic-Doesn't-Fit-All: Transcreating Reading Comprehension Test for Personalized Learning

Title: LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

Title: A Hybrid Search for Complex Table Question Answering in Securities Report

Title: Context is Enough: Empirical Validation of $\textit{Sequentiality}$ on Essays

Title: The Learning Dynamics of Subword Segmentation for Morphologically Diverse Languages

Title: Stabilizing Reinforcement Learning for Honesty Alignment in Language Models on Deductive Reasoning

Title: POTSA: A Cross-Lingual Speech Alignment Framework for Low Resource Speech-to-Text Translation

Title: C$^3$TG: Conflict-aware, Composite, and Collaborative Controlled Text Generation

Title: LiteraryTaste: A Preference Dataset for Creative Writing Personalization

Title: mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models

Title: Seer Self-Consistency: Advance Budget Estimation for Adaptive Test-Time Scaling

Title: MTQ-Eval: Multilingual Text Quality Evaluation for Language Models

Title: Self-Correcting Large Language Models: Generation vs. Multiple Choice

Title: AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment

Title: Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

Title: CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling

Title: GSAP-ERE: Fine-Grained Scholarly Entity and Relation Extraction Focused on Machine Learning

Title: BIG5-TPoT: Predicting BIG Five Personality Traits, Facets, and Items Through Targeted Preselection of Texts

Title: SynClaimEval: A Framework for Evaluating the Utility of Synthetic Data in Long-Context Claim Verification