2025-08-26

Title: GreenTEA: Gradient Descent with Topic-modeling and Evolutionary Auto-prompting

Title: Cognitive Decision Routing in Large Language Models: When to Think Fast, When to Think Slow

Title: Trust but Verify! A Survey on Verification Design for Test-time Scaling

Title: Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?

Title: QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting

Title: Assessing Consciousness-Related Behaviors in Large Language Models Using the Maze Test

Title: Error Reflection Prompting: Can Large Language Models Successfully Understand Errors?

Title: How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models

Title: Toward Socially Aware Vision-Language Models: Evaluating Cultural Competence Through Multimodal Story Generation

Title: Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities

Title: LLMs Learn Constructions That Humans Do Not Know

Title: If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition

Title: Learning from Diverse Reasoning Paths with Routing and Collaboration

Title: QFrCoLA: a Quebec-French Corpus of Linguistic Acceptability Judgments

Title: Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling

Title: ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn Jailbreaks

Title: Unbiased Reasoning for Knowledge-Intensive Tasks in Large Language Models via Conditional Front-Door Adjustment

Title: Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs

Title: Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective

Title: Decoding Alignment: A Critical Survey of LLM Development Initiatives through Value-setting and Data-centric Lens

Title: ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation

Title: GRADE: Generating multi-hop QA and fine-gRAined Difficulty matrix for RAG Evaluation

Title: DeAR: Dual-Stage Document Reranking with Reasoning Agents via LLM Distillation

Title: KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF

Title: Planning for Success: Exploring LLM Long-term Planning Capabilities in Table Understanding

Title: Improving Table Understanding with LLMs and Entity-Oriented Search

Title: GRAID: Synthetic Data Generation with Geometric Constraints and Multi-Agentic Reflection for Harmful Content Detection

Title: Linguistic Neuron Overlap Patterns to Facilitate Cross-lingual Transfer on Low-resource Languages

Title: Token Homogenization under Positional Bias

Title: Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models

Title: SPORTSQL: An Interactive System for Real-Time Sports Reasoning and Visualization

Title: Quantifying Language Disparities in Multilingual Large Language Models

Title: The Impact of Annotator Personas on LLM Behavior Across the Perspectivism Spectrum

Title: Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

Title: Active Domain Knowledge Acquisition with $100 Budget: Enhancing LLMs via Cost-Efficient, Expert-Involved Interaction in Sensitive Domains

Title: SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation

Title: ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation

Title: Routing Distilled Knowledge via Mixture of LoRA Experts for Large Language Model based Bundle Generation

Title: Are You Sure You're Positive? Consolidating Chain-of-Thought Agents with Uncertainty Quantification for Aspect-Category Sentiment Analysis

Title: From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users

Title: Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models

Title: CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation

Title: DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Title: Capturing Legal Reasoning Paths from Facts to Law in Court Judgments using Knowledge Graphs

Title: UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat

Title: Agent-Testing Agent: A Meta-Agent for Automated Testing and Evaluation of Conversational AI Agents

Title: DashboardQA: Benchmarking Multimodal Agents for Question Answering on Interactive Dashboards

Title: DS@GT at CheckThat! 2025: A Simple Retrieval-First, LLM-Backed Framework for Claim Normalization

Title: Persuasion Dynamics in LLMs: Investigating Robustness and Adaptability in Knowledge and Safety with DuET-PD

Title: Evaluating the Impact of Verbal Multiword Expressions on Machine Translation

Title: Improving French Synthetic Speech Quality via SSML Prosody Control

Title: Debate or Vote: Which Yields Better Decisions in Multi-Agent Large Language Models?

Title: Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design

Title: UQ: Assessing Language Models on Unsolved Questions

Title: Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions

Title: Steering When Necessary: Flexible Steering Large Language Models with Backtracking

Title: Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit

Title: Weights-Rotated Preference Optimization for Large Language Models

Title: SurveyGen: Quality-Aware Scientific Survey Generation with Large Language Models

Title: CoCoA: Confidence- and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models

Title: EMPOWER: Evolutionary Medical Prompt Optimization With Reinforcement Learning

Title: Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models

Title: SMITE: Enhancing Fairness in LLMs through Optimal In-Context Example Selection via Dynamic Validation

Title: ISACL: Internal State Analyzer for Copyrighted Training Data Leakage

Title: Speculating LLMs' Chinese Training Data Pollution from Their Tokens

Title: DRQA: Dynamic Reasoning Quota Allocation for Controlling Overthinking in Reasoning Large Language Models

Title: Beyond Demographics: Enhancing Cultural Value Survey Simulation with Multi-Stage Personality-Driven Cognitive Reasoning

Title: Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs

Title: ILRe: Intermediate Layer Retrieval for Context Compression in Causal Language Models

Title: Pandora: Leveraging Code-driven Knowledge Transfer for Unified Structured Knowledge Reasoning

Title: AMELIA: A Family of Multi-task End-to-end Language Models for Argumentation

Title: Debiasing Multilingual LLMs in Cross-lingual Latent Space

Title: Understanding Subword Compositionality of Large Language Models

Title: German4All - A Dataset and Model for Readability-Controlled Paraphrasing in German

Title: A Retail-Corpus for Aspect-Based Sentiment Analysis with Large Language Models

Title: Neither Valid nor Reliable? Investigating the Use of LLMs as Judges

Title: How Quantization Shapes Bias in Large Language Models

Title: Agri-Query: A Case Study on RAG vs. Long-Context LLMs for Cross-Lingual Technical Question Answering

Title: Detecting and Characterizing Planning in Language Models

Title: SentiMM: A Multimodal Multi-Agent Framework for Sentiment Analysis in Social Media

Title: DiscussLLM: Teaching Large Language Models When to Speak

Title: Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation

Title: Leveraging Large Language Models for Accurate Sign Language Translation in Low-Resource Scenarios

Title: Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation

Title: Better Language Model-Based Judging Reward Modeling through Scaling Comprehension Boundaries

Title: MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols

Title: Demographic Biases and Gaps in the Perception of Sexism in Large Language Models

Title: From BERT to LLMs: Comparing and Understanding Chinese Classifier Prediction in Language Models

Title: MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains