2025-06-13

Title: TaskCraft: Automated Generation of Agentic Tasks

Title: A quantum semantic framework for natural language processing

Title: Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information

Title: When Meaning Stays the Same, but Models Drift: Evaluating Quality of Service under Token-Level Behavioral Instability in LLMs

Title: ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering

Title: Unsupervised Elicitation of Language Models

Title: When Large Language Models are Reliable for Judging Empathic Communication

Title: Can LLMs Generate Good Stories? Insights and Challenges from a Narrative Planning Perspective

Title: Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval

Title: Classifying Unreliable Narrators with Large Language Models

Title: ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese

Title: Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

Title: ClusterUCB: Efficient Gradient-Based Data Selection for Targeted Fine-Tuning of LLMs

Title: Flick: Few Labels Text Classification using K-Aware Intermediate Learning in Multi-Task Low-Resource Languages

Title: "Check My Work?": Measuring Sycophancy in a Simulated Educational Context

Title: Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs

Title: Code Execution as Grounded Supervision for LLM Reasoning

Title: TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning

Title: PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Title: Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?

Title: Beyond the Battlefield: Framing Analysis of Media Coverage in Conflict Reporting

Title: Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty

Title: Table-Text Alignment: Explaining Claim Verification Against Tables in Scientific Papers

Title: Surface Fairness, Deep Bias: A Comparative Study of Bias in Language Models

Title: Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models

Title: Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs

Title: SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis

Title: NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors

Title: Spelling-out is not Straightforward: LLMs' Capability of Tokenization from Token to Characters

Title: Large Language Models for Detection of Life-Threatening Texts

Title: Inferring Adjective Hypernyms with Language Models to Increase the Connectivity of Open English Wordnet

Title: PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models

Title: Beyond True or False: Retrieval-Augmented Hierarchical Analysis of Nuanced Claims

Title: TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora

Title: One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers

Title: Different Questions, Different Models: Fine-Grained Evaluation of Uncertainty and Calibration in Clinical QA with LLMs

Title: Improving Named Entity Transcription with Contextual LLM-based Revision

Title: Mitigating Negative Interference in Multilingual Sequential Knowledge Editing through Null-Space Constraints

Title: ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization

Title: CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training

Title: Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles

Title: Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment

Title: Slimming Down LLMs Without Losing Their Minds

Title: Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Title: Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning

Title: Magistral

Title: Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Title: Dynamic Epistemic Friction in Dialogue

Title: Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training

Title: ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Title: AutoMind: Adaptive Knowledgeable Agent for Automated Data Science