2025-10-08

Title: Collaborative and Proactive Management of Task-Oriented Conversations

Title: Hallucination is Inevitable for LLMs with the Open World Assumption

Title: Towards Structured Knowledge: Advancing Triple Extraction from Regional Trade Agreements using Large Language Models

Title: MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation

Title: Catalog-Native LLM: Speaking Item-ID Dialect with Less Entanglement for Recommendation

Title: Improving Metacognition and Uncertainty Communication in Language Models

Title: Advancing Automated Spatio-Semantic Analysis in Picture Description Using Language Models

Title: Automated Alignment of Math Items to Content Standards in Large-Scale Assessments Using Language Models

Title: Submodular Context Partitioning and Compression for In-Context Learning-short paper

Title: Rationale-Augmented Retrieval with Constrained LLM Re-Ranking for Task Discovery

Title: Training Large Language Models To Reason In Parallel With Global Forking Tokens

Title: Characterizing Model Behavior Under Synthetic Data Training: An Empirical Study Across Scales and Mixing Ratios

Title: Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment

Title: Linguistic Characteristics of AI-Generated Text: A Survey

Title: Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics

Title: LiRA: A Multi-Agent Framework for Reliable and Readable Literature Review Generation

Title: NLD-LLM: A systematic framework for evaluating small language transformer models on natural language description

Title: To model human linguistic prediction, make LLMs less superhuman

Title: Reliable End-to-End Material Information Extraction from the Literature with Source-Tracked Multi-Stage Large Language Models

Title: Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs

Title: Chronological Thinking in Full-Duplex Spoken Dialogue Language Models

Title: Exploring Large Language Models for Financial Applications: Techniques, Performance, and Challenges with FinMA

Title: A Single Character can Make or Break Your LLM Evals

Title: Can AI Truly Represent Your Voice in Deliberations? A Comprehensive Study of Large-Scale Opinion Aggregation with LLMs

Title: A novel hallucination classification framework

Title: Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

Title: Camellia: Benchmarking Cultural Biases in LLMs for Asian Languages

Title: RAG Makes Guardrails Unsafe? Investigating Robustness of Guardrails under RAG-style Contexts

Title: WeatherArchive-Bench: Benchmarking Retrieval-Augmented Reasoning for Historical Weather Archives

Title: Residualized Similarity for Faithfully Explainable Authorship Verification

Title: The End of Transformers? On Challenging Attention and the Rise of Sub-Quadratic Architectures

Title: Context Length Alone Hurts LLM Performance Despite Perfect Retrieval

Title: Aligning Language Models with Clinical Expertise: DPO for Heart Failure Nursing Documentation in Critical Care

Title: A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis

Title: Self-Filtered Distillation with LLMs-generated Trust Indicators for Reliable Patent Classification

Title: SimulatorArena: Are User Simulators Reliable Proxies for Multi-Turn Evaluation of AI Assistants?

Title: AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering

Title: SocialNLI: A Dialogue-Centric Social Inference Dataset

Title: Language Model as Planner and Formalizer under Constraints

Title: LANTERN: Scalable Distillation of Large Language Models for Job-Person Fit and Explanation

Title: Prototype-Based Dynamic Steering for Large Language Models

Title: CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension

Title: KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance

Title: H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference

Title: On the Role of Difficult Prompts in Self-Play Preference Optimization

Title: Activation-Informed Pareto-Guided Low-Rank Compression for Efficient LLM/VLM

Title: Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations

Title: Mission Impossible: Feedback-Guided Dynamic Interactive Planning for Improving Reasoning on LLMs

Title: A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks

Title: MADIAVE: Multi-Agent Debate for Implicit Attribute Value Extraction

Title: Code-Switching In-Context Learning for Cross-Lingual Transfer of Large Language Models

Title: DecEx-RAG: Boosting Agentic Retrieval-Augmented Generation with Decision and Execution Optimization via Process Supervision

Title: Adaptive and Multi-Source Entity Matching for Name Standardization of Astronomical Observation Facilities

Title: Data-efficient Targeted Token-level Preference Optimization for LLM-based Text-to-Speech

Title: EEPO: Exploration-Enhanced Policy Optimization via Sample-Then-Forget

Title: Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer

Title: DACP: Domain-Adaptive Continual Pre-Training of Large Language Models for Phone Conversation Summarization

Title: Automated Boilerplate: Prevalence and Quality of Contract Generators in the Context of Swiss Privacy Policies

Title: Revisiting Long-context Modeling from Context Denoising Perspective

Title: Evaluating the Sensitivity of LLMs to Harmful Contents in Long Input

Title: The fragility of "cultural tendencies" in LLMs

Title: Prompt reinforcing for long-term planning of large language models

Title: Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens

Title: EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models

Title: Probing the Difficulty Perception Mechanism of Large Language Models

Title: LexiCon: a Benchmark for Planning under Temporal Constraints in Natural Language

Title: Exploring Gaps in the APS: Direct Minimal Pair Analysis in LLM Syntactic Assessments

Title: MASA: Rethinking the Representational Bottleneck in LoRA with Multi-A Shared Adaptation

Title: Evaluating The Impact of Stimulus Quality in Investigations of LLM Language Performance

Title: CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs

Title: ASPO: Asymmetric Importance Sampling Policy Optimization

Title: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

Title: The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models

Title: Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Title: Parallel Tokenizers: Rethinking Vocabulary Design for Cross-Lingual Transfer

Title: CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits

Title: RoSE: Round-robin Synthetic Data Evaluation for Selecting LLM Generators without Human Test Sets

Title: VecInfer: Efficient LLM Inference with Low-Bit KV Cache via Outlier-Suppressed Vector Quantization

Title: Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context

Title: RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Title: Peeking inside the Black-Box: Reinforcement Learning for Explainable and Accurate Relation Extraction