2025-08-29

Title: Social Bias in Multilingual Language Models: A Survey

Title: Prompting Strategies for Language Model-Based Item Generation in K-12 Education: Bridging the Gap Between Small and Large Language Models

Title: Can Compact Language Models Search Like Agents? Distillation-Guided Policy Optimization for Preserving Agentic RAG Capabilities

Title: GUARD: Guideline Upholding Test through Adaptive Role-play and Jailbreak Diagnostics for LLMs

Title: Joint Enhancement of Relational Reasoning for Long-Context LLMs

Title: Graph-R1: Unleashing LLM Reasoning with NP-Hard Graph Problems

Title: CAPE: Context-Aware Personality Evaluation Framework for Large Language Models

Title: Measuring Reasoning Utility in LLMs via Conditional Entropy Reduction

Title: UI-Bench: A Benchmark for Evaluating Design Capabilities of AI Text-to-App Tools

Title: DentalBench: Benchmarking and Advancing LLMs Capability for Bilingual Dentistry Understanding

Title: KG-CQR: Leveraging Structured Relation Representations in Knowledge Graphs for Contextual Query Retrieval

Title: CAMB: A comprehensive industrial LLM benchmark on civil aviation maintenance

Title: MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Title: Prediction of mortality and resource utilization in critical care: a deep learning approach using multimodal electronic health records with natural language processing techniques

Title: ConspirED: A Dataset for Cognitive Traits of Conspiracy Theories and Large Language Model Safety

Title: SciTopic: Enhancing Topic Discovery in Scientific Literature through Advanced LLM

Title: Adaptive Federated Distillation for Multi-Domain Non-IID Textual Data

Title: KCS: Diversify Multi-hop Question Generation with Knowledge Composition Sampling

Title: A Graph Talks, But Who's Listening? Rethinking Evaluations for Graph-Language Models

Title: Multi-Lingual Implicit Discourse Relation Recognition with Multi-Label Hierarchical Learning

Title: Addressing Tokenization Inconsistency in Steganography and Watermarking Based on Large Language Models

Title: rStar2-Agent: Agentic Reasoning Technical Report

Title: Leveraging Semantic Triples for Private Document Generation with Local Differential Privacy Guarantees

Title: Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets

Title: GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation

Title: Feel the Difference? A Comparative Analysis of Emotional Arcs in Real and LLM-Generated CBT Sessions

Title: Turning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety Injection

Title: Exploring Machine Learning and Language Models for Multimodal Depression Detection

Title: GDLLM: A Global Distance-aware Modeling Approach Based on Large Language Models for Event Temporal Relation Extraction

Title: MSRS: Evaluating Multi-Source Retrieval-Augmented Generation

Title: The Uneven Impact of Post-Training Quantization in Machine Translation

Title: SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement

Title: How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Title: STARE at the Structure: Steering ICL Exemplar Selection with Structural Alignment

Title: ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents

Title: Lethe: Purifying Backdoored Large Language Models with Knowledge Dilution

Title: An Agile Method for Implementing Retrieval Augmented Generation Tools in Industrial SMEs

Title: Enabling Equitable Access to Trustworthy Financial Reasoning