2025-07-25

Title: Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Title: Dynamic and Generalizable Process Reward Modeling

Title: VeriMinder: Mitigating Analytical Vulnerabilities in NL2SQL

Title: Evaluating the Performance of AI Text Detectors, Few-Shot and Chain-of-Thought Prompting Using DeepSeek Generated Text

Title: Are LLM Belief Updates Consistent with Bayes' Theorem?

Title: Technical Report of TeleChat2, TeleChat2.5 and T1

Title: NeuralDB: Scaling Knowledge Editing in LLMs to 100,000 Facts with Neural KV Database

Title: GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs

Title: Synthetic Data Generation for Phrase Break Prediction with Large Language Model

Title: Privacy-Preserving Synthetic Review Generation with Diverse Writing Styles Using LLMs

Title: TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Title: Hybrid and Unitary Fine-Tuning of Large Language Models: Methods and Benchmarking under Resource Constraints

Title: GOAT-SLM: A Spoken Language Model with Paralinguistic and Speaker Characteristic Awareness

Title: MathOPEval: A Fine-grained Evaluation Benchmark for Visual Operations of MLLMs in Mathematical Reasoning

Title: HIVMedQA: Benchmarking large language models for HIV medical decision support

Title: SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models

Title: TN-AutoRCA: Benchmark Construction and Agentic Framework for Self-Improving Alarm-Based Root Cause Analysis in Telecommunication Networks

Title: Safeguarding RAG Pipelines with GMTP: A Gradient-based Masked Token Probability Method for Poisoned Document Detection

Title: Exploring the Impact of Instruction-Tuning on LLM's Susceptibility to Misinformation

Title: Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation

Title: Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models

Title: StyleAdaptedLM: Enhancing Instruction Following Models with Efficient Stylistic Transfer

Title: BadReasoner: Planting Tunable Overthinking Backdoors into Large Reasoning Models for Fun or Profit

Title: TDR: Task-Decoupled Retrieval with Fine-Grained LLM Feedback for In-Context Learning

Title: Hybrid Annotation for Propaganda Detection: Integrating LLM Pre-Annotations with Human Intelligence

Title: CLEAR: Error Analysis via LLM-as-a-Judge Made Easy

Title: FinDPO: Financial Sentiment Analysis for Algorithmic Trading through Preference Optimization of LLMs

Title: AraTable: Benchmarking LLMs' Reasoning and Understanding of Arabic Tabular Data

Title: Generation of Synthetic Clinical Text: A Systematic Review

Title: Not All Features Deserve Attention: Graph-Guided Dependency Learning for Tabular Data Generation with Language Models

Title: The Moral Gap of Large Language Models

Title: GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface

Title: Hybrid Tokenization Strategy for DNA Language Model using Byte Pair Encoding and K-MER Methods

Title: Wide-In, Narrow-Out: Revokable Decoding for Efficient and Effective DLLMs

Title: System Report for CCL25-Eval Task 10: SRAG-MAV for Fine-Grained Chinese Hate Speech Recognition

Title: AQuilt: Weaving Logic and Self-Inspection into Low-Cost, High-Relevance Data Synthesis for Specialist LLMs

Title: TRPrompt: Bootstrapping Query-Aware Prompt Optimization from Textual Rewards

Title: Checklists Are Better Than Reward Models For Aligning Language Models