2025-09-24

Title: Dynamic Prompt Fusion for Multi-Task and Cross-Domain Adaptation in LLMs

Title: GAUSS: Benchmarking Structured Mathematical Skills for Large Language Models

Title: Event Causality Identification with Synthetic Control

Title: ZERA: Zero-init Instruction Evolving Refinement Agent - From Zero Instructions to Structured Prompts via Principle-based Optimization

Title: Thinking in a Crowd: How Auxiliary Information Shapes LLM Reasoning

Title: SIRAG: Towards Stable and Interpretable RAG with A Process-Supervised Multi-Agent Framework

Title: ERFC: Happy Customers with Emotion Recognition and Forecasting in Conversation in Call Centers

Title: Evaluating Large Language Models for Detecting Antisemitism

Title: Exploiting Tree Structure for Credit Assignment in RL Training of LLMs

Title: Brittleness and Promise: Knowledge Graph Based Reward Modeling for Diagnostic Reasoning

Title: Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding

Title: Interactive Real-Time Speaker Diarization Correction with Human Feedback

Title: NormGenesis: Multicultural Dialogue Generation via Exemplar-Guided Social Norm Modeling and Violation Recovery

Title: Evaluating the Creativity of LLMs in Persian Literary Text Generation

Title: Developing an AI framework to automatically detect shared decision-making in patient-doctor conversations

Title: CogniLoad: A Synthetic Natural Language Reasoning Benchmark With Tunable Length, Intrinsic Difficulty, and Distractor Density

Title: LAWCAT: Efficient Distillation from Quadratic to Linear Attention with Convolution across Tokens for Long Context Modeling

Title: Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference

Title: Trace Is In Sentences: Unbiased Lightweight ChatGPT-Generated Text Detector

Title: CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs

Title: Prior-based Noisy Text Data Filtering: Fast and Strong Alternative For Perplexity

Title: UniECG: Understanding and Generating ECG in One Unified Model

Title: A Good Plan is Hard to Find: Aligning Models with Preferences is Misaligned with What Helps Users

Title: Analyzing Uncertainty of LLM-as-a-Judge: Interval Evaluations with Conformal Prediction

Title: MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service

Title: Global-Recent Semantic Reasoning on Dynamic Text-Attributed Graphs with Large Language Models

Title: False Friends Are Not Foes: Investigating Vocabulary Overlap in Multilingual Language Models

Title: When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models

Title: AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field

Title: Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing

Title: MAPEX: A Multi-Agent Pipeline for Keyphrase Extraction

Title: Are Smaller Open-Weight LLMs Closing the Gap to Proprietary Models for Biomedical Question Answering?

Title: Multi-Hierarchical Feature Detection for Large Language Model Generated Text

Title: Diversity Boosts AI-Generated Text Detection

Title: Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass

Title: Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus

Title: Pathways of Thoughts: Multi-Directional Thinking for Long-form Personalized Question Answering

Title: Context-Aware Hierarchical Taxonomy Generation for Scientific Papers via LLM-Guided Multi-Aspect Clustering

Title: Anecdoctoring: Automated Red-Teaming Across Language and Place

Title: Soft Tokens, Hard Truths

Title: Online Process Reward Leanring for Agentic Reinforcement Learning

Title: Steering Multimodal Large Language Models Decoding for Context-Aware Safety

Title: Systematic Comparative Analysis of Large Pretrained Language Models on Contextualized Medication Event Extraction

Title: CompLLM: Compression for Long Context Q&A

Title: Reinforcement Learning on Pre-Training Data

Title: Extracting Conceptual Spaces from LLMs Using Prototype Embeddings

Title: DRISHTIKON: A Multimodal Multilingual Benchmark for Testing Language Models' Understanding on Indian Culture