2025-09-17

Title: MTEB-NL and E5-NL: Embedding Benchmark and Models for Dutch

Title: MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables

Title: LLM-as-a-Judge: Rapid Evaluation of Legal Document Recommendation for Retrieval-Augmented Generation

Title: SENTRA: Selected-Next-Token Transformer for LLM Text Detection

Title: MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering

Title: MedFact: Benchmarking the Fact-Checking Capabilities of Large Language Models on Chinese Medical Texts

Title: Topic Coverage-based Demonstration Retrieval for In-Context Learning

Title: Does Language Model Understand Language?

Title: Audited Reasoning Refinement: Fine-Tuning Language Models via LLM-Guided Step-Wise Evaluation and Correction

Title: FunAudio-ASR Technical Report

Title: MAGIC-Enhanced Keyword Prompting for Zero-Shot Audio Captioning with CLIP Models

Title: EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving

Title: PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition

Title: Don't Change My View: Ideological Bias Auditing in Large Language Models

Title: Mitigating Strategy Preference Bias in Emotional Support Conversation via Uncertainty Estimations

Title: Chat-Driven Text Generation and Interaction for Person Retrieval

Title: Towards Inclusive Toxic Content Moderation: Addressing Vulnerabilities to Adversarial Attacks in Toxicity Classifiers Tackling LLM-generated Content

Title: HistoryBankQA: Multilingual Temporal Question Answering on Historical Events

Title: ConvergeWriter: Data-Driven Bottom-Up Article Construction

Title: Benchmarking and Improving LVLMs on Event Extraction from Multimedia Documents

Title: The LLM Already Knows: Estimating LLM-Perceived Question Difficulty via Hidden Representations

Title: Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings

Title: All Roads Lead to Rome: Graph-Based Confidence Estimation for Large Language Model Reasoning

Title: Automated Generation of Research Workflows from Academic Papers: A Full-text Mining Framework

Title: Investigating ReLoRA: Effects on the Learning Dynamics of Small Language Models

Title: Do LLMs Understand Wine Descriptors Across Cultures? A Benchmark for Cultural Adaptations of Wine Reviews

Title: SitLLM: Large Language Models for Sitting Posture Health Understanding via Pressure Sensor Data

Title: Multi-Model Synthetic Training for Mission-Critical Small Language Models

Title: Shaping Explanations: Semantic Reward Modeling with Encoder-Only Transformers for GRPO

Title: Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning

Title: LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals

Title: The Few-shot Dilemma: Over-prompting Large Language Models

Title: Evaluating LLM Alignment on Personality Inference from Real-World Interview Data

Title: ChartGaze: Enhancing Chart Understanding in LVLMs with Eye-Tracking Guided Attention Refinement

Title: WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Title: Scaling Agents via Continual Pre-training

Title: Towards General Agentic Intelligence via Environment Scaling

Title: WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Title: ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Title: Do Natural Language Descriptions of Model Activations Convey Privileged Information?