2025-07-14

Title: RepeaTTS: Towards Feature Discovery through Repeated Fine-Tuning

Title: MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model

Title: Mass-Scale Analysis of In-the-Wild Conversations Reveals Complexity Bounds on LLM Jailbreaking

Title: Assessing the Capabilities and Limitations of FinGPT Model in Financial NLP Applications

Title: Mechanistic Indicators of Understanding in Large Language Models

Title: Signal or Noise? Evaluating Large Language Models in Resume Screening Across Contextual Variations and Human Expert Benchmarks

Title: Circumventing Safety Alignment in Large Language Models Through Embedding Space Toxicity Attenuation

Title: Unveiling Effective In-Context Configurations for Image Captioning: An External & Internal Analysis

Title: "Amazing, They All Lean Left" -- Analyzing the Political Temperaments of Current LLMs

Title: A Systematic Analysis of Declining Medical Safety Messaging in Generative AI Models

Title: Beyond Scale: Small Language Models are Comparable to GPT-4 in Mental Health Understanding

Title: Integrating External Tools with Large Language Models to Improve Accuracy

Title: CRISP: Complex Reasoning with Interpretable Step-based Plans

Title: AblationBench: Evaluating Automated Planning of Ablations in Empirical AI Research

Title: Krul: Efficient State Restoration for Multi-turn Conversations with Dynamic Cross-layer KV Sharing

Title: GRASP: Generic Reasoning And SPARQL Generation across Knowledge Graphs

Title: Audit, Alignment, and Optimization of LM-Powered Subroutines with Application to Public Comment Processing

Title: Compactor: Calibrated Query-Agnostic KV Cache Compression with Approximate Leverage Scores

Title: Distilling Empathy from Large Language Models

Title: TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs

Title: Simple Mechanistic Explanations for Out-Of-Context Reasoning

Title: Can LLMs Reliably Simulate Real Students' Abilities in Mathematics and Reading Comprehension?

Title: KAT-V1: Kwai-AutoThink Technical Report

Title: Improving MLLM's Document Image Machine Translation via Synchronously Self-reviewing Its OCR Proficiency

Title: CRMAgent: A Multi-Agent LLM System for E-Commerce CRM Message Template Generation

Title: MK2 at PBIG Competition: A Prompt Generation Solution

Title: What Factors Affect LLMs and RLLMs in Financial Question Answering?

Title: Exploring Design of Multi-Agent LLM Dialogues for Research Ideation

Title: The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality

Title: A Survey of Large Language Models in Discipline-specific Research: Challenges, Methods and Opportunities

Title: ChainEdit: Propagating Ripple Effects in LLM Knowledge Editing through Logical Rule-Guided Chains

Title: Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences

Title: Diagnosing Failures in Large Language Models' Answers: Integrating Error Attribution into Evaluation Framework

Title: Using Large Language Models for Legal Decision-Making in Austrian Value-Added Tax Law: An Experimental Study

Title: ILT-Iterative LoRA Training through Focus-Feedback-Fix for Multilingual Speech Recognition

Title: A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench

Title: LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning

Title: Semantic-Augmented Latent Topic Modeling with LLM-in-the-Loop

Title: The AI Language Proficiency Monitor -- Tracking the Progress of LLMs on Multilingual Benchmarks

Title: A comprehensive study of LLM-based argument classification: from LLAMA through GPT-4o to Deepseek-R1

Title: KELPS: A Framework for Verified Multi-Language Autoformalization via Semantic-Syntactic Alignment

Title: KG-Attention: Knowledge Graph-Guided Attention at Test-Time via Bidirectional Information Aggregation

Title: Multilingual Multimodal Software Developer for Code Generation

Title: KV Cache Steering for Inducing Reasoning in Small Language Models