2025-06-10

Title: How Significant Are the Real Performance Gains? An Unbiased Evaluation Framework for GraphRAG

Title: TESU-LLM: Training Speech-LLMs Without Speech via Unified Encoder Alignment

Title: Unified Game Moderation: Soft-Prompting and LLM-Assisted Label Transfer for Resource-Efficient Toxicity Detection

Title: Relationship Detection on Tabular Data Using Statistical Analysis and Large Language Models

Title: Enhancing Decision-Making of Large Language Models via Actor-Critic

Title: Detection Method for Prompt Injection by Integrating Pre-trained Model and Heuristic Feature Engineering

Title: Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Title: Natural Language Interaction with Databases on Edge Devices in the Internet of Battlefield Things

Title: Direct Behavior Optimization: Unlocking the Potential of Lightweight LLMs

Title: Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights

Title: SMAR: Soft Modality-Aware Routing Strategy for MoE-based Multimodal Large Language Models Preserving Language Capabilities

Title: Canonical Autoregressive Generation

Title: What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

Title: Improving LLM-Powered EDA Assistants with RAFT

Title: Biases Propagate in Encoder-based Vision-Language Models: A Systematic Analysis From Intrinsic Measures to Zero-shot Retrieval Outcomes

Title: Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model Performance

Title: Beyond Facts: Evaluating Intent Hallucination in Large Language Models

Title: LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles

Title: Precise Information Control in Long-Form Text Generation

Title: MedCite: Can Language Models Generate Verifiable Text for Medicine?

Title: Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit

Title: Transferring Features Across Language Models With Model Stitching

Title: Interpretable Depression Detection from Social Media Text Using LLM-Derived Embeddings

Title: BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs

Title: Psychological Counseling Cannot Be Achieved Overnight: Automated Psychological Counseling Through Multi-Session Conversations

Title: SafeLawBench: Towards Safe Alignment of Large Language Models

Title: Quantile Regression with Large Language Models for Price Prediction

Title: Learning Distribution-Wise Control in Representation Space for Language Models

Title: Dynamic and Parametric Retrieval-Augmented Generation

Title: DivScore: Zero-Shot Detection of LLM-Generated Text in Specialized Domains

Title: C-PATH: Conversational Patient Assistance and Triage in Healthcare System

Title: Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Title: They want to pretend not to understand: The Limits of Current LLMs in Interpreting Implicit Content of Political Discourse

Title: On the Adaptive Psychological Persuasion of Large Language Models

Title: Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events

Title: Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Title: Can LLMs Generate Reliable Test Case Generators? A Study on Competition-Level Programming Problems

Title: PCoT: Persuasion-Augmented Chain of Thought for Detecting Fake News and Social Media Disinformation

Title: Adapt Once, Thrive with Updates: Transferable Parameter-Efficient Fine-Tuning on Evolving Base Models

Title: Right Is Not Enough: The Pitfalls of Outcome Supervision in Training LLMs for Math Reasoning

Title: Mixture of Small and Large Models for Chinese Spelling Check

Title: Automatic Speech Recognition of African American English: Lexical and Contextual Effects

Title: DiscoSum: Discourse-aware News Summarization

Title: What Makes a Good Natural Language Prompt?

Title: BIS Reasoning 1.0: The First Large-Scale Japanese Benchmark for Belief-Inconsistent Syllogistic Reasoning

Title: Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning

Title: Break-The-Chain: Reasoning Failures in LLMs via Adversarial Prompting in Code Generation

Title: Atomic Reasoning for Scientific Table Claim Verification

Title: Chain of Methodologies: Scaling Test Time Computation without Training

Title: Cultural Bias Matters: A Cross-Cultural Benchmark Dataset and Sentiment-Enriched Model for Understanding Multimodal Metaphors

Title: Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

Title: KG2QA: Knowledge Graph-enhanced Retrieval-Augmented Generation for Communication Standards Question Answering

Title: Reasoning with RAGged events: RAG-Enhanced Event Knowledge Base Construction and reasoning with proof-assistants

Title: Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Title: Com$^2$: A Causal-Guided Benchmark for Exploring Complex Commonsense Reasoning in Large Language Models

Title: Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing

Title: How Far Are We from Optimal Reasoning Efficiency?

Title: Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models

Title: Prompting Science Report 2: The Decreasing Value of Chain of Thought in Prompting

Title: Semantic-preserved Augmentation with Confidence-weighted Fine-tuning for Aspect Category Sentiment Analysis

Title: Syntactic Control of Language Models by Posterior Inference

Title: GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Title: CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação

Title: RULE: Reinforcement UnLEarning Achieves Forget-Retain Pareto Optimality

Title: Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs

Title: SDE-SQL: Enhancing Text-to-SQL Generation in Large Language Models via Self-Driven Exploration with SQL Probes

Title: Bias Attribution in Filipino Language Models: Extending a Bias Interpretability Metric for Application on Agglutinative Languages

Title: Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs

Title: Parsing the Switch: LLM-Based UD Annotation for Complex Code-Switched and Low-Resource Languages

Title: Exploring the Impact of Temperature on Large Language Models:Hot or Cold?

Title: ConfQA: Answer Only If You Are Confident

Title: Reward Model Interpretability via Optimal and Pessimal Tokens

Title: Improving LLM Reasoning through Interpretable Role-Playing Steering

Title: Refusal-Feature-guided Teacher for Safe Finetuning via Data Filtering and Alignment Distillation

Title: Plug-in and Fine-tuning: Bridging the Gap between Small Language Models and Large Language Models

Title: Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding

Title: LG-ANNA-Embedding technical report

Title: KScope: A Framework for Characterizing the Knowledge Status of Language Models

Title: From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered

Title: CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

Title: Improving Fairness of Large Language Models in Multi-document Summarization

Title: A Hybrid GA LLM Framework for Structured Task Optimization

Title: DEBATE: A Dataset for Disentangling Textual Ambiguity in Mandarin Through Speech

Title: Towards Large Language Models with Self-Consistent Natural Language Explanations

Title: Bit-level BPE: Below the byte boundary

Title: SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition

Title: Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models

Title: Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque

Title: PolitiSky24: U.S. Political Bluesky Dataset with User Stance Labels

Title: Vuyko Mistral: Adapting LLMs for Low-Resource Dialectal Translation

Title: LoRMA: Low-Rank Multiplicative Adaptation for LLMs

Title: Intent Matters: Enhancing AI Tutoring with Fine-Grained Pedagogical Intent Annotation

Title: Unblocking Fine-Grained Evaluation of Detailed Captions: An Explaining AutoRater and Critic-and-Revise Pipeline

Title: TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review

Title: Evaluating LLMs Robustness in Less Resourced Languages with Proxy Models

Title: Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation

Title: Beyond Benchmarks: A Novel Framework for Domain-Specific LLM Evaluation and Knowledge Mapping

Title: Synthesis by Design: Controlled Data Generation via Structural Guidance

Title: Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch

Title: GaRAGe: A Benchmark with Grounding Annotations for RAG Evaluation

Title: Training Superior Sparse Autoencoders for Instruct Models

Title: Through the Valley: Path to Effective Long CoT Training for Small Language Models

Title: Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU

Title: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking

Title: LLM Unlearning Should Be Form-Independent

Title: WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code

Title: Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Title: MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs

Title: MiniCPM4: Ultra-Efficient LLMs on End Devices

Title: Quantum Graph Transformer for NLP Sentiment Classification

Title: Statistical Hypothesis Testing for Auditing Robustness in Language Models

Title: Language Models over Canonical Byte-Pair Encodings

Title: Correlated Errors in Large Language Models

Title: Reinforcement Pre-Training