2025-09-09

Title: An Empirical Analysis of Discrete Unit Representations in Speech Language Modeling Pre-training

Title: Beyond ROUGE: N-Gram Subspace Features for LLM Hallucination Detection

Title: A Lightweight Framework for Trigger-Guided LoRA-Based Self-Adaptation in LLMs

Title: Talk Isn't Always Cheap: Understanding Failure Modes in Multi-Agent Debate

Title: No Translation Needed: Forecasting Quality from Fertility and Metadata

Title: Direct-Scoring NLG Evaluators Can Use Pairwise Comparisons Too

Title: From Staff Messages to Actionable Insights: A Multi-Stage LLM Classification Framework for Healthcare Analytics

Title: The Token Tax: Systematic Bias in Multilingual Tokenization

Title: Biomedical Literature Q&A System Using Retrieval-Augmented Generation (RAG)

Title: Using Contrastive Learning to Improve Two-Way Reasoning in Large Language Models: The Obfuscation Task as a Case Study

Title: Ad hoc conventions generalize to new referents

Title: Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation

Title: Icon$^{2}$: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation

Title: Beyond Keywords: Driving Generative Search Engine Optimization with Content-Centric Agents

Title: New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR

Title: From Joy to Fear: A Benchmark of Emotion Estimation in Pop Song Lyrics

Title: Few-Shot Query Intent Detection via Relation-Aware Prompt Learning

Title: LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

Title: Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning

Title: Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian

Title: A Survey of the State-of-the-Art in Conversational Question Answering Systems

Title: Exploring Subjective Tasks in Farsi: A Survey Analysis and Evaluation of Language Models

Title: Enhancing Factual Accuracy and Citation Generation in LLMs via Multi-Stage Self-Verification

Title: ZhiFangDanTai: Fine-tuning Graph-based Retrieval-Augmented Generation Model for Traditional Chinese Medicine Formula

Title: MedFactEval and MedAgentBrief: A Framework and Workflow for Generating and Evaluating Factual Clinical Summaries

Title: Let's Roleplay: Examining LLM Alignment in Collaborative Dialogues

Title: Accelerating Large Language Model Inference via Early-Exiting Algorithms

Title: KatotohananQA: Evaluating Truthfulness of Large Language Models in Filipino

Title: Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge

Title: Orthogonal Low-rank Adaptation in Lie Groups for Continual Learning of Large Language Models

Title: Benchmarking Gender and Political Bias in Large Language Models

Title: Understanding the Influence of Synthetic Data for Text Embedders

Title: Augmented Fine-Tuned LLMs for Enhanced Recruitment Automation

Title: MSLEF: Multi-Segment LLM Ensemble Finetuning in Recruitment

Title: Mask-GCG: Are All Tokens in Adversarial Suffixes Necessary for Jailbreak Attacks?

Title: PL-CA: A Parametric Legal Case Augmentation Framework

Title: Do LLMs exhibit the same commonsense capabilities across languages?

Title: WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Title: Crown, Frame, Reverse: Layer-Wise Scaling Variants for LLM Pre-Training

Title: LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection

Title: SLiNT: Structure-aware Language Model with Injection and Contrastive Training for Knowledge Graph Completion

Title: HAVE: Head-Adaptive Gating and ValuE Calibration for Hallucination Mitigation in Large Language Models

Title: Guided Decoding and Its Critical Role in Retrieval-Augmented Generation

Title: Domain-Aware RAG: MoL-Enhanced RL for Efficient Training and Scalable Retrieval

Title: IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Title: Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint

Title: MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Title: MoGU V2: Toward a Higher Pareto Frontier Between Model Usability and Security

Title: Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem

Title: A Comparative Benchmark of Large Language Models for Labelling Wind Turbine Maintenance Logs

Title: COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens

Title: EPT Benchmark: Evaluation of Persian Trustworthiness in Large Language Models

Title: The Majority is not always right: RL training for solution aggregation

Title: UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction

Title: mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Title: Proof-Carrying Numbers (PCN): A Protocol for Trustworthy Numeric Answers from LLMs via Claim Verification

Title: Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

Title: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Title: On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts