2025-07-29

Title: Setting The Table with Intent: Intent-aware Schema Generation and Editing for Literature Review Tables

Title: Mind the Language Gap in Digital Humanities: LLM-Aided Translation of SKOS Thesauri

Title: Mitigating Geospatial Knowledge Hallucination in Large Language Models: Benchmarking and Dynamic Factuality Aligning

Title: Efficient Attention Mechanisms for Large Language Models: A Survey

Title: MOCHA: Are Code Language Models Robust Against Multi-Turn Malicious Coding Prompts?

Title: HITSZ's End-To-End Speech Translation Systems Combining Sequence-to-Sequence Auto Speech Recognition Model and Indic Large Language Model for IWSLT 2025 in Indic Track

Title: MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks

Title: RoD-TAL: A Benchmark for Answering Questions in Romanian Driving License Exams

Title: Towards Inclusive NLP: Assessing Compressed Multilingual Transformers across Diverse Language Benchmarks

Title: Ta-G-T: Subjectivity Capture in Table to Text Generation via RDF Graphs

Title: Basic Reading Distillation

Title: JT-Math: A Multi-Stage Framework for Advanced Mathematical Reasoning in Large Language Models

Title: UloRL:An Ultra-Long Output Reinforcement Learning Approach for Advancing Large Language Models' Reasoning Abilities

Title: Flora: Effortless Context Construction to Arbitrary Length and Scale

Title: HCAttention: Extreme KV Cache Compression via Heterogeneous Attention Computing for LLMs

Title: DRIVE: Disfluency-Rich Synthetic Dialog Data Generation Framework for Intelligent Vehicle Environments

Title: Zero-shot Performance of Generative AI in Brazilian Portuguese Medical Exam

Title: A Gold Standard Dataset and Evaluation Framework for Depression Detection and Explanation in Social Media using LLMs

Title: CaliDrop: KV Cache Compression with Calibration

Title: KLAAD: Refining Attention Mechanisms to Reduce Societal Bias in Generative Language Models

Title: Text2Vis: A Challenging and Diverse Benchmark for Generating Multimodal Visualizations from Text

Title: Exploring LLM Autoscoring Reliability in Large-Scale Writing Assessments Using Generalizability Theory

Title: VLQA: The First Comprehensive, Large, and High-Quality Vietnamese Dataset for Legal Question Answering

Title: FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression

Title: Infogen: Generating Complex Statistical Infographics from Documents

Title: RAG in the Wild: On the (In)effectiveness of LLMs with Mixture-of-Knowledge Retrieval Augmentation

Title: ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models

Title: AI-Driven Generation of Old English: A Framework for Low-Resource Languages

Title: Sem-DPO: Mitigating Semantic Inconsistency in Preference Optimization for Prompt Engineering

Title: Multi-Stage Verification-Centric Framework for Mitigating Hallucination in Multi-Modal RAG

Title: Multi-Agent Interactive Question Generation Framework for Long Document Understanding

Title: Goal Alignment in LLM-Based User Simulators for Conversational AI

Title: SGPO: Self-Generated Preference Optimization based on Self-Improver

Title: SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding

Title: Diversity-Enhanced Reasoning for Subjective Questions

Title: IQ Test for LLMs: An Evaluation Framework for Uncovering Core Skills in LLMs

Title: Reframe Your Life Story: Interactive Narrative Therapist and Innovative Moment Assessment with Large Language Models

Title: Modeling Professionalism in Expert Questioning through Linguistic Differentiation

Title: Post-Completion Learning for Language Models

Title: EMBRACE: Shaping Inclusive Opinion Representation by Aligning Implicit Conversations with Social Norms

Title: MoL-RL: Distilling Multi-Step Environmental Feedback into LLMs for Feedback-Independent Reasoning

Title: What Language(s) Does Aya-23 Think In? How Multilinguality Affects Internal Language Representations

Title: Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation

Title: RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Title: Length Representations in Large Language Models

Title: Cognitive Chain-of-Thought: Structured Multimodal Reasoning about Social Situations

Title: CONCAP: Seeing Beyond English with Concepts Retrieval-Augmented Captioning

Title: CodeNER: Code Prompting for Named Entity Recognition

Title: Speaking in Words, Thinking in Logic: A Dual-Process Framework in QA Systems

Title: AQUA: A Large Language Model for Aquaculture & Fisheries

Title: SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers

Title: Enhancing Hallucination Detection via Future Context

Title: ZSE-Cap: A Zero-Shot Ensemble for Image Retrieval and Prompt-Guided Captioning

Title: Before the Outrage: Challenges and Advances in Predicting Online Antisocial Behavior

Title: Ontology-Enhanced Knowledge Graph Completion using Large Language Models

Title: Geometric-Mean Policy Optimization

Title: When Scale Meets Diversity: Evaluating Language Models on Fine-Grained Multilingual Claim Verification

Title: Text2VLM: Adapting Text-Only Datasets to Evaluate Alignment Training in Visual Language Models

Title: Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study

Title: Multilingual Self-Taught Faithfulness Evaluators

Title: On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey

Title: Automating Thematic Review of Prevention of Future Deaths Reports: Replicating the ONS Child Suicide Study using Large Language Models

Title: Latent Inter-User Difference Modeling for LLM Personalization

Title: Leveraging Open-Source Large Language Models for Clinical Information Extraction in Resource-Constrained Settings

Title: Soft Injection of Task Embeddings Outperforms Prompt-Based In-Context Learning

Title: MediQAl: A French Medical Question Answering Dataset for Knowledge and Reasoning Evaluation

Title: FHSTP@EXIST 2025 Benchmark: Sexism Detection with Transparent Speech Concept Bottleneck Models

Title: FRED: Financial Retrieval-Enhanced Detection and Editing of Hallucinations in Language Models

Title: Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models

Title: Memorization in Fine-Tuned Large Language Models

Title: Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation