2025-04-29

Title: Mind the Language Gap: Automated and Augmented Evaluation of Bias in LLMs for High- and Low-Resource Languages

Title: Span-Level Hallucination Detection for LLM-Generated Answers

Title: Can Third-parties Read Our Emotions?

Title: Building UD Cairo for Old English in the Classroom

Title: EvidenceBench: A Benchmark for Extracting Evidence from Biomedical Papers

Title: SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning

Title: Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation

Title: Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Title: Towards Robust Dialogue Breakdown Detection: Addressing Disruptors in Large Language Models with Self-Guided Reasoning

Title: When2Call: When (not) to Call Tools

Title: Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation

Title: Latent Adversarial Training Improves the Representation of Refusal

Title: A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification

Title: MTCSC: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction

Title: LawFlow : Collecting and Simulating Lawyers' Thought Processes

Title: Dynamic Fisher-weighted Model Merging via Bayesian Optimization

Title: Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs

Title: Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting

Title: KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation

Title: Calibrating Translation Decoding with Quality Estimation on LLMs

Title: Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models

Title: ClimaEmpact: Domain-Aligned Small Language Models and Datasets for Extreme Weather Analytics

Title: Sample-Efficient Language Model for Hinglish Conversational AI

Title: Efficient Reasoning for LLMs through Speculative Chain-of-Thought

Title: Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation

Title: APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries

Title: SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

Title: WuNeng: Hybrid State with Attention

Title: Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers

Title: VIST-GPT: Ushering in the Era of Visual Storytelling with LLMs?

Title: AndroidGen: Building an Android Language Agent under Data Scarcity

Title: BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese

Title: Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing

Title: Explanatory Summarization with Discourse-Driven Planning

Title: ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers

Title: Context Selection and Rewriting for Video-based EducationalQuestion Generation

Title: Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Title: Context-Guided Dynamic Retrieval for Improving Generation Quality in RAG Models

Title: Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks

Title: Towards Long Context Hallucination Detection

Title: BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text

Title: Conflicts in Texts: Data, Implications and Challenges

Title: Detecting Effects of AI-Mediated Communication on Language Complexity and Sentiment

Title: m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training

Title: Coreference Resolution for Vietnamese Narrative Texts

Title: VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning

Title: Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs

Title: Taming the Titans: A Survey of Efficient LLM Inference Serving

Title: LLM-Assisted Automated Deductive Coding of Dialogue Data: Leveraging Dialogue-Specific Characteristics to Enhance Contextual Understanding

Title: Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs

Title: Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance

Title: Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language

Title: semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage

Title: GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets

Title: Assessing the Potential of Generative Agents in Crowdsourced Fact-Checking

Title: TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons

Title: Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom

Title: LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation

Title: Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages

Title: AutoJudge: Judge Decoding Without Manual Annotation