2025-04-18

Title: Unmasking the Reality of PII Masking Models: Performance Gaps and the Call for Accountability

Title: Learning Optimal Prompt Ensemble for Multi-source Visual Prompt Transfer

Title: Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles

Title: Exploring the Impact of Personality Traits on Conversational Recommender Systems: A Simulation with Large Language Models

Title: How to Detect and Defeat Molecular Mirage: A Metric-Driven Benchmark for Hallucination in LLM-based Molecular Comprehension

Title: Capybara-OMNI: An Efficient Paradigm for Building Omni-Modal Language Models

Title: Data Metabolism: An Efficient Data Design Schema For Vision Language Model

Title: ChatGPT as Linguistic Equalizer? Quantifying LLM-Driven Lexical Shifts in Academic Writing

Title: Has the Creativity of Large-Language Models peaked? An analysis of inter- and intra-LLM variability

Title: AttentionDefense: Leveraging System Prompt Attention for Explainable Defense Against Novel Jailbreaks

Title: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

Title: The Other Side of the Coin: Exploring Fairness in Retrieval-Augmented Generation

Title: Cross-Document Cross-Lingual Natural Language Inference via RST-enhanced Graph Fusion and Interpretability Prediction

Title: LLMTaxo: Leveraging Large Language Models for Constructing Taxonomy of Factual Claims from Social Media

Title: Reconstructing Sepsis Trajectories from Clinical Case Reports using LLMs: the Textual Time Series Corpus for Sepsis

Title: A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Title: HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation

Title: Span-level Emotion-Cause-Category Triplet Extraction with Instruction Tuning LLMs and Data Augmentation

Title: Can the capability of Large Language Models be described by human ability? A Meta Study

Title: Meta-Evaluating Local LLMs: Rethinking Performance Metrics for Serious Games

Title: QM-ToT: A Medical Tree of Thoughts Reasoning Framework for Quantized Model

Title: You've Changed: Detecting Modification of Black-Box Large Language Models

Title: "It Listens Better Than My Therapist": Exploring Social Media Discourse on LLMs as Mental Health Tool

Title: Paging Dr. GPT: Extracting Information from Clinical Notes to Enhance Patient Predictions

Title: GOAT-TTS: LLM-based Text-To-Speech Generation Optimized via A Dual-Branch Architecture

Title: Streamlining Biomedical Research with Specialized LLMs

Title: Benchmarking Biopharmaceuticals Retrieval-Augmented Generation Evaluation

Title: Propaganda via AI? A Study on Semantic Backdoors in Large Language Models

Title: Reimagining Urban Science: Scaling Causal Inference with Large Language Models

Title: Mathematical Capabilities of Large Language Models in Finnish Matriculation Examination

Title: A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports

Title: Leveraging Large Language Models for Multi-Class and Multi-Label Detection of Drug Use and Overdose Symptoms on Social Media

Title: Replicating ReLM Results: Validating Large Language Models with ReLM

Title: Position: The Most Expensive Part of an LLM should be its Training Data

Title: On Linear Representations and Pretraining Data Frequency in Language Models

Title: SLURG: Investigating the Feasibility of Generating Synthetic Online Fallacious Discourse

Title: Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex

Title: Can Pre-training Indicators Reliably Predict Fine-tuning Outcomes of LLMs?

Title: BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents

Title: Evaluating the Diversity and Quality of LLM Generated Content

Title: Memorization vs. Reasoning: Updating LLMs with New Knowledge

Title: Memorization: A Close Look at Books

Title: ELAB: Extensive LLM Alignment Benchmark in Persian Language

Title: CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented Generation

Title: MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation

Title: Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models

Title: GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning

Title: Towards Characterizing Subjectivity of Individuals through Modeling Value Conflicts and Trade-offs

Title: Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation

Title: Persona-judge: Personalized Alignment of Large Language Models via Token-level Self-judgment

Title: ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models

Title: GRAIL: Gradient-Based Adaptive Unlearning for Privacy and Copyright in LLMs

Title: Data-efficient LLM Fine-tuning for Code Generation

Title: Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations

Title: Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge

Title: Chinese-Vicuna: A Chinese Instruction-following Llama-based Model

Title: Out of Sight Out of Mind, Out of Sight Out of Mind: Measuring Bias in Language Models Against Overlooked Marginalized Groups in Regional Contexts

Title: Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration

Title: Assesing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation

Title: Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks

Title: ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos

Title: Are AI agents the new machine translation frontier? Challenges and opportunities of single- and multi-agent systems for multilingual digital communication

Title: Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models

Title: Benchmarking Multi-National Value Alignment for Large Language Models

Title: MAIN: Mutual Alignment Is Necessary for instruction tuning

Title: ConExion: Concept Extraction with Large Language Models

Title: Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback

Title: Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document Summarization

Title: Sparks of Science: Hypothesis Generation Using Structured Paper Data

Title: Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild

Title: SHA256 at SemEval-2025 Task 4: Selective Amnesia -- Constrained Unlearning for Large Language Models via Knowledge Isolation

Title: ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images

Title: Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation

Title: Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models

Title: Retrieval-Augmented Generation with Conflicting Evidence

Title: LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard

Title: Energy-Based Reward Models for Robust Language Model Alignment

Title: Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

Title: CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training