2025-04-08

Title: A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System

Title: Do "New Snow Tablets" Contain Snow? Large Language Models Over-Rely on Names to Identify Ingredients of Chinese Drugs

Title: Sample, Don't Search: Rethinking Test-Time Alignment for Language Models

Title: Entropy-Based Block Pruning for Efficient Large Language Models

Title: What Large Language Models Do Not Talk About: An Empirical Study of Moderation and Censorship Practices

Title: Do LLM Evaluators Prefer Themselves for a Reason?

Title: CliME: Evaluating Multimodal Climate Discourse on Social Media and the Climate Alignment Quotient (CAQ)

Title: Adaptation of Large Language Models

Title: YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization

Title: Language Models Are Implicitly Continuous

Title: Clinical ModernBERT: An efficient and long context encoder for biomedical text

Title: Structured Extraction of Process Structure Properties Relationships in Materials Science

Title: Algorithmic Prompt Generation for Diverse Human-like Teaming and Communication with Large Language Models

Title: Rethinking Reflection in Pre-Training

Title: SyLeR: A Framework for Explicit Syllogistic Legal Reasoning in Large Language Models

Title: FISH-Tuning: Enhancing PEFT Methods with Fisher Information

Title: VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation

Title: Collaboration and Controversy Among Experts: Rumor Early Detection by Tuning a Comment Generator

Title: A Benchmark for End-to-End Zero-Shot Biomedical Relation Extraction with LLMs: Experiments with OpenAI Models

Title: Precise Legal Sentence Boundary Detection for Retrieval at Scale: NUPunkt and CharBoundary

Title: Cognitive Debiasing Large Language Models for Decision-Making

Title: Reasoning on Multiple Needles In A Haystack

Title: STEP: Staged Parameter-Efficient Pre-training for Large Language Models

Title: Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources

Title: GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models

Title: Adaptive Elicitation of Latent Information Using Natural Language

Title: Towards Understanding and Improving Refusal in Compressed Models via Mechanistic Interpretability

Title: A Perplexity and Menger Curvature-Based Approach for Similarity Evaluation of Large Language Models

Title: Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models

Title: Lost in Multilinguality: Dissecting Cross-lingual Factual Inconsistency in Transformer Language Models

Title: Could AI Trace and Explain the Origins of AI-Generated Images and Text?

Title: Cross-Asset Risk Management: Integrating LLMs for Real-Time Monitoring of Equity, Fixed Income, and Currency Markets

Title: Dynamic Hedging Strategies in Derivatives Markets with LLM-Driven Sentiment and News Analytics

Title: CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization

Title: Balancing Complexity and Informativeness in LLM-Based Clustering: Finding the Goldilocks Zone

Title: IMPersona: Evaluating Individual Level LM Impersonation

Title: Hallucination Detection using Multi-View Attention Features

Title: Generative Large Language Models Trained for Detecting Errors in Radiology Reports

Title: Compression Laws for Large Language Models

Title: StyleRec: A Benchmark Dataset for Prompt Recovery in Writing Style Transformation

Title: PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages

Title: Pre-trained Language Models and Few-shot Learning for Medical Entity Extraction

Title: An overview of model uncertainty and variability in LLM-based sentiment analysis. Challenges, mitigation strategies and the role of explainability

Title: Saliency-driven Dynamic Token Pruning for Large Language Models

Title: An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models

Title: KnowsLM: A framework for evaluation of small language models for knowledge augmentation and humanised conversations

Title: Steering off Course: Reliability Challenges in Steering Language Models

Title: Splits! A Flexible Dataset for Evaluating a Model's Demographic Social Inference

Title: scAgent: Universal Single-Cell Annotation via a LLM Agent

Title: Causal Retrieval with Semantic Consideration

Title: Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts

Title: Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Title: Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models

Title: T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Title: TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context

Title: Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs

Title: Improving Multilingual Retrieval-Augmented Language Models through Dialectic Reasoning Argumentations

Title: Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Title: SAFT: Structure-aware Transformers for Textual Interaction Classification

Title: Leveraging Large Language Models for Cost-Effective, Multilingual Depression Detection and Severity Assessment

Title: Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration

Title: M-Prometheus: A Suite of Open Multilingual LLM Judges

Title: A Domain-Based Taxonomy of Jailbreak Vulnerabilities in Large Language Models

Title: Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs

Title: Surveying Professional Writers on AI: Limitations, Expectations, and Fears

Title: Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models

Title: Not All Data Are Unlearned Equally

Title: On the Performance of an Explainable Language Model on PubMedQA

Title: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning

Title: AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments

Title: DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation

Title: CARE: Aligning Language Models for Regional Cultural Awareness

Title: Concise Reasoning via Reinforcement Learning

Title: Post-Training Language Models for Continual Relation Extraction

Title: NoveltyBench: Evaluating Creativity and Diversity in Language Models

Title: LLM-based Automated Grading with Human-in-the-Loop

Title: Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Title: Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation

Title: Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations