2025-03-18

Title: Explainable Sentiment Analysis with DeepSeek-R1: Performance, Efficiency, and Few-Shot Learning

Title: TRUTH DECAY: Quantifying Multi-Turn Sycophancy in Language Models

Title: Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs

Title: LogitLens4LLMs: Extending Logit Lens Analysis to Modern Large Language Models

Title: reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

Title: Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques

Title: Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring

Title: A Transformer and Prototype-based Interpretable Model for Contextual Sarcasm Detection

Title: OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs

Title: GPT's Devastated and LLaMA's Content: Emotion Representation Alignment in LLMs for Keyword-based Generation

Title: Resolving UnderEdit & OverEdit with Iterative & Neighbor-Assisted Model Editing

Title: LLMs for Translation: Historical, Low-Resourced Languages and Contemporary AI Models

Title: LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama

Title: REGEN: A Dataset and Benchmarks with Natural Language Critiques and Narratives

Title: Integration of Explainable AI Techniques with Large Language Models for Enhanced Interpretability for Sentiment Analysis

Title: HInter: Exposing Hidden Intersectional Bias in Large Language Models

Title: No LLM is Free From Bias: A Comprehensive Study of Bias Evaluation in Large Language models

Title: Applications of Large Language Model Reasoning in Feature Generation

Title: TLUE: A Tibetan Language Understanding Evaluation Benchmark

Title: Information-Guided Identification of Training Data Imprint in (Proprietary) Large Language Models

Title: Large Language Models in Legislative Content Analysis: A Dataset from the Polish Parliament

Title: RECSIP: REpeated Clustering of Scores Improving the Precision

Title: MT-RewardTree: A Comprehensive Framework for Advancing LLM-Based Machine Translation via Reward Modeling

Title: Seeing Sarcasm Through Different Eyes: Analyzing Multimodal Sarcasm Perception in Large Vision-Language Models

Title: Improving LLM-based Document-level Machine Translation with Multi-Knowledge Fusion

Title: PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing

Title: Interpretation Gaps in LLM-Assisted Comprehension of Privacy Documents

Title: Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

Title: The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation

Title: SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Title: General Table Question Answering via Answer-Formula Joint Generation

Title: Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs

Title: Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs

Title: HKCanto-Eval: A Benchmark for Evaluating Cantonese Language Understanding and Cultural Comprehension in LLMs

Title: CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences

Title: EXAONE Deep: Reasoning Enhanced Language Models

Title: Investigating Human-Aligned Large Language Model Uncertainty

Title: Basic Category Usage in Vision Language Models

Title: From Guessing to Asking: An Approach to Resolving the Persona Knowledge Gap in LLMs during Multi-Turn Conversations

Title: RaSA: Rank-Sharing Low-Rank Adaptation

Title: UniBERTs: Adversarial Training for Language-Universal Representations

Title: Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility

Title: RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Title: Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation

Title: nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity

Title: HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models

Title: ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs

Title: HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model

Title: A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models

Title: Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task

Title: A Framework to Assess Multilingual Vulnerabilities of LLMs

Title: ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning

Title: Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa

Title: REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities

Title: Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences

Title: Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach

Title: Can Language Models Follow Multiple Turns of Entangled Instructions?

Title: TablePilot; Recommending Human-Preferred Tabular Data Analysis with Large Language Models

Title: LLM-Match: An Open-Sourced Patient Matching Model Based on Large Language Models and Retrieval-Augmented Generation

Title: A Survey on Transformer Context Extension: Approaches and Evaluation

Title: Computation Mechanism Behind LLM Position Generalization

Title: Reliable and Efficient Amortized Model-based Evaluation

Title: Valid Text-to-SQL Generation with Unification-based DeepStochLog

Title: Aligned Probing: Relating Toxic Behavior and Model Internals

Title: Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis

Title: DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective

Title: SuperBPE: Space Travel for Language Models

Title: Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance

Title: MetaScale: Test-Time Scaling with Evolving Meta-Thoughts