2025-03-04

Title: Eeyore: Realistic Depression Simulation via Supervised and Preference Optimization

Title: A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety

Title: KVCrush: Key value cache size-reduction using similarity in head-behaviour

Title: Do Emotions Really Affect Argument Convincingness? A Dynamic Approach with LLM-based Manipulation Checks

Title: Evaluating Large Language Models on the Spanish Medical Intern Resident (MIR) Examination 2024/2025: A Comparative Analysis of Clinical Reasoning and Knowledge Application

Title: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis

Title: Constraining Sequential Model Editing with Editing Anchor Compression

Title: Zero-Shot Defense Against Toxic Images via Inherent Multimodal Alignment in LVLMs

Title: from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors

Title: Evaluation of LLMs-based Hidden States as Author Representations for Psychological Human-Centered NLP Tasks

Title: AnnoCaseLaw: A Richly-Annotated Dataset For Benchmarking Explainable Legal Judgment Prediction

Title: Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations

Title: SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models

Title: Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs

Title: A Survey of Uncertainty Estimation Methods on Large Language Models

Title: Llamarine: Open-source Maritime Industry-specific Large Language Model

Title: À la recherche du sens perdu: your favourite LLM might have more to say than you can understand

Title: Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking

Title: Decoupling Content and Expression: Two-Dimensional Detection of AI-Generated Text

Title: Robust Multi-Objective Preference Alignment with Online DPO

Title: Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-Tuning

Title: How Deep is Love in LLMs' Hearts? Exploring Semantic Size in Human-like Cognition

Title: More of the Same: Persistent Representational Harms Under Increased Representation

Title: U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack

Title: Structured Reasoning for Fairness: A Multi-Agent Approach to Bias Detection in Textual Data

Title: BERT-based model for Vietnamese Fact Verification Dataset

Title: Approaching the Limits to EFL Writing Enhancement with AI-generated Text and Diverse Learners

Title: Smoothing Grounding and Reasoning for MLLM-Powered GUI Agents with Query-Oriented Pivot Tasks

Title: A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information

Title: AILS-NTUA at SemEval-2025 Task 8: Language-to-Code prompting and Error Fixing for Tabular Question Answering

Title: Rehearse With User: Personalized Opinion Summarization via Role-Playing based on Large Language Models

Title: Embracing Diversity: A Multi-Perspective Approach with Soft Labels

Title: Tutorial Proposal: Speculative Decoding for Efficient LLM Inference

Title: ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models

Title: LoR2C: Low-Rank Residual Connection Adaptation for Parameter-Efficient Fine-Tuning

Title: BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge

Title: Zero-Shot Keyphrase Generation: Investigating Specialized Instructions and Multi-Sample Aggregation on Large Language Models

Title: An evaluation of DeepSeek Models in Biomedical Natural Language Processing

Title: Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies

Title: RAPID: Efficient Retrieval-Augmented Long Text Generation with Writing Planning and Information Discovery

Title: Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity

Title: DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Title: Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Title: Waste Not, Want Not; Recycled Gumbel Noise Improves Consistency in Natural Language Generation

Title: Rewarding Graph Reasoning Process makes LLMs more Generalized Reasoners

Title: Argument Summarization and its Evaluation in the Era of Large Language Models

Title: Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Title: DUAL: Diversity and Uncertainty Active Learning for Text Summarization

Title: Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction

Title: HiBench: Benchmarking LLMs Capability on Hierarchical Structure Reasoning

Title: SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking

Title: Dialogue Without Limits: Constant-Sized KV Caches for Extended Responses in LLMs

Title: Evaluating Polish linguistic and cultural competency in large language models

Title: Language Models Predict Empathy Gaps Between Social In-groups and Out-groups

Title: Language-agnostic, automated assessment of listeners' speech recall using large language models

Title: AI-Invented Tonal Languages: Preventing a Machine Lingua Franca Beyond Human Understanding

Title: Scientific Reasoning: Assessment of Multimodal Generative LLMs

Title: Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs

Title: Beyond QA Pairs: Assessing Parameter-Efficient Fine-Tuning for Fact Embedding in LLMs

Title: How Well do LLMs Compress Their Own Chain-of-Thought? A Token Complexity Approach

Title: MiLiC-Eval: Benchmarking Multilingual LLMs for China's Minority Languages

Title: ReaderLM-v2: Small Language Model for HTML to Markdown and JSON

Title: Nature-Inspired Population-Based Evolution of Large Language Models

Title: Large Language Models for Healthcare Text Classification: A Systematic Review

Title: Cancer Type, Stage and Prognosis Assessment from Pathology Reports using LLMs

Title: PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation

Title: ChatGPT for President! Presupposed content in politicians versus GPT-generated texts

Title: Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning

Title: PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation

Title: Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Title: Explainable Depression Detection in Clinical Interviews with Personalized Retrieval-Augmented Generation

Title: WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models

Title: Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models

Title: Same Question, Different Words: A Latent Adversarial Framework for Prompt Robustness

Title: SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph

Title: SwiLTra-Bench: The Swiss Legal Translation Benchmark

Title: Q-NL Verifier: Leveraging Synthetic Data for Robust Knowledge Graph Question Answering

Title: Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace

Title: Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding

Title: Rethinking Data: Towards Better Performing Domain-Specific Small Language Models

Title: SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction

Title: Improving Retrospective Language Agents via Joint Policy Gradient Optimization

Title: Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh

Title: Liger: Linearizing Large Language Models to Gated Recurrent Structures

Title: SampleMix: A Sample-wise Pre-training Data Mixing Strategy by Coordinating Data Quality and Diversity

Title: KoWit-24: A Richly Annotated Dataset of Wordplay in News Headlines

Title: Evaluation and Facilitation of Online Discussions in the LLM Era: A Survey

Title: Pragmatic Inference Chain (PIC) Improving LLMs' Reasoning of Authentic Implicit Toxic Language

Title: Revisiting Large Language Model Pruning using Neuron Semantic Attribution

Title: Attention Condensation via Sparsity Induced Regularized Training

Title: Beyond Prompting: An Efficient Embedding Framework for Open-Domain Question Answering

Title: In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models

Title: DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation

Title: Detecting Stylistic Fingerprints of Large Language Models

Title: Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization

Title: Automated Annotation of Evolving Corpora for Augmenting Longitudinal Network Data: A Framework Integrating Large Language Models and Expert Knowledge

Title: When an LLM is apprehensive about its answers -- and when its uncertainty is justified

Title: Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution

Title: Word Form Matters: LLMs' Semantic Reconstruction under Typoglycemia

Title: Syntactic Learnability of Echo State Neural Language Models at Scale

Title: Building Safe GenAI Applications: An End-to-End Overview of Red Teaming for Large Language Models

Title: Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Title: Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

Title: Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Title: Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models

Title: $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

Title: Large-Scale Data Selection for Instruction Tuning

Title: Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models

Title: From Language to Cognition: How LLMs Outgrow the Human Language Network

Title: Rotary Outliers and Rotary Offset Features in Large Language Models

Title: CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom

Title: EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test

Title: Can (A)I Change Your Mind?