2025-02-21

Title: MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures

Title: DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

Title: Semantic Decomposition and Selective Context Filtering -- Text Processing Techniques for Context-Aware NLP-Based Systems

Title: Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder

Title: RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression

Title: Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral

Title: Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning

Title: Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning

Title: Towards Context-Robust LLMs: A Gated Representation Fine-tuning Approach

Title: Meaning Beyond Truth Conditions: Evaluating Discourse Level Understanding via Anaphora Accessibility

Title: Benchmarking LLMs for Political Science: A United Nations Perspective

Title: Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above

Title: Can Community Notes Replace Professional Fact-Checkers?

Title: Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification

Title: UM_FHS at TREC 2024 PLABA: Exploration of Fine-tuning and AI agent approach for plain language adaptations of biomedical text

Title: LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems

Title: Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction

Title: QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification

Title: NLP-AKG: Few-Shot Construction of NLP Academic Knowledge Graph Based on LLM

Title: On-the-fly Preference Alignment via Principle-Guided Decoding

Title: Transfer-Prompting: Enhancing Cross-Task Adaptation in Large Language Models via Dual-Stage Prompts Optimization

Title: Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering

Title: Effects of Prompt Length on Domain-specific Tasks for Large Language Models

Title: Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Title: MCQA-Eval: Efficient Confidence Evaluation in NLG with Gold-Standard Correctness Labels

Title: PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant

Title: Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models

Title: Fact or Guesswork? Evaluating Large Language Model's Medical Knowledge with Structured One-Hop Judgment

Title: EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts

Title: Vulnerability of Text-to-Image Models to Prompt Template Stealing: A Differential Evolution Approach

Title: Drift: Decoding-time Personalized Alignments with Implicit User Preferences

Title: SEA-HELM: Southeast Asian Holistic Evaluation of Language Models

Title: MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

Title: Unveiling Cultural Blind Spots: Analyzing the Limitations of mLLMs in Procedural Text Comprehension

Title: ParallelComp: Parallel Long-Context Compressor for Length Extrapolation

Title: Line Goes Up? Inherent Limitations of Benchmarks for Evaluating Large Language Models

Title: A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics

Title: English Please: Evaluating Machine Translation for Multilingual Bug Reports

Title: Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Title: SR-LLM: Rethinking the Structured Representation in Large Language Model

Title: Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning

Title: Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests

Title: Entropy-UID: A Method for Optimizing Information Density

Title: A Similarity Paradigm Through Textual Regularization Without Forgetting

Title: Rumor Detection by Multi-task Suffix Learning based on Time-series Dual Sentiments

Title: Tradutor: Building a Variety Specific Translation Model

Title: Leveraging Small LLMs for Argument Mining in Education: Argument Component Identification, Classification, and Assessment

Title: Unstructured Evidence Attribution for Long Context Query Focused Summarization

Title: A Survey on Data Contamination for Large Language Models

Title: Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models

Title: Natural Language Generation

Title: PredictaBoard: Benchmarking LLM Score Predictability

Title: Optimal word order for non-causal text generation with Large Language Models: the Spanish case

Title: Enhancing Smart Environments with Context-Aware Chatbots using Large Language Models

Title: Argument-Based Comparative Question Answering Evaluation Benchmark

Title: Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression

Title: NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models

Title: StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following

Title: Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization

Title: MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Title: How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Title: Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases

Title: CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models

Title: LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization

Title: LLM-based User Profile Management for Recommender System

Title: Multiscale Byte Language Models -- A Hierarchical Architecture for Causal Million-Length Sequence Modeling

Title: Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs

Title: Behavioral Analysis of Information Salience in Large Language Models

Title: FIND: Fine-grained Information Density Guided Adaptive Retrieval-Augmented Generation for Disease Diagnosis

Title: Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity

Title: NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Title: How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation

Title: Length-Controlled Margin-Based Preference Optimization without Reference Model

Title: LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning

Title: Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs

Title: InstructAgent: Building User Controllable Recommender via LLM Agent

Title: AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Title: Explanations of Deep Language Models Explain Language Representations in the Brain

Title: Data-Constrained Synthesis of Training Data for De-Identification

Title: How to Get Your LLM to Generate Challenging Problems for Evaluation

Title: Bridging the Gap: Transforming Natural Language Questions into SQL Queries via Abstract Query Pattern and Contextual Schema Markup

Title: I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search

Title: Entity Framing and Role Portrayal in the News

Title: SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Title: HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States

Title: Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs

Title: TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators

Title: On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems

Title: Step-by-Step Fact Verification System for Medical Claims with Explainable Reasoning

Title: Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis

Title: Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Title: SurveyX: Academic Survey Automation via Large Language Models

Title: Harnessing PDF Data for Improving Japanese Large Multimodal Models

Title: ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting

Title: Rapid Word Learning Through Meta In-Context Learning

Title: From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Title: eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables

Title: Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps

Title: Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs

Title: Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Title: Revealing and Mitigating Over-Attention in Knowledge Editing

Title: GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks

Title: CLIPPER: Compression enables long-context synthetic data generation

Title: FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Title: Aligning LLMs to Ask Good Questions: A Case Study in Clinical Reasoning

Title: LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention