2025-01-28

Title: Unmasking Conversational Bias in AI Multiagent Systems

Title: JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models

Title: Dynamic Adaptation of LoRA Fine-Tuning for Efficient and Task-Specific Optimization of Large Language Models

Title: DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images

Title: Verify with Caution: The Pitfalls of Relying on Imperfect Factuality Metrics

Title: Self-reflecting Large Language Models: A Hegelian Dialectical Approach

Title: Context-Aware Neural Gradient Mapping for Fine-Grained Instruction Processing

Title: CASE-Bench: Context-Aware Safety Evaluation Benchmark for Large Language Models

Title: ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation

Title: Federated Retrieval Augmented Generation for Multi-Product Question Answering

Title: MDEval: Evaluating and Enhancing Markdown Awareness in Large Language Models

Title: AKVQ-VL: Attention-Aware KV Cache Adaptive 2-Bit Quantization for Vision-Language Models

Title: Using Large Language Models for education managements in Vietnamese with low resources

Title: An Attempt to Unraveling Token Prediction Refinement and Identifying Essential Layers of Large Language Models

Title: LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion

Title: Speech Translation Refinement using Large Language Models

Title: Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training

Title: Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads

Title: Option-ID Based Elimination For Multiple Choice Questions

Title: SEAL: Scaling to Emphasize Attention for Long-Context Retrieval

Title: Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

Title: ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval

Title: Prompting ChatGPT for Chinese Learning as L2: A CEFR and EBCL Level Study

Title: New Evaluation Paradigm for Lexical Simplification

Title: Pre-training a Transformer-Based Generative Model Using a Small Sepedi Dataset

Title: Are Human Interactions Replicable by Generative Agents? A Case Study on Pronoun Usage in Hierarchical Interactions

Title: You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning

Title: The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?

Title: Figurative-cum-Commonsense Knowledge Infusion for Multimodal Mental Health Meme Classification

Title: Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection

Title: Baichuan-Omni-1.5 Technical Report

Title: Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models

Title: Qwen2.5-1M Technical Report

Title: How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning

Title: Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency

Title: OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

Title: Token Democracy: The Architectural Limits of Alignment in Transformer-Based Language Models

Title: STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection

Title: Data-adaptive Safety Rules for Training Reward Models

Title: ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Title: Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models

Title: Instruction Tuning for Story Understanding and Generation with Weak Supervision

Title: Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Title: SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain

Title: Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets

Title: People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Title: TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs

Title: Transformer-Based Multimodal Knowledge Graph Completion with Link-Aware Contexts

Title: Adapting Biomedical Abstracts into Plain language using Large Language Models

Title: StaICC: Standardized Evaluation for Classification Task in In-context Learning

Title: ESGSenticNet: A Neurosymbolic Knowledge Base for Corporate Sustainability Analysis

Title: IndicMMLU-Pro: Benchmarking the Indic Large Language Models

Title: Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference

Title: Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages

Title: Large Language Models to Diffusion Finetuning

Title: MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer

Title: LCTG Bench: LLM Controlled Text Generation Benchmark

Title: Parametric Retrieval Augmented Generation

Title: MEL: Legal Spanish Language Model

Title: PISCO: Pretty Simple Compression for Retrieval-Augmented Generation

Title: Integration of LLM Quality Assurance into an NLG System

Title: AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought

Title: Can summarization approximate simplification? A gold standard comparison

Title: Provence: efficient and robust context pruning for retrieval-augmented generation

Title: DBRouting: Routing End User Queries to Databases for Answerability

Title: A foundation model for human-AI collaboration in medical literature mining

Title: Return of the Encoder: Maximizing Parameter Efficiency for SLMs

Title: URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT

Title: Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width

Title: RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval

Title: LUCY: Linguistic Understanding and Control Yielding Early Stage of Her