2024-11-26

Title: Can Open-source LLMs Enhance Data Augmentation for Toxic Detection?: An Experimental Study

Title: Sycophancy in Large Language Models: Causes and Mitigations

Title: PPLqa: An Unsupervised Information-Theoretic Quality Metric for Comparing Generative Large Language Models

Title: Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering

Title: On the Impact of Fine-Tuning on Chain-of-Thought Reasoning

Title: From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set

Title: Exploring Large Language Models for Multimodal Sentiment Analysis: Challenges, Benchmarks, and Future Directions

Title: Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts

Title: Towards Robust Evaluation of Unlearning in LLMs via Data Transformations

Title: Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai

Title: Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark

Title: Traditional Chinese Medicine Case Analysis System for High-Level Semantic Abstraction: Optimized with Prompt and RAG

Title: Enhancing Grammatical Error Detection using BERT with Cleaned Lang-8 Dataset

Title: From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars

Title: A Survey on LLM-as-a-Judge

Title: Multi-label Sequential Sentence Classification via Large Language Model

Title: "All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations

Title: AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset

Title: Improving Next Tokens via Second-Last Predictions with Generate and Refine

Title: Ontology-Constrained Generation of Domain-Specific Clinical Summaries

Title: RAMIE: Retrieval-Augmented Multi-task Information Extraction with Large Language Models on Dietary Supplements

Title: LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Title: Development of Pre-Trained Transformer-based Models for the Nepali Language

Title: A Method for Building Large Language Models with Predefined KV Cache Capacity

Title: LoRA-Mini : Adaptation Matrices Decomposition and Selective Training

Title: Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?

Title: LLMs Do Not Think Step-by-step In Implicit Reasoning

Title: Evaluating Large Language Models for Causal Modeling

Title: Generative Context Distillation

Title: Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown

Title: Multi-ToM: Evaluating Multilingual Theory of Mind Capabilities in Large Language Models

Title: Exploring Performance Contrasts in TableQA: Step-by-Step Reasoning Boosts Bigger Language Models, Limits Smaller Language Models

Title: TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation

Title: SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text

Title: LLM Augmentations to support Analytical Reasoning over Multiple Documents

Title: MH-MoE:Multi-Head Mixture-of-Experts

Title: DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings

Title: NormXLogit: The Head-on-Top Never Lies

Title: BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Title: Can AI grade your essays? A comparative analysis of large language models and teacher ratings in multidimensional essay scoring

Title: Preference Optimization for Reasoning with Pseudo Feedback

Title: The Two-Hop Curse: LLMs trained on A->B, B->C fail to learn A-->C

Title: FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web

Title: Human-Calibrated Automated Testing and Validation of Generative Language Models

Title: Adapter-based Approaches to Knowledge-enhanced Language Models -- A Survey

Title: Finding Structure in Language Models

Title: Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval

Title: When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?

Title: O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Title: AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning

Title: Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings

Title: Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Title: StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training

Title: Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation

Title: Self-Generated Critiques Boost Reward Modeling for Language Models

Title: Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?