2024-12-10

Title: Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor

Title: Multi-Party Supervised Fine-tuning of Language Models for Multi-Party Dialogue Generation

Title: Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models

Title: CALICO: Conversational Agent Localization via Synthetic Data Generation

Title: Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications

Title: Knowledge Graphs are all you need: Leveraging KGs in Physics Question Answering

Title: A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

Title: A polar coordinate system represents syntax in large language models

Title: LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

Title: CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds

Title: Shifting NER into High Gear: The Auto-AdvER Approach

Title: Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression

Title: On the effective transfer of knowledge from English to Hindi Wikipedia

Title: PromptRefine: Enhancing Few-Shot Performance on Low-Resource Indic Languages with Example Selection from Related Example Banks

Title: A Comparative Study on Code Generation with Transformers

Title: Uncovering Uncertainty in Transformer Inference

Title: An Entailment Tree Generation Approach for Multimodal Multi-Hop Question Answering with Mixture-of-Experts and Iterative Feedback Mechanism

Title: A Self-Learning Multimodal Approach for Fake News Detection

Title: Are Clinical T5 Models Better for Clinical Text?

Title: Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents

Title: Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Title: Paraphrase-Aligned Machine Translation

Title: A Cross-Validation Study of Turkish Sentiment Analysis Datasets and Tools

Title: Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt

Title: Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Title: 1-800-SHARED-TASKS at RegNLP: Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering

Title: Steering Large Language Models to Evaluate and Amplify Creativity

Title: KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Title: Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling

Title: Infusing Prompts with Syntax and Semantics

Title: Evaluating and Mitigating Social Bias for Large Language Models in Open-ended Settings

Title: AIDE: Task-Specific Fine Tuning with Attribute Guided Multi-Hop Data Expansion

Title: Hate Speech According to the Law: An Analysis for Effective Detection

Title: SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs

Title: SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

Title: A Comparative Study of Learning Paradigms in Large Language Models via Intrinsic Dimension

Title: Optimizing Multi-Task Learning for Enhanced Performance in Large Language Models

Title: Methods for Legal Citation Prediction in the Age of LLMs: An Australian Law Case Study

Title: PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models

Title: LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation

Title: Gated Delta Networks: Improving Mamba2 with Delta Rule

Title: SafeWorld: Geo-Diverse Safety Alignment

Title: Small Languages, Big Models: A Study of Continual Training on Languages of Norway

Title: Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy

Title: Anchoring Bias in Large Language Models: An Experimental Study

Title: Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey

Title: GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary

Title: OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions

Title: JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM

Title: Training Large Language Models to Reason in a Continuous Latent Space