2024-02-27

Title: Beware of Words: Evaluating the Lexical Richness of Conversational Large Language Models

Title: Detecting misinformation through Framing Theory: the Frame Element-based Model

Title: Chain-of-Specificity: An Iteratively Refining Method for Eliciting Knowledge from Large Language Models

Title: PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

Title: Evaluating the Performance of ChatGPT for Spam Email Detection

Title: Foundation Policies with Hilbert Representations

Title: Prompting LLMs to Compose Meta-Review Drafts from Peer-Review Narratives of Scholarly Manuscripts

Title: Training Nonlinear Transformers for Efficient In-Context Learning: A Theoretical Learning and Generalization Analysis

Title: Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Title: Towards Efficient Active Learning in NLP via Pretrained Representations

Title: Language-Based User Profiles for Recommendation

Title: MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Title: Fine-Grained Self-Endorsement Improves Factuality and Reasoning

Title: Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models

Title: Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics

Title: Contact Complexity in Customer Service

Title: Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study

Title: Teacher-Student Learning on Complexity in Intelligent Routing

Title: Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology

Title: Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement

Title: Query Augmentation by Decoding Semantics from Brain Signals

Title: Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors

Title: Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

Title: How Do Humans Write Code? Large Models Do It the Same Way Too

Title: GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation

Title: Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning

Title: HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition

Title: Dental Severity Assessment through Few-shot Learning and SBERT Fine-tuning

Title: Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing all Tokens

Title: Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models

Title: Empowering Large Language Model Agents through Action Learning

Title: Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method

Title: A Theoretical Result on the Inductive Bias of RNN Language Models

Title: Linguistic Intelligence in Large Language Models for Telecommunications

Title: Reward Design for Justifiable Sequential Decision-Making

Title: Prompt Perturbation Consistency Learning for Robust Language Models

Title: MATHWELL: Generating Educational Math Word Problems at Scale

Title: SportQA: A Benchmark for Sports Understanding in Large Language Models

Title: SemEval-2024 Task 8: Weighted Layer Averaging RoBERTa for Black-Box Machine-Generated Text Detection

Title: Predicting Outcomes in Video Games with Long Short Term Memory Networks

Title: MultiContrievers: Analysis of Dense Retrieval Representations

Title: QuaCer-C: Quantitative Certification of Knowledge Comprehension in LLMs

Title: Evaluating Prompting Strategies for Grammatical Error Correction Based on Language Proficiency

Title: Frustratingly Simple Prompting-based Text Denoising

Title: Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach

Title: Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models

Title: GreenLLaMA: A Framework for Detoxification with Explanations

Title: Budget-Constrained Tool Learning with Planning

Title: Likelihood-based Mitigation of Evaluation Bias in Large Language Models

Title: PIDformer: Transformer Meets Control Theory

Title: $C^3$: Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding

Title: From Noise to Clarity: Unraveling the Adversarial Suffix of Large Language Model Attacks via Translation of Text Embeddings

Title: HiGPT: Heterogeneous Graph Language Model

Title: GraphWiz: An Instruction-Following Language Model for Graph Problems

Title: Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration

Title: Text Understanding and Generation Using Transformer Models for Intelligent E-commerce Recommendations

Title: Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research

Title: EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings

Title: Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy

Title: LLMs with Chain-of-Thought Are Non-Causal Reasoners

Title: Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression

Title: How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study

Title: Citation-Enhanced Generation for LLM-based Chatbot

Title: Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space

Title: Behavioral Refinement via Interpolant-based Policy Diffusion

Title: FuseChat: Knowledge Fusion of Chat Models

Title: InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Title: LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting

Title: What Generative Artificial Intelligence Means for Terminological Definitions

Title: PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

Title: From Text to Transformation: A Comprehensive Review of Large Language Models' Versatility

Title: DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

Title: Hitting "Probe"rty with Non-Linearity, and More

Title: How Can LLM Guide RL? A Value-Based Approach

Title: Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

Title: ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling

Title: HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs

Title: Learning Translations: Emergent Communication Pretraining for Cooperative Language Acquisition

Title: Topic-to-essay generation with knowledge-based content selection

Title: Foundation Model Transparency Reports

Title: From Large Language Models and Optimization to Decision Optimization CoPilot: A Research Manifesto

Title: PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Title: Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion

Title: Cross-domain Chinese Sentence Pattern Parsing

Title: Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Title: Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering

Title: Data-freeWeight Compress and Denoise for Large Language Models

Title: CodeS: Towards Building Open-source Language Models for Text-to-SQL

Title: MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs

Title: Language-guided Skill Learning with Temporal Variational Inference

Title: An Integrated Data Processing Framework for Pretraining Foundation Models

Title: Layer-wise Regularized Dropout for Neural Language Models

Title: LLM Inference Unveiled: Survey and Roofline Model Insights

Title: Where Do We Go from Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions

Title: Unraveling Babel: Exploring Multilingual Activation Patterns within Large Language Models

Title: Improving LLM-based Machine Translation with Systematic Self-Correction

Title: Immunization against harmful fine-tuning attacks

Title: MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property

Title: From RAGs to riches: Using large language models to write documents for clinical trials

Title: Predicting Sustainable Development Goals Using Course Descriptions -- from LLMs to Conventional Foundation Models

Title: RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions

Title: Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

Title: ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Title: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering

Title: Defending LLMs against Jailbreaking Attacks via Backtranslation

Title: Unveiling Vulnerability of Self-Attention

Title: mEdIT: Multilingual Text Editing via Instruction Tuning

Title: On Languaging a Simulation Engine

Title: LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments

Title: Memory GAPS: Would LLM pass the Tulving Test?

Title: LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification

Title: Label Learning Method Based on Tensor Projection

Title: Q-FOX Learning: Breaking Tradition in Reinforcement Learning

Title: Aligning Large Language Models to a Domain-specific Graph Database

Title: Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models

Title: Multi-Bit Distortion-Free Watermarking for Large Language Models

Title: Rethinking Negative Instances for Generative Named Entity Recognition

Title: Understanding the Dataset Practitioners Behind Large Language Model Development

Title: Long-Context Language Modeling with Parallel Context Encoding

Title: GenAINet: Enabling Wireless Collective Intelligence via Knowledge Transfer and Reasoning

Title: ESG Sentiment Analysis: comparing human and language model performance including GPT

Title: GigaPevt: Multimodal Medical Assistant

Title: RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation

Title: StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Title: Adaptation of Biomedical and Clinical Pretrained Models to French Long Documents: A Comparative Study

Title: HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization

Title: Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

Title: Generating Effective Ensembles for Sentiment Analysis

Title: SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection

Title: CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models

Title: Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems

Title: A Comprehensive Evaluation of Quantization Strategies for Large Language Models

Title: Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

Title: Set the Clock: Temporal Alignment of Pretrained Language Models

Title: OncoGPT: A Medical Conversational Model Tailored with Oncology Domain Expertise on a Large Language Model Meta-AI (LLaMA)

Title: Investigating the Effectiveness of HyperTuning via Gisting

Title: Nemotron-4 15B Technical Report

Title: Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Title: Language Agents as Optimizable Graphs

Title: A Survey on Data Selection for Language Models

Title: GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning

Title: Mysterious Projections: Multimodal LLMs Gain Domain-Specific Visual Capabilities Without Richer Cross-Modal Projections

Title: Eight Methods to Evaluate Robust Unlearning in LLMs

Title: Do Large Language Models Latently Perform Multi-Hop Reasoning?

Title: MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Title: Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding