2024-02-21

Title: Turn Waste into Worth: Rectifying Top-$k$ Router of MoE

Title: ModelGPT: Unleashing LLM's Capabilities for Tailored Model Generation

Title: EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs

Title: Simulacra as Conscious Exotica

Title: Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

Title: Understanding Fine-grained Distortions in Reports of Scientific Findings

Title: Neuro-mimetic Task-free Unsupervised Online Learning with Continual Self-Organizing Maps

Title: In deep reinforcement learning, a pruned network is a good network

Title: Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Title: Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!

Title: Towards Cross-Domain Continual Learning

Title: Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection

Title: Induced Model Matching: How Restricted Models Can Help Larger Ones

Title: The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning

Title: Parallel Structures in Pre-training Data Yield In-Context Learning

Title: TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

Title: Creating a Fine Grained Entity Type Taxonomy Using LLMs

Title: CausalGym: Benchmarking causal interpretability methods on linguistic tasks

Title: Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models

Title: GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence

Title: Offline Multi-task Transfer RL with Representational Penalization

Title: Evolving AI Collectives to Enhance Human Diversity and Enable Self-Regulation

Title: Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation

Title: Reflect-RL: Two-Player Online RL Fine-Tuning for LMs

Title: Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation

Title: OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification

Title: HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts

Title: The FinBen: An Holistic Financial Benchmark for Large Language Models

Title: SoftQE: Learned Representations of Queries Expanded by LLMs

Title: Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies

Title: XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Title: Tree-Planted Transformers: Large Language Models with Implicit Syntactic Supervision

Title: FormulaQA: A Question Answering Dataset for Formula-Based Numerical Reasoning

Title: Are Large Language Models Rational Investors?

Title: UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation

Title: Can Large Language Models be Used to Provide Psychological Counselling? An Analysis of GPT-4-Generated Responses Using Role-play Dialogues

Title: Me LLaMA: Foundation Large Language Models for Medical Applications

Title: Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue

Title: Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations

Title: Few shot clinical entity recognition in three languages: Masked language models outperform LLM prompting

Title: SymBa: Symbolic Backward Chaining for Multi-step Natural Language Reasoning

Title: Scalable Decentralized Algorithms for Online Personalized Mean Estimation

Title: On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices

Title: Fine-Tuning, Prompting, In-Context Learning and Instruction-Tuning: How Many Labelled Samples Do We Need?

Title: Identifying Factual Inconsistency in Summaries: Towards Effective Utilization of Large Language Model

Title: PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs

Title: ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Title: PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning

Title: MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces

Title: Instruction-tuned Language Models are Better Knowledge Learners

Title: MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models

Title: Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

Title: Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data

Title: Skill or Luck? Return Decomposition via Advantage Functions

Title: Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Title: GRAFFORD: A Benchmark Dataset for Testing the Knowledge of Object Affordances of Language and Vision Models

Title: OPDAI at SemEval-2024 Task 6: Small LLMs can Accelerate Hallucination Detection with Weakly Supervised Data

Title: Large Language Model-based Human-Agent Collaboration for Complex Task Solving

Title: Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space

Title: GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick

Title: GlórIA - A Generative and Open Large Language Model for Portuguese

Title: The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis

Title: Can GNN be Good Adapter for LLMs?

Title: TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

Title: Phonotactic Complexity across Dialects

Title: Investigating the Impact of Model Instability on Explanations and Uncertainty

Title: Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

Title: Understanding the effects of language-specific class imbalance in multilingual fine-tuning

Title: SoMeLVLM: A Large Vision Language Model for Social Media Processing

Title: Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables

Title: Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models

Title: SiLLM: Large Language Models for Simultaneous Machine Translation

Title: Align Your Intents: Offline Imitation Learning via Optimal Transport

Title: Text-Guided Molecule Generation with Diffusion Language Model

Title: Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries

Title: Stable Knowledge Editing in Large Language Models

Title: Identifying Semantic Induction Heads to Understand In-Context Learning

Title: Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Title: Event-level Knowledge Editing

Title: ELAD: Explanation-Guided Large Language Models Active Distillation

Title: CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models

Title: A Survey on Knowledge Distillation of Large Language Models

Title: TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Title: The Hidden Space of Transformer Language Adapters

Title: Defending Jailbreak Prompts via In-Context Adversarial Game

Title: Benchmarking Retrieval-Augmented Generation for Medicine

Title: Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness

Title: What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents

Title: Question Calibration and Multi-Hop Modeling for Temporal Question Answering

Title: How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

Title: Bayesian Reward Models for LLM Alignment

Title: Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation

Title: Soft Self-Consistency Improves Language Model Agents

Title: Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A

Title: RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian

Title: AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning

Title: Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

Title: Investigating Cultural Alignment of Large Language Models

Title: TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Title: BiMediX: Bilingual Medical Mixture of Experts LLM