2024-02-20

Title: Taxonomy-based CheckList for Large Language Model Evaluation

Title: LLM-Assisted Crisis Management: Building Advanced LLM Platforms for Effective Emergency Response and Public Collaboration

Title: Advances and Limitations in Open Source Arabic-Script OCR: A Case Study

Title: CultureLLM: Incorporating Cultural Differences into Large Language Models

Title: Zero-shot Explainable Mental Health Analysis on Social Media by incorporating Mental Scales

Title: The Unreasonable Effectiveness of Eccentric Automatic Prompts

Title: DAEDRA: A language model for predicting outcomes in passive pharmacovigilance reporting

Title: Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

Title: Measuring and Controlling Persona Drift in Language Model Dialogs

Title: GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Title: Generalization in Healthcare AI: Evaluation of a Clinical Large Language Model

Title: Generative AI and Process Systems Engineering: The Next Frontier

Title: Language Models with Conformal Factuality Guarantees

Title: SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs

Title: FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Title: WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing

Title: "Understanding AI": Semantic Grounding in Large Language Models

Title: The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains

Title: Exploring Value Biases: How LLMs Deviate Towards the Ideal

Title: PAT-Questions: A Self-Updating Benchmark for Present-Anchored Temporal Question-Answering

Title: Retrieval-Augmented Generation: Is Dense Passage Retrieval Retrieving?

Title: Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives

Title: Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

Title: Bridging Causal Discovery and Large Language Models: A Comprehensive Survey of Integrative Approaches and Future Directions

Title: AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

Title: Word Embeddings Revisited: Do LLMs Offer Something New?

Title: When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models

Title: Whose Emotions and Moral Sentiments Do Language Models Reflect?

Title: Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models

Title: BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

Title: Speculative Streaming: Fast LLM Inference without Auxiliary Models

Title: TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

Title: Contrastive Instruction Tuning

Title: Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models

Title: Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction

Title: Understanding News Thumbnail Representativeness by Counterfactual Text-Guided Contrastive Language-Image Pretraining

Title: PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation

Title: KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph

Title: GenDec: A robust generative Question-decomposition method for Multi-hop reasoning

Title: Token-Ensemble Text Generation: On Attacking the Automatic AI-Generated Text Detection

Title: M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection

Title: KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

Title: RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations

Title: LaCo: Large Language Model Pruning via Layer Collapse

Title: Disclosure and Mitigation of Gender Bias in LLMs

Title: I Learn Better If You Speak My Language: Enhancing Large Language Model Fine-Tuning with Style-Aligned Response Adjustments

Title: Assessing LLMs' Mathematical Reasoning in Financial Document Question Answering

Title: Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs

Title: Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

Title: Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs

Title: ZeroG: Investigating Cross-dataset Zero-shot Transferability in Graphs

Title: Can Large Language Models perform Relation-based Argument Mining?

Title: LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Title: Aligning Large Language Models by On-Policy Self-Judgment

Title: C-ICL: Contrastive In-context Learning for Information Extraction

Title: MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning

Title: Multi-Perspective Consistency Enhances Confidence Estimation in Large Language Models

Title: Can Large Multimodal Models Uncover Deep Semantics Behind Images?

Title: Puzzle Solving using Reasoning of Large Language Models: A Survey

Title: OneBit: Towards Extremely Low-bit Large Language Models

Title: Dissecting Human and LLM Preferences

Title: MMMModal -- Multi-Images Multi-Audio Multi-turn Multi-Modal

Title: EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries

Title: PhaseEvo: Towards Unified In-Context Prompt Optimization for Large Language Models

Title: Tasks That Language Models Don't Learn

Title: What Changed? Converting Representational Interventions to Natural Language

Title: Training Language Model Agents without Modifying Language Models

Title: Multi Task Inverse Reinforcement Learning for Common Sense Reward

Title: Reinforcement learning to maximise wind turbine energy generation

Title: Reasoning before Comparison: LLM-Enhanced Semantic Similarity Metrics for Domain Specialized Text Analysis

Title: Don't Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection

Title: Multi-dimensional Evaluation of Empathetic Dialog Responses

Title: Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Title: LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models

Title: Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

Title: EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models

Title: Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning

Title: Perils of Self-Feedback: Self-Bias Amplifies in Large Language Models

Title: InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration

Title: Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

Title: Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation

Title: In-Context Example Ordering Guided by Label Distributions

Title: SciAgent: Tool-augmented Language Models for Scientific Reasoning

Title: AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

Title: MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

Title: LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Title: FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence

Title: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation

Title: A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models

Title: DictLLM: Harnessing Key-Value Data Structures with Large Language Models for Enhanced Medical Diagnostics

Title: LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation

Title: What's the Plan? Evaluating and Developing Planning-Aware Techniques for LLMs

Title: Benchmarking Knowledge Boundary for Large Language Model: A Different Perspective on Model Evaluation

Title: Federated Fine-tuning of Large Language Models under Heterogeneous Language Tasks and Client Resources

Title: From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

Title: Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM

Title: Large Language Model-driven Meta-structure Discovery in Heterogeneous Information Network

Title: Unveiling the Secrets of Engaging Conversations: Factors that Keep Users Hooked on Role-Playing Dialog Agents

Title: Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models

Title: PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability

Title: Deciphering the lmpact of Pretraining Data on Large Language Models through Machine Unlearning

Title: Counter-intuitive: Large Language Models Can Better Understand Knowledge Graphs Than We Thought

Title: KMMLU: Measuring Massive Multitask Language Understanding in Korean

Title: LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

Title: Cobra Effect in Reference-Free Image Captioning Metrics

Title: BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

Title: Extensible Embedding: A Flexible Multipler For LLM's Context Length

Title: Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

Title: Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

Title: Self-evolving Autoencoder Embedded Q-Network

Title: Metric-Learning Encoding Models Identify Processing Profiles of Linguistic Features in BERT's Representations

Title: Decoding News Narratives: A Critical Analysis of Large Language Models in Framing Bias Detection

Title: SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models

Title: Metacognitive Retrieval-Augmented Large Language Models

Title: Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs

Title: Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Title: Towards Versatile Graph Learning Approach: from the Perspective of Large Language Models

Title: Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents

Title: Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing

Title: Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals

Title: Dynamic planning in hierarchical active inference

Title: Autocorrect for Estonian texts: final report from project EKTB25

Title: A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models

Title: Opening the black box of language acquisition

Title: One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

Title: ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model

Title: Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning

Title: Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers

Title: GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network

Title: A Note on Bias to Complete

Title: MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement Learning for Discrete Prompt Optimization

Title: Modelling Political Coalition Negotiations Using LLM-based Agents

Title: How Susceptible are Large Language Models to Ideological Manipulation?

Title: Numerical Claim Detection in Finance: A New Financial Dataset, Weak-Supervision Model, and Market Analysis

Title: Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic

Title: In-Context Learning Demonstration Selection via Influence Analysis

Title: ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Title: SPML: A DSL for Defending Language Models Against Prompt Attacks

Title: MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs

Title: ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

Title: Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations

Title: Evaluating the Effectiveness of Index-Based Treatment Allocation

Title: Uncovering Latent Human Wellbeing in Language Model Embeddings

Title: What Evidence Do Language Models Find Convincing?

Title: Unveiling the Magic: Investigating Attention Distillation in Retrieval-augmented Generation

Title: Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling

Title: LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs

Title: Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding

Title: FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema

Title: HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?

Title: Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

Title: Head-wise Shareable Attention for Large Language Models

Title: Microstructures and Accuracy of Graph Recall by Large Language Models

Title: Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization

Title: UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction

Title: Modularized Networks for Few-shot Hateful Meme Detection

Title: How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Title: LoRA Training in the NTK Regime has No Spurious Local Minima

Title: M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation

Title: The Colorful Future of LLMs: Evaluating and Improving LLMs as Emotional Supporters for Queer Youth

Title: ROSE Doesn't Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding

Title: Revisiting Knowledge Distillation for Autoregressive Language Models

Title: Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint

Title: Have Seen Me Before? Automating Dataset Updates Towards Reliable and Timely Evaluation

Title: SIBO: A Simple Booster for Parameter-Efficient Fine-Tuning

Title: Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models

Title: SoLA: Solver-Layer Adaption of LLM for Better Logic Reasoning

Title: Learning to Edit: Aligning LLMs with Knowledge Editing

Title: Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Title: A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning

Title: MRKE: The Multi-hop Reasoning Evaluation of LLMs by Knowledge Edition

Title: Comprehensive Cognitive LLM Agent for Smartphone GUI Automation

Title: LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation

Title: Automatic Evaluation for Mental Health Counseling using LLMs

Title: DB-LLM: Accurate Dual-Binarization for Efficient LLMs

Title: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations

Title: Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Title: A Systematic Comparison of Contextualized Word Embeddings for Lexical Semantic Change

Title: Distilling Large Language Models for Text-Attributed Graph Learning

Title: Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

Title: Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space

Title: Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs

Title: Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics

Title: Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations

Title: Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models

Title: Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs

Title: Are LLM-based Evaluators Confusing NLG Quality Criteria?

Title: All Language Models Large and Small

Title: WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More

Title: Interpretable Brain-Inspired Representations Improve RL Performance on Visual Navigation Tasks

Title: EmoBench: Evaluating the Emotional Intelligence of Large Language Models

Title: Can LLMs Compute with Reasons?

Title: Do Large Language Models Understand Logic or Just Mimick Context?

Title: Groot: Adversarial Testing for Generative Text-to-Image Models with Tree-based Semantic Transformation

Title: Is It a Free Lunch for Removing Outliers during Pretraining?

Title: Evaluating Image Review Ability of Vision Language Models

Title: Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement

Title: End-to-end multilingual fact-checking at scale

Title: Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

Title: Transformer-based Causal Language Models Perform Clustering

Title: Endowing Pre-trained Graph Models with Provable Fairness

Title: Unsupervised LLM Adaptation for Question Answering

Title: BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence

Title: Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-tuning

Title: Amplifying Training Data Exposure through Fine-Tuning with Pseudo-Labeled Memberships

Title: A Chinese Dataset for Evaluating the Safeguards in Large Language Models

Title: Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion

Title: Zero shot VLMs for hate meme detection: Are we there yet?

Title: Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT

Title: Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages

Title: Polarization of Autonomous Generative AI Agents Under Echo Chambers

Title: Reformatted Alignment

Title: AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Title: Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers

Title: Task-Oriented Dialogue with In-Context Learning

Title: Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark

Title: Shallow Synthesis of Knowledge in GPT-Generated Texts: A Case Study in Automatic Related Work Composition

Title: NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms

Title: Uncertainty quantification in fine-tuned LLMs using LoRA ensembles

Title: High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models

Title: WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment

Title: Key ingredients for effective zero-shot cross-lingual knowledge transfer in generative tasks

Title: Adaptive Skeleton Graph Decoding

Title: Refining Minimax Regret for Unsupervised Environment Design

Title: KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students

Title: Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports

Title: ARKS: Active Retrieval in Knowledge Soup for Code Generation

Title: LLM Agents for Psychology: A Study on Gamified Assessments

Title: Shall We Talk: Exploring Spontaneous Collaborations of Competing LLM Agents

Title: Query-Based Adversarial Prompt Generation

Title: Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models

Title: Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

Title: GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

Title: Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge

Title: Emergent Word Order Universals from Cognitively-Motivated Language Models

Title: A Critical Evaluation of AI Feedback for Aligning Large Language Models

Title: A synthetic data approach for domain generalization of NLI models

Title: AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

Title: Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding