2024-10-29

Title: Ensembling Finetuned Language Models for Text Classification

Title: Improving Multimodal Large Language Models Using Continual Learning

Title: Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models

Title: A Survey of Small Language Models

Title: Vulnerability of LLMs to Vertically Aligned Text Manipulations

Title: Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions

Title: Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization

Title: Dynamic layer selection in decoder-only transformers

Title: Beyond Fine-Tuning: Effective Strategies for Mitigating Hallucinations in Large Language Models for Data Analytics

Title: Architectural Flaw Detection in Civil Engineering Using GPT-4

Title: RARe: Retrieval Augmented Retrieval with In-Context Examples

Title: Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs

Title: DAWN-ICL: Strategic Planning of Problem-solving Trajectories for Zero-Shot In-Context Learning

Title: Generative linguistics contribution to artificial intelligence: Where this contribution lies?

Title: A Survey of Large Language Models for Arabic Language and its Dialects

Title: Improving Model Evaluation using SMART Filtering of Benchmark Datasets

Title: Fast Best-of-N Decoding via Speculative Rejection

Title: Fine-Tuning and Evaluating Open-Source Large Language Models for the Army Domain

Title: Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Data

Title: Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs

Title: Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation

Title: Maintaining Informative Coherence: Migrating Hallucinations in Large Language Models via Absorbing Markov Chains

Title: Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation

Title: MedGo: A Chinese Medical Large Language Model

Title: TrajAgent: An Agent Framework for Unified Trajectory Modelling

Title: What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration

Title: FIRP: Faster LLM inference via future intermediate representation prediction

Title: $\textit{Who Speaks Matters}$: Analysing the Influence of the Speaker's Ethnicity on Hate Classification

Title: MatViX: Multimodal Information Extraction from Visually Rich Articles

Title: Is Moral Self-correction An Innate Capability of Large Language Models? A Mechanistic Analysis to Self-correction

Title: SubjECTive-QA: Measuring Subjectivity in Earnings Call Transcripts' QA Through Six-Dimensional Feature Analysis

Title: Visualizing attention zones in machine reading comprehension models

Title: Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA

Title: Combining Domain-Specific Models and LLMs for Automated Disease Phenotyping from Survey Data

Title: DisasterQA: A Benchmark for Assessing the performance of LLMs in Disaster Response

Title: Relation-based Counterfactual Data Augmentation and Contrastive Learning for Robustifying Natural Language Inference Models

Title: Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

Title: Gender Bias in LLM-generated Interview Responses

Title: ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Title: Plan$\times$RAG: Planning-guided Retrieval Augmented Generation

Title: Evaluating LLMs for Targeted Concept Simplification forDomain-Specific Texts

Title: MrT5: Dynamic Token Merging for Efficient Byte-level Language Models

Title: Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation

Title: KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation

Title: Graph-based Uncertainty Metrics for Long-form Language Model Outputs

Title: SCULPT: Systematic Tuning of Long Prompts

Title: Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training

Title: NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates

Title: LLMs are Biased Evaluators But Not Biased for Retrieval Augmented Generation

Title: A Simple Yet Effective Corpus Construction Framework for Indonesian Grammatical Error Correction

Title: Reward Modeling with Weak Supervision for Language Models

Title: AutoRAG: Automated Framework for optimization of Retrieval Augmented Generation Pipeline

Title: NeuGPT: Unified multi-modal Neural GPT

Title: Long Sequence Modeling with Attention Tensorization: From Sequence to Tensor Learning

Title: Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency

Title: Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models

Title: Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye

Title: DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning

Title: Is GPT-4 Less Politically Biased than GPT-3.5? A Renewed Investigation of ChatGPT's Political Biases

Title: FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval

Title: CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models

Title: Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring

Title: Retrieval-Enhanced Mutation Mastery: Augmenting Zero-Shot Prediction of Protein Language Model

Title: Palisade -- Prompt Injection Detection Framework

Title: SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents

Title: M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

Title: Belief in the Machine: Investigating Epistemological Blind Spots of Language Models

Title: BongLLaMA: LLaMA for Bangla Language

Title: HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation

Title: LongReward: Improving Long-context Large Language Models with AI Feedback

Title: Are BabyLMs Second Language Learners?

Title: EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Title: Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

Title: GPT-4o System Card