2024-12-17

Title: Reinforcement Learning Enhanced LLMs: A Survey

Title: Evaluating Robustness of LLMs on Crisis-Related Microblogs across Events, Information Types, and Linguistic Features

Title: Exploring Complex Mental Health Symptoms via Classifying Social Media Data with Explainable LLMs

Title: Generative Adversarial Reviews: When LLMs Become the Critic

Title: SUPERMERGE: An Approach For Gradient-Based Model Merging

Title: Leveraging Audio and Text Modalities in Mental Health: A Study of LLMs Performance

Title: Constrained Decoding with Speculative Lookaheads

Title: AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework

Title: Look Before You Leap: Enhancing Attention and Vigilance Regarding Harmful Content with GuidelineLLM

Title: LLM-AS-AN-INTERVIEWER: Beyond Static Testing Through Dynamic LLM Evaluation

Title: Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation

Title: Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering

Title: Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection

Title: NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language

Title: On Adversarial Robustness and Out-of-Distribution Robustness of Large Language Models

Title: Too Big to Fool: Resisting Deception in Language Models

Title: Evidence Contextualization and Counterfactual Attribution for Conversational QA over Heterogeneous Data with RAG Systems

Title: WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models

Title: Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data

Title: Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation

Title: Inference Scaling for Bridging Retrieval and Augmented Generation

Title: Learning to Verify Summary Facts with Fine-Grained LLM Feedback

Title: VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Title: HITgram: A Platform for Experimenting with n-gram Language Models

Title: WEPO: Web Element Preference Optimization for LLM-based Web Navigation

Title: Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages

Title: FinGPT: Enhancing Sentiment-Based Stock Movement Prediction with Dissemination-Aware and Context-Enriched LLMs

Title: Rethinking Chain-of-Thought from the Perspective of Self-Training

Title: Large Language Models for Medical Forecasting -- Foresight 2

Title: BgGPT 1.0: Extending English-centric LLMs to other languages

Title: SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

Title: LLMs-in-the-Loop Part 2: Expert Small AI Models for Anonymization and De-identification of PHI Across Multiple Languages

Title: Tokens, the oft-overlooked appetizer: Large language models, the distributional hypothesis, and meaning

Title: Enhancing Discoverability in Enterprise Conversational Systems with Proactive Question Suggestions

Title: Can LLMs Help Create Grammar?: Automating Grammar Creation for Endangered Languages with In-Context Learning

Title: A Contextualized BERT model for Knowledge Graph Completion

Title: Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models

Title: NITRO: LLM Inference on Intel Laptop NPUs

Title: AD-LLM: Benchmarking Large Language Models for Anomaly Detection

Title: The Superalignment of Superhuman Intelligence with Large Language Models

Title: Cultural Palette: Pluralising Culture Alignment via Multi-agent Palette

Title: Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

Title: Task-Oriented Dialog Systems for the Senegalese Wolof Language

Title: Smaller Language Models Are Better Instruction Evolvers

Title: Beyond Discrete Personas: Personality Modeling Through Journal Intensive Conversations

Title: CATER: Leveraging LLM to Pioneer a Multidimensional, Reference-Independent Paradigm in Translation Quality Evaluation

Title: Sequence-Level Analysis of Leakage Risk of Training Data in Large Language Models

Title: Reliable, Reproducible, and Really Fast Leaderboards with Evalica

Title: RoLargeSum: A Large Dialect-Aware Romanian News Dataset for Summary, Headline, and Keyword Generation

Title: Generics are puzzling. Can language models find the missing piece?

Title: Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models

Title: Can AI Extract Antecedent Factors of Human Trust in AI? An Application of Information Extraction for Scientific Literature in Behavioural and Computer Sciences

Title: ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data

Title: Why Does ChatGPT "Delve" So Much? Exploring the Sources of Lexical Overrepresentation in Large Language Models

Title: INTERACT: Enabling Interactive, Question-Driven Learning in Large Language Models

Title: Biased or Flawed? Mitigating Stereotypes in Generative Language Models by Addressing Task-Specific Flaws

Title: ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning

Title: Optimized Quran Passage Retrieval Using an Expanded QA Dataset and Fine-Tuned Language Models

Title: ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models

Title: Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models

Title: Understanding Knowledge Hijack Mechanism in In-context Learning through Associative Memory

Title: FTP: A Fine-grained Token-wise Pruner for Large Language Models via Token Routing

Title: Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection

Title: DART: An AIGT Detector using AMR of Rephrased Text

Title: Let your LLM generate a few tokens and you will reduce the need for retrieval

Title: Towards a Speech Foundation Model for Singapore and Beyond

Title: Token Prepending: A Training-Free Approach for Eliciting Better Sentence Embeddings from LLMs

Title: The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction

Title: SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Title: MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation

Title: Fool Me, Fool Me: User Attitudes Toward LLM Falsehoods

Title: SE-GCL: An Event-Based Simple and Effective Graph Contrastive Learning for Text Representation

Title: Self-Adaptive Paraphrasing and Preference Learning for Improved Claim Verifiability

Title: C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness

Title: BioBridge: Unified Bio-Embedding with Bridging Modality in Code-Switched EMR

Title: Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach

Title: Multilingual and Explainable Text Detoxification with Parallel Corpora

Title: CoinMath: Harnessing the Power of Coding Instruction for Math LLMs

Title: Vocabulary Expansion of Chat Models with Unlabeled Target Language Data

Title: MiMoTable: A Multi-scale Spreadsheet Benchmark with Meta Operations for Table Reasoning

Title: Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework

Title: LLMs Can Simulate Standardized Patients via Agent Coevolution

Title: Personalized LLM for Generating Customized Responses to the Same Query from Different Users

Title: CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation

Title: Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties

Title: QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs

Title: UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models

Title: EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents

Title: Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation

Title: Improved Models for Media Bias Detection and Subcategorization

Title: A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection

Title: Using Instruction-Tuned Large Language Models to Identify Indicators of Vulnerability in Police Incident Narratives

Title: Can Language Models Rival Mathematics Students? Evaluating Mathematical Reasoning through Textual Manipulation and Human Experiments

Title: CharacterBench: Benchmarking Character Customization of Large Language Models

Title: RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Title: PICLe: Pseudo-Annotations for In-Context Learning in Low-Resource Named Entity Detection

Title: A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Title: Precise Length Control in Large Language Models

Title: The Impact of Token Granularity on the Predictive Power of Language Model Surprisal

Title: Inferring Functionality of Attention Heads from their Parameters

Title: DARWIN 1.5: Large Language Models as Materials Science Adapted Learners

Title: SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset Generation

Title: ExecRepoBench: Multi-level Executable Code Completion Evaluation

Title: LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts

Title: The Open Source Advantage in Large Language Models (LLMs)

Title: How Private are Language Models in Abstractive Summarization?

Title: Making FETCH! Happen: Finding Emergent Dog Whistles Through Common Habitats

Title: SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator