2024-09-04

Title: Automating Knowledge Discovery from Scientific Literature via LLMs: A Dual-Agent Approach with Progressive Ontology Prompting

Title: Understanding Literary Texts by LLMs: A Case Study of Ancient Chinese Poetry

Title: Urban Mobility Assessment Using LLMs

Title: Learning to Plan Long-Term for Language Modeling

Title: Are LLM-based methods good enough for detecting unfair terms of service?

Title: Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering

Title: Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models

Title: Genetic Approach to Mitigate Hallucination in Generative IR

Title: On-Device Language Models: A Comprehensive Review

Title: Evaluating ChatGPT on Nuclear Domain-Specific Data

Title: Classification of Safety Events at Nuclear Sites using Large Language Models

Title: PatentGPT: A Large Language Model for Patent Drafting Using Knowledge-based Fine-tuning Method

Title: Examining Independence in Ensemble Sentiment Analysis: A Study on the Limits of Large Language Models Using the Condorcet Jury Theorem

Title: Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data

Title: Large Language Models for Disease Diagnosis: A Scoping Review

Title: Nuance Matters: Probing Epistemic Consistency in Causal Reasoning

Title: Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation

Title: Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis

Title: Toward Large Language Models as a Therapeutic Tool: Comparing Prompting Techniques to Improve GPT-Delivered Problem-Solving Therapy

Title: When All Options Are Wrong: Evaluating Large Language Model Robustness with Incorrect Multiple-Choice Options

Title: FedMCP: Parameter-Efficient Federated Learning with Model-Contrastive Personalization

Title: ConCSE: Unified Contrastive Learning and Augmentation for Code-Switched Embeddings

Title: Can AI Replace Human Subjects? A Large-Scale Replication of Psychological Experiments with LLMs

Title: Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems

Title: A Survey for Large Language Models in Biomedicine

Title: HoneyComb: A Flexible LLM-Based Agent System for Materials Science

Title: PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action

Title: Dynamic Depth Decoding: Faster Speculative Decoding for LLMs

Title: MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Title: Speaker Tagging Correction With Non-Autoregressive Language Models

Title: Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder

Title: LLMs hallucinate graphs too: a structural perspective

Title: Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback

Title: The creative psychometric item generator: a framework for item generation and validation using large language models

Title: Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs

Title: Enhancing Document-level Argument Extraction with Definition-augmented Heuristic-driven Prompting for LLMs

Title: ProGRes: Prompted Generative Rescoring on ASR n-Best

Title: Can Large Language Models Address Open-Target Stance Detection?

Title: Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data

Title: DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity

Title: Leveraging a Cognitive Model to Measure Subjective Similarity of Human and GPT-4 Written Content

Title: OnlySportsLM: Optimizing Sports-Domain Language Models with SOTA Performance under Billion Parameters

Title: REFFLY: Melody-Constrained Lyrics Editing Model

Title: From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education

Title: WikiCausal: Corpus and Evaluation Framework for Causal Knowledge Graph Construction

Title: Evaluating the Effectiveness of Large Language Models in Representing and Understanding Movement Trajectories

Title: Does Alignment Tuning Really Break LLMs' Internal Confidence?

Title: Predicting the Target Word of Game-playing Conversations using a Low-Rank Dialect Adapter for Decoder Models

Title: An Empirical Study on Information Extraction using Large Language Models

Title: Rethinking Backdoor Detection Evaluation for Language Models

Title: LongRecipe: Recipe for Efficient Long Context Generalization in Large Languge Models

Title: Post-OCR Text Correction for Bulgarian Historical Documents

Title: Large Language Models-Enabled Digital Twins for Precision Medicine in Rare Gynecological Tumors

Title: Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness

Title: Learning to Ask: When LLMs Meet Unclear Instruction

Title: Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models

Title: TinyAgent: Function Calling at the Edge

Title: Does Knowledge Localization Hold True? Surprising Differences Between Entity and Relation Perspectives in Language Models

Title: Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation

Title: Generating Media Background Checks for Automated Source Critical Reasoning

Title: The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs

Title: Modeling Text-Label Alignment for Hierarchical Text Classification

Title: Comparing Discrete and Continuous Space LLMs for Speech Recognition

Title: LanguaShrink: Reducing Token Overhead with Psycholinguistics

Title: Harnessing the Power of Semi-Structured Knowledge and LLMs with Triplet-Based Prefiltering for Question Answering

Title: Self-evolving Agents with reflective and memory-augmented abilities

Title: User-Specific Dialogue Generation with User Profile-Aware Pre-Training Model and Parameter-Efficient Fine-Tuning

Title: Self-Judge: Selective Instruction Following with Alignment Self-Evaluation

Title: Large Language Models for Automatic Detection of Sensitive Topics

Title: What does it take to get state of the art in simultaneous speech-to-speech translation?

Title: DataSculpt: Crafting Data Landscapes for LLM Post-Training through Multi-objective Partitioning

Title: Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning

Title: NYK-MS: A Well-annotated Multi-modal Metaphor and Sarcasm Understanding Benchmark on Cartoon-Caption Dataset

Title: A Perspective on Literary Metaphor in the Context of Generative AI

Title: Pre-Trained Language Models for Keyphrase Prediction: A Review

Title: Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Title: THInC: A Theory-Driven Framework for Computational Humor Detection

Title: Path-Consistency: Prefix Enhancement for Efficient Inference in LLM

Title: Language Models Benefit from Preparation with Elicited Knowledge

Title: CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification

Title: GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI

Title: PoliPrompt: A High-Performance Cost-Effective LLM-Based Text Classification Framework for Political Science

Title: Masked Mixers for Language Generation and Retrieval

Title: The Compressor-Retriever Architecture for Language Model OS

Title: DiversityMedQA: Assessing Demographic Biases in Medical Diagnosis using Large Language Models

Title: S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners

Title: It is Time to Develop an Auditing Framework to Promote Value Aware Chatbots

Title: Self-Instructed Derived Prompt Generation Meets In-Context Learning: Unlocking New Potential of Black-Box LLMs

Title: Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture

Title: An Implementation of Werewolf Agent That does not Truly Trust LLMs

Title: AdaComp: Extractive Context Compression with Adaptive Predictor for Retrieval-Augmented Large Language Models

Title: Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models

Title: Booster: Tackling Harmful Fine-tuing for Large Language Models via Attenuating Harmful Perturbation

Title: From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning

Title: Interpreting and Improving Large Language Models in Arithmetic Calculation

Title: In Defense of RAG in the Era of Long-Context Language Models

Title: LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection

Title: Training on the Benchmark Is Not All You Need

Title: Dialogue You Can Trust: Human and AI Perspectives on Generated Conversations

Title: AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

Title: Investigating Expert-in-the-Loop LLM Discourse Patterns for Ancient Intertextual Analysis

Title: What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices

Title: Towards Leveraging Large Language Models for Automated Medical Q&A Evaluation

Title: FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Title: BEAVER: An Enterprise Benchmark for Text-to-SQL

Title: OLMoE: Open Mixture-of-Experts Language Models

Title: Spinning the Golden Thread: Benchmarking Long-Form Generation in Language Models

Title: Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text

Title: CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation