2024-06-19

Title: Reframing linguistic bootstrapping as joint inference using visually-grounded grammar induction models

Title: Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner

Title: FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure

Title: CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling

Title: LiLiuM: eBay's Large Language Models for e-commerce

Title: Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models

Title: Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

Title: MedCalc-Bench: Evaluating Large Language Models for Medical Calculations

Title: Soft Prompting for Unlearning in Large Language Models

Title: Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Title: UniGLM: Training One Unified Language Model for Text-Attributed Graphs

Title: InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States

Title: Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks

Title: Satyrn: A Platform for Analytics Augmented Generation

Title: COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

Title: When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Title: Who's asking? User personas and the mechanics of latent misalignment

Title: End-to-end Text-to-SQL Generation within an Analytics Insight Engine

Title: Can LLMs Learn Macroeconomic Narratives from Social Media?

Title: Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation

Title: Decoding the Narratives: Analyzing Personal Drug Experiences Shared on Reddit

Title: AI "News" Content Farms Are Easy to Make and Hard to Detect: A Case Study in Italian

Title: LLMs Are Prone to Fallacies in Causal Inference

Title: Exploring the Impact of a Transformer's Latent Space Geometry on Downstream Task Performance

Title: Aquila-Med LLM: Pioneering Full-Process Open-Source Medical Language Models

Title: Debate as Optimization: Adaptive Conformal Prediction and Diverse Retrieval for Event Extraction

Title: Knowledge Fusion By Evolving Weights of Language Models

Title: LLM-Oracle Machines

Title: Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions

Title: On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation

Title: ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations

Title: MCSD: An Efficient Language Model with Diverse Fusion

Title: PFID: Privacy First Inference Delegation Framework for LLMs

Title: Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning

Title: A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning

Title: Defending Against Social Engineering Attacks in the Age of LLMs

Title: Towards a Client-Centered Assessment of LLM Therapists by Client Simulation

Title: Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization

Title: SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

Title: What Matters in Learning Facts in Language Models? Multifaceted Knowledge Probing with Diverse Multi-Prompt Datasets

Title: Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

Title: Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?

Title: PRePair: Pointwise Reasoning Enhance Pairwise Evaluating for Robust Instruction-Following Assessments

Title: SNAP: Unlearning Selective Knowledge in Large Language Models with Negative Instructions

Title: Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding

Title: Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters

Title: Interpreting Bias in Large Language Models: A Feature-Based Approach

Title: Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models

Title: WebCanvas: Benchmarking Web Agents in Online Environments

Title: QOG: Question and Options Generation based on Language Model

Title: From Instance Training to Instruction Learning: Task Adapters Generation from Instructions

Title: IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language Models

Title: Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

Title: QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities

Title: Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling

Title: PDSS: A Privacy-Preserving Framework for Step-by-Step Distillation of Large Language Models

Title: Beyond Under-Alignment: Atomic Preference Enhanced Factuality Tuning for Large Language Models

Title: MMUTF: Multimodal Multimedia Event Argument Extraction with Unified Template Filling

Title: PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems

Title: PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers

Title: Abstraction-of-Thought Makes Language Models Better Reasoners

Title: Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities

Title: Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation

Title: The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions

Title: LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization

Title: Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency

Title: FuseGen: PLM Fusion for Data-generation based Zero-shot Learning

Title: Unified Active Retrieval for Retrieval Augmented Generation

Title: Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models

Title: P-Tailor: Customizing Personality Traits for Language Models via Mixture of Specialized LoRA Experts

Title: MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts

Title: RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation

Title: Applying Ensemble Methods to Model-Agnostic Machine-Generated Text Detection

Title: Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models

Title: Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling

Title: Low-Redundant Optimization for Large Language Model Alignment

Title: Bridging Local Details and Global Context in Text-Attributed Graphs

Title: What makes two models think alike?

Title: Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Title: Ask-before-Plan: Proactive Language Agents for Real-World Planning

Title: DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?

Title: Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models

Title: Evaluating Transparency of Machine Generated Fact Checking Explanations

Title: CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis

Title: Estimating Knowledge in Large Language Models Without Generating a Single Token

Title: Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping

Title: Measuring Psychological Depth in Language Models

Title: Using LLMs to Aid Annotation and Collection of Clinically-Enriched Data in Bipolar Disorder and Schizophrenia

Title: MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL

Title: Jailbreak Paradox: The Achilles' Heel of LLMs

Title: Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction

Title: AgentReview: Exploring Peer Review Dynamics with LLM Agents

Title: On the Robustness of Language Models for Tabular Question Answering

Title: Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction

Title: Large Language Model as a Universal Clinical Multi-task Decoder

Title: Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages

Title: Rationale-based Ensemble of Multiple QA Strategies for Zero-shot Knowledge-based VQA

Title: OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Title: Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba

Title: Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries

Title: UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions

Title: Generating Educational Materials with Different Levels of Readability using LLMs

Title: ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Title: Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?

Title: Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?

Title: From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Title: What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Title: LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation