2024-10-30

Title: Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges

Title: Decoding Diffusion: A Scalable Framework for Unsupervised Analysis of Latent Space Biases and Representations Using Natural Language Prompts

Title: Mathematical Derivation Graphs: A Task for Summarizing Equation Dependencies in STEM Manuscripts

Title: LLM Robustness Against Misinformation in Biomedical Question Answering

Title: Fine-tuned Large Language Models (LLMs): Improved Prompt Injection Attacks Detection

Title: FinTeamExperts: Role Specialized MOEs For Financial Analysis

Title: Large Language Model Benchmarks in Medical Tasks

Title: LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment

Title: Causal Interventions on Causal Paths: Mapping GPT-2's Reasoning From Syntax to Semantics

Title: Energy-Based Diffusion Language Models for Text Generation

Title: Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games

Title: A Survey on Automatic Credibility Assessment of Textual Credibility Signals in the Era of Large Language Models

Title: CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart

Title: UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Title: Estimating Causal Effects of Text Interventions Leveraging LLMs

Title: TransformLLM: Adapting Large Language Models via LLM-Transformed Reading Comprehension Text

Title: SpeechQE: Estimating the Quality of Direct Speech Translation

Title: Can Large Language Models Act as Symbolic Reasoners?

Title: RoBIn: A Transformer-Based Model For Risk Of Bias Inference With Machine Reading Comprehension

Title: SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval

Title: Efficient Training of Sparse Autoencoders for Large Language Models via Layer Groups

Title: Unveiling Context-Aware Criteria in Self-Assessing LLMs

Title: MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression

Title: Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense

Title: Reducing the Scope of Language Models with Circuit Breakers

Title: MCPDial: A Minecraft Persona-driven Dialogue Dataset

Title: Are Paraphrases Generated by Large Language Models Invertible?

Title: $f$-PO: Generalizing Preference Optimization with $f$-divergence Minimization

Title: CFSafety: Comprehensive Fine-grained Safety Assessment for LLMs

Title: A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution

Title: Let's Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models

Title: Enhancing Financial Question Answering with a Multi-Agent Reflection Framework

Title: Learning and Unlearning of Fabricated Knowledge in Language Models

Title: Leveraging LLMs for Hypothetical Deduction in Logical Inference: A Neuro-Symbolic Approach

Title: Enhancing Adversarial Attacks through Chain of Thought

Title: SimSiam Naming Game: A Unified Approach for Representation Learning and Emergent Communication

Title: Self-Preference Bias in LLM-as-a-Judge

Title: Multi-aspect Depression Severity Assessment via Inductive Dialogue System

Title: Improving In-Context Learning with Small Language Model Ensembles

Title: SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Title: Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications

Title: SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types

Title: Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation

Title: Are VLMs Really Blind

Title: Distinguishing Ignorance from Error in LLM Hallucinations

Title: Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench

Title: The Impact of Inference Acceleration Strategies on Bias of LLMs

Title: AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts

Title: Benchmarking LLM Guardrails in Handling Multilingual Toxicity

Title: ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding

Title: DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers

Title: FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

Title: From melodic note sequences to pitches using word2vec

Title: Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning

Title: Natural Language Inference Improves Compositionality in Vision-Language Models

Title: Understanding Synthetic Context Extension via Retrieval Heads