2024-06-06

Title: Cross-Modal Safety Alignment: Is textual unlearning all you need?

Title: Are PPO-ed Language Models Hackable?

Title: Block Transformer: Global-to-Local Language Modeling for Fast Inference

Title: Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller

Title: RATT: AThought Structure for Coherent and Correct LLMReasoning

Title: Aligning Large Language Models via Fine-grained Supervision

Title: Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities

Title: Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Title: Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes

Title: Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies

Title: Xmodel-LM Technical Report

Title: LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation

Title: NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models

Title: PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

Title: HYDRA: Model Factorization Framework for Black-Box LLM Personalization

Title: Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task

Title: Open Grounded Planning: Challenges and Benchmark Construction

Title: Improving In-Context Learning with Prediction Feedback for Sentiment Analysis

Title: MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge

Title: Adversarial Moment-Matching Distillation of Large Language Models

Title: Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models

Title: Evaluation of data inconsistency for multi-modal sentiment analysis

Title: BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents

Title: Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models

Title: From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation

Title: RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization

Title: Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework

Title: Cryptocurrency Frauds for Dummies: How ChatGPT introduces us to fraud?

Title: FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models

Title: Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation

Title: CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs

Title: StatBot.Swiss: Bilingual Open Data Exploration in Natural Language

Title: Missci: Reconstructing Fallacies in Misrepresented Science

Title: The Impossibility of Fair LLMs

Title: Bayesian WeakS-to-Strong from Text Classification to Generation

Title: ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction

Title: Error-preserving Automatic Speech Recognition of Young English Learners' Language

Title: The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches

Title: LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback

Title: IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

Title: Automating Turkish Educational Quiz Generation Using Large Language Models

Title: Cycles of Thought: Measuring LLM Confidence through Stable Explanations

Title: Are language models rational? The case of coherence norms and belief revision

Title: What is the Best Way for ChatGPT to Translate Poetry?

Title: BIPED: Pedagogically Informed Tutoring System for ESL Education

Title: Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends

Title: Wings: Learning Multimodal LLMs without Text-only Forgetting