2024-12-18

Title: Frontier AI systems have surpassed the self-replicating red line

Title: Automatic Item Generation for Personality Situational Judgment Tests with Large Language Models

Title: Na'vi or Knave: Jailbreaking Language Models via Metaphorical Avatars

Title: What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis

Title: Performance of a large language model-Artificial Intelligence based chatbot for counseling patients with sexually transmitted infections and genital diseases

Title: Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation

Title: A NotSo Simple Way to Beat Simple Bench

Title: Model-diff: A Tool for Comparative Study of Language Models in the Input Space

Title: Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers

Title: Unanswerability Evaluation for Retrieval Augmented Generation

Title: Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

Title: BioRAGent: A Retrieval-Augmented Generation System for Showcasing Generative Query Expansion and Domain-Specific Search for Scientific Q&A

Title: Interpretable LLM-based Table Question Answering

Title: Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural Adjustments

Title: Assessing the Limitations of Large Language Models in Clinical Fact Decomposition

Title: Refining Dimensions for Improving Clustering-based Cross-lingual Topic Models

Title: LITA: An Efficient LLM-assisted Iterative Topic Augmentation Framework

Title: Core Context Aware Attention for Long Context Language Modeling

Title: Knowledge Boundary of Large Language Models: A Survey

Title: RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment

Title: Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script

Title: Boosting Long-Context Information Seeking via Query-Guided Activation Refilling

Title: NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning

Title: LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks

Title: Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models

Title: Can You Trust LLM Judgments? Reliability of LLM-as-a-Judge

Title: Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits

Title: Solid-SQL: Enhanced Schema-linking based In-context Learning for Robust Text-to-SQL

Title: When to Speak, When to Abstain: Contrastive Decoding with Abstention

Title: LLMCL-GEC: Advancing Grammatical Error Correction with LLM-Driven Curriculum Learning

Title: EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation

Title: Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers

Title: Evaluating Zero-Shot Multilingual Aspect-Based Sentiment Analysis with Large Language Models

Title: FCMR: Robust Evaluation of Financial Cross-Modal Multi-Hop Reasoning

Title: Process-Supervised Reward Models for Clinical Note Generation: A Scalable Approach Guided by Domain Expertise

Title: PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization

Title: LLMs are Also Effective Embedding Models: An In-depth Overview

Title: MultiLingPoT: Enhancing Mathematical Reasoning with Multilingual Program Fine-tuning

Title: SynthCypher: A Fully Synthetic Data Generation Framework for Text-to-Cypher Querying in Knowledge Graphs

Title: Jailbreaking? One Step Is Enough!

Title: Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation

Title: What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context

Title: Falcon: Faster and Parallel Inference of Large Language Models through Enhanced Semi-Autoregressive Drafting and Custom-Designed Decoding Tree

Title: LLM-based Discriminative Reasoning for Knowledge Graph Question Answering

Title: iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop

Title: Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT

Title: Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features

Title: XTransplant: A Probe into the Upper Bound Performance of Multilingual Capability and Culture Adaptability in LLMs via Mutual Cross-lingual Feed-forward Transplantation

Title: Trigger$^3$: Refining Query Correction via Adaptive Model Selector

Title: More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Title: Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion

Title: Revealing the impact of synthetic native samples and multi-tasking strategies in Hindi-English code-mixed humour and sarcasm detection

Title: Detecting Emotional Incongruity of Sarcasm by Commonsense Reasoning

Title: DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models

Title: Benchmarking and Understanding Compositional Relational Reasoning of LLMs

Title: Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models

Title: RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

Title: Question: How do Large Language Models perform on the Question Answering tasks? Answer:

Title: Truthful Text Sanitization Guided by Inference Attacks

Title: Improving Fine-grained Visual Understanding in VLMs through Text-Only Training

Title: MOPO: Multi-Objective Prompt Optimization for Affective Text Generation

Title: SnakModel: Lessons Learned from Training an Open Danish Large Language Model

Title: Adaptations of AI models for querying the LandMatrix database in natural language

Title: Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health

Title: OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Title: NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation

Title: Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach

Title: LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Title: Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election

Title: AI PERSONA: Towards Life-long Personalization of LLMs

Title: Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study

Title: Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Title: DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation