2024-12-20

Title: Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data

Title: Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs

Title: A Survey on LLM Inference-Time Self-Improvement

Title: Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation

Title: ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling

Title: All-in-One Tuning and Structural Pruning for Domain-Specific LLMs

Title: ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study

Title: From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research

Title: Agent-SafetyBench: Evaluating the Safety of LLM Agents

Title: Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs

Title: Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs

Title: PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization

Title: Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

Title: CitaLaw: Enhancing LLM with Citations in Legal Domain

Title: CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation

Title: Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues

Title: Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning

Title: HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model

Title: How good is GPT at writing political speeches for the White House?

Title: Learning to Generate Research Idea with Dynamic Control

Title: TOMG-Bench: Evaluating LLMs on Text-based Open Molecule Generation

Title: Length Controlled Generation for Black-box LLMs

Title: Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT

Title: LLMs as mediators: Can they diagnose conflicts accurately?

Title: How to Synthesize Text Data without Model Collapse?

Title: On Verbalized Confidence Scores for LLMs

Title: Query pipeline optimization for cancer patient question answering systems

Title: PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children

Title: ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine

Title: Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Title: ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis

Title: Progressive Multimodal Reasoning via Active Retrieval

Title: DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs

Title: Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas

Title: DS$^2$-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis

Title: Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling

Title: Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Title: Why language models collapse when trained on recursively generated text

Title: Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation

Title: RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Title: Understanding the Dark Side of LLMs' Intrinsic Self-Correction

Title: Knowledge Injection via Prompt Distillation

Title: Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts

Title: LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Title: ConfliBERT: A Language Model for Political Conflict

Title: AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Title: Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability

Title: Qwen2.5 Technical Report

Title: Outcome-Refining Process Supervision for Code Generation

Title: Adaptive Pruning for Large Language Models with Structural Importance Awareness

Title: Language Models as Continuous Self-Evolving Data Engineers

Title: LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Title: Face the Facts! Evaluating RAG-based Fact-checking Pipelines in Realistic Settings

Title: MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark

Title: LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks