2025-02-05

Title: Large Language Models' Accuracy in Emulating Human Experts' Evaluation of Public Sentiments about Heated Tobacco Products on Social Media

Title: Speculative Ensemble: Fast Large Language Model Ensemble via Speculation

Title: Explainable AI for Sentiment Analysis of Human Metapneumovirus (HMPV) Using XLNet

Title: Benchmark on Peer Review Toxic Detection: A Challenging Task with a New Dataset

Title: LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient

Title: Agent-Based Uncertainty Awareness Improves Automated Radiology Report Labeling with an Open-Source Large Language Model

Title: BARE: Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation

Title: Evaluation of Large Language Models via Coupled Token Generation

Title: On Bob Dylan: A Computational Perspective

Title: SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models

Title: Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement

Title: Conceptual Metaphor Theory as a Prompting Paradigm for Large Language Models

Title: PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

Title: Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Title: Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning

Title: CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

Title: Gradient-Regularized Latent Space Modulation in Large Language Models for Structured Contextual Synthesis

Title: Can LLMs Assist Annotators in Identifying Morality Frames? -- Case Study on Vaccination Debate on Social Media

Title: Wavelet-based Positional Representation for Long Context

Title: Reasoning Bias of Next Token Prediction Training

Title: Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study

Title: M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference

Title: Contextual Memory Reweaving in Large Language Models Using Layered Latent State Reconstruction

Title: ASCenD-BDS: Adaptable, Stochastic and Context-aware framework for Detection of Bias, Discrimination and Stereotyping

Title: Rethinking stance detection: A theoretically-informed research agenda for user-level inference using language models

Title: LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information

Title: Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge

Title: When Dimensionality Hurts: The Role of LLM Embedding Compression for Noisy Regression Tasks

Title: Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation

Title: Evalita-LLM: Benchmarking Large Language Models on Italian

Title: Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Title: Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs

Title: STAIR: Improving Safety Alignment with Introspective Reasoning

Title: CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning

Title: Activation-Informed Merging of Large Language Models

Title: Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models

Title: Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study

Title: SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency

Title: Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Title: Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Title: Adaptive Self-improvement LLM Agentic System for ML Library Development

Title: Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement

Title: A comparison of translation performance between DeepL and Supertext