2025-01-24

Title: Dagger Behind Smile: Fool LLMs with a Happy Ending Story

Title: MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking

Title: Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness

Title: Episodic Memories Generation and Evaluation Benchmark for Large Language Models

Title: Zero-Shot Verification-guided Chain of Thoughts

Title: Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data

Title: RAG-Reward: Optimizing RAG with Reward Modeling and RLHF

Title: RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering

Title: Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents

Title: Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers

Title: Do as We Do, Not as You Think: the Conformity of Large Language Models

Title: Can Large Language Models Understand Preferences in Personalized Recommendation?

Title: ExLM: Rethinking the Impact of $\texttt{[MASK]}$ Tokens in Masked Language Models

Title: Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models

Title: RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles

Title: LLMs Can Plan Only If We Tell Them

Title: K-COMP: Retrieval-Augmented Medical Domain Question Answering With Knowledge-Injected Compressor

Title: Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization

Title: Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Title: LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models

Title: How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization

Title: Question Answering on Patient Medical Records with Private Fine-Tuned LLMs

Title: DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale

Title: Musical ethnocentrism in Large Language Models

Title: RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation

Title: Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks

Title: UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models

Title: Do Large Language Models Truly Understand Geometric Structures?

Title: Parameter-Efficient Fine-Tuning for Foundation Models

Title: Hallucinations Can Improve Large Language Models in Drug Discovery

Title: Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing

Title: Think Outside the Data: Colonial Biases and Systemic Issues in Automated Moderation Pipelines for Low-Resource Languages

Title: A RAG-Based Institutional Assistant

Title: GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration

Title: Analysis of Indic Language Capabilities in LLMs

Title: The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

Title: CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation