2025-02-10

Title: JingFang: A Traditional Chinese Medicine Large Language Model of Expert-Level Medical Diagnosis and Syndrome Differentiation-Based Treatment

Title: Multi-Lingual Cyber Threat Detection in Tweets/X Using ML, DL, and LLM: A Comparative Analysis

Title: SCALM: Detecting Bad Practices in Smart Contracts Through LLMs

Title: Prompt-based Depth Pruning of Large Language Models

Title: Dynamic benchmarking framework for LLM-based conversational data capture

Title: CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Title: NER4all or Context is All You Need: Using LLMs for low-effort, high-performance NER on historical texts. A humanities informed approach

Title: Investigating the Robustness of Deductive Reasoning with Large Language Models

Title: CognArtive: Large Language Models for Automating Art Analysis and Decoding Aesthetic Elements

Title: Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

Title: LLM-ProS: Analyzing Large Language Models' Performance in Competitive Problem Solving

Title: Open Foundation Models in Healthcare: Challenges, Paradoxes, and Opportunities with GenAI Driven Personalized Prescription

Title: Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Title: Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives

Title: Exploring Spatial Language Grounding Through Referring Expressions

Title: MARAGE: Transferable Multi-Model Adversarial Attack for Retrieval-Augmented Generation Data Extraction

Title: LLMs can be easily Confused by Instructional Distractions

Title: An Analysis for Reasoning Bias of Language Models with Small Initialization

Title: MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf

Title: Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data

Title: Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning

Title: Sparse Autoencoders for Hypothesis Generation

Title: Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications

Title: FedP$^2$EFT: Federated Learning to Personalize Parameter Efficient Fine-Tuning for Multilingual LLMs

Title: In Praise of Stubbornness: The Case for Cognitive-Dissonance-Aware Knowledge Updates in LLMs

Title: Division-of-Thoughts: Harnessing Hybrid Language Model Synergy for Efficient On-Device Agents

Title: DECT: Harnessing LLM-assisted Fine-Grained Linguistic Knowledge and Label-Switched and Label-Preserved Data Generation for Diagnosis of Alzheimer's Disease

Title: Multimodal Medical Code Tokenizer

Title: Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Title: MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot

Title: EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models

Title: Decoding AI Judgment: How LLMs Assess News Credibility and Bias

Title: Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization

Title: Active Task Disambiguation with LLMs

Title: Building A Unified AI-centric Language System: analysis, framework and future work

Title: Multi-Agent Reinforcement Learning with Focal Diversity Optimization

Title: Verifiable Format Control for Large Language Model Generations

Title: ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization

Title: When One LLM Drools, Multi-LLM Collaboration Rules

Title: Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems

Title: Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis

Title: Linear Correlation in LM's Compositional Generalization and Hallucination

Title: Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection

Title: Contextual Gradient Flow Modeling for Large Language Model Generalization in Multi-Scale Feature Spaces

Title: TruthFlow: Truthful LLM Generation via Representation Flow Correction

Title: My LLM might Mimic AAE -- But When Should it?

Title: Extracting and Understanding the Superficial Knowledge in Alignment

Title: M-IFEval: Multilingual Instruction-Following Evaluation

Title: ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

Title: Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics?

Title: Concept Navigation and Classification via Open Source Large Language Model Processing

Title: SeDi-Instruct: Enhancing Alignment of Language Models through Self-Directed Instruction Generation

Title: Probing Internal Representations of Multi-Word Verbs in Large Language Models

Title: S$^2$-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency

Title: Developmentally-plausible Working Memory Shapes a Critical Period for Language Acquisition

Title: Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks

Title: Claim Extraction for Fact-Checking: Data, Models, and Automated Metrics

Title: SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model

Title: CoCoA: A Generalized Approach to Uncertainty Quantification by Integrating Confidence and Consistency of LLM Outputs

Title: Aligning Black-box Language Models with Human Judgments

Title: nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow

Title: ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework

Title: Flexible and Efficient Grammar-Constrained Decoding

Title: CodeSCM: Causal Analysis for Multi-Modal Code Generation

Title: Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation

Title: DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Title: NoLiMa: Long-Context Evaluation Beyond Literal Matching