2025-03-11

Title: What Are They Filtering Out? A Survey of Filtering Strategies for Harm Reduction in Pretraining Datasets

Title: Graph Masked Language Models

Title: Medical Hallucinations in Foundation Models and Their Impact on Healthcare

Title: FedMentalCare: Towards Privacy-Preserving Fine-Tuned LLMs to Analyze Mental Health Status Using Federated Learning Framework

Title: Extracting and Emulsifying Cultural Explanation to Improve Multilingual Capability of LLMs

Title: This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs

Title: QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation

Title: MastermindEval: A Simple But Scalable Reasoning Benchmark

Title: From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

Title: IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining

Title: DETQUS: Decomposition-Enhanced Transformers for QUery-focused Summarization

Title: SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc

Title: SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs

Title: Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models

Title: GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Title: SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

Title: Mitigating Memorization in LLMs using Activation Steering

Title: Constructions are Revealed in Word Distributions

Title: Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases

Title: A Survey on Post-training of Large Language Models

Title: GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images

Title: Towards Conversational AI for Disease Management

Title: Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective

Title: Evaluating Discourse Cohesion in Pre-trained Language Models

Title: GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Title: Sample-aware Adaptive Structured Pruning for Large Language Models

Title: CUPCase: Clinically Uncommon Patient Cases and Diagnoses Dataset

Title: Text-Speech Language Models with Improved Cross-Modal Transfer by Aligning Abstraction Levels

Title: KnowLogic: A Benchmark for Commonsense Reasoning via Knowledge-Driven Data Synthesis

Title: Integrating Chain-of-Thought for Multimodal Alignment: A Study on 3D Vision-Language Learning

Title: IteRABRe: Iterative Recovery-Aided Block Reduction

Title: States of LLM-generated Texts and Phase Transitions between them

Title: How LLMs Learn: Tracing Internal Representations with Sparse Autoencoders

Title: Training LLM-based Tutors to Improve Student Learning Outcomes in Dialogues

Title: Graph Retrieval-Augmented LLM for Conversational Recommendation Systems

Title: VisualSimpleQA: A Benchmark for Decoupled Evaluation of Large Vision-Language Models in Fact-Seeking Question Answering

Title: GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks

Title: SafeSpeech: A Comprehensive and Interactive Tool for Analysing Sexist and Abusive Language in Conversations

Title: BingoGuard: LLM Content Moderation Tools with Risk Levels

Title: WildIFEval: Instruction Following in the Wild

Title: Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Title: Enhancing NLP Robustness and Generalization through LLM-Generated Contrast Sets: A Scalable Framework for Systematic Evaluation and Adversarial Training

Title: InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Title: PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Title: Alignment for Efficient Tool Calling of Large Language Models

Title: Delusions of Large Language Models

Title: Gender Encoding Patterns in Pretrained Language Model Representations

Title: Effectiveness of Zero-shot-CoT in Japanese Prompts

Title: Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators

Title: Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting

Title: On the Mutual Influence of Gender and Occupation in LLM Representations

Title: Enhanced Multi-Tuple Extraction for Alloys: Integrating Pointer Networks and Augmented Attention

Title: Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation

Title: KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus

Title: Effect of Selection Format on LLM Performance

Title: Lshan-1.0 Technical Report

Title: CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation

Title: Exploring Multimodal Perception in Large Language Models Through Perceptual Strength Ratings

Title: Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations

Title: Large Language Models Often Say One Thing and Do Another

Title: Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

Title: Multimodal Human-AI Synergy for Medical Imaging Quality Control: A Hybrid Intelligence Framework with Adaptive Dataset Curation and Closed-Loop Evaluation

Title: Bot Wars Evolved: Orchestrating Competing LLMs in a Counterstrike Against Phone Scams

Title: TCM-3CEval: A Triaxial Benchmark for Assessing Responses from Large Language Models in Traditional Chinese Medicine

Title: DatawiseAgent: A Notebook-Centric LLM Agent Framework for Automated Data Science

Title: DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Title: Linguistic Knowledge Transfer Learning for Speech Enhancement

Title: A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images

Title: ASTRA: A Negotiation Agent with Adaptive and Strategic Reasoning through Action in Dynamic Offer Optimization

Title: Application of Multiple Chain-of-Thought in Contrastive Reasoning for Implicit Sentiment Analysis

Title: MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark

Title: DeFine: A Decomposed and Fine-Grained Annotated Dataset for Long-form Article Generation

Title: Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems

Title: LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation

Title: A Graph-based Verification Framework for Fact-Checking

Title: Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies

Title: Assessing the Macro and Micro Effects of Random Seeds on Fine-Tuning Large Language Models

Title: RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing

Title: Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

Title: Revisiting Noise in Natural Language Processing for Computational Social Science

Title: LLMs syntactically adapt their language use to their conversational partner

Title: MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Title: Language Models Fail to Introspect About Their Knowledge of Language

Title: TokenButler: Token Importance is Predictable

Title: XIFBench: Evaluating Large Language Models on Multilingual Instruction Following

Title: KSOD: Knowledge Supplement for LLMs On Demand

Title: Detection Avoidance Techniques for Large Language Models

Title: Implicit Reasoning in Transformers is Reasoning through Shortcuts

Title: SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models