2025-05-21

Title: Evaluating Reasoning LLMs for Suicide Screening with the Columbia-Suicide Severity Rating Scale

Title: Detecting Prefix Bias in LLM-based Reward Models

Title: Source framing triggers systematic evaluation bias in Large Language Models

Title: ProdRev: A DNN framework for empowering customers using generative pre-trained transformers

Title: LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis

Title: IRLBench: A Multi-modal, Culturally Grounded, Parallel Irish-English Benchmark for Open-Ended LLM Reasoning Evaluation

Title: Noise Injection Systemically Degrades Large Language Model Safety Guardrails

Title: EcoSafeRAG: Efficient Security through Context Analysis in Retrieval-Augmented Generation

Title: Time-R1: Towards Comprehensive Temporal Reasoning in LLMs

Title: Induction Head Toxicity Mechanistically Explains Repetition Curse in Large Language Models

Title: Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression

Title: Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation

Title: CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models

Title: Are Large Language Models Good at Detecting Propaganda?

Title: SQLForge: Synthesizing Reliable and Diverse Data to Enhance Text-to-SQL Reasoning in LLMs

Title: Simulation Agent: A Framework for Integrating Simulation and Large Language Models for Enhanced Decision-Making

Title: Krikri: Advancing Open Large Language Models for Greek

Title: Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation

Title: EfficientLLM: Efficiency in Large Language Models

Title: Improve Language Model and Brain Alignment via Associative Memory

Title: Domain Gating Ensemble Networks for AI-Generated Text Detection

Title: Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning

Title: Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Title: Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLM

Title: InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

Title: Let's Verify Math Questions Step by Step

Title: Cross-Linguistic Transfer in Multilingual NLP: The Role of Language Families and Morphology

Title: EEG-to-Text Translation: A Model for Deciphering Human Brain Activity

Title: Towards Rehearsal-Free Continual Relation Extraction: Capturing Within-Task Variance with Adaptive Prompting

Title: Memory-Centric Embodied Question Answer

Title: FlashThink: An Early Exit Method For Efficient Reasoning

Title: Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability

Title: CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring

Title: Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals

Title: Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models

Title: DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models

Title: The Hallucination Tax of Reinforcement Finetuning

Title: DecIF: Improving Instruction-Following through Meta-Decomposition

Title: Social Sycophancy: A Broader Understanding of LLM Sycophancy

Title: Activation-Guided Consensus Merging for Large Language Models

Title: AUTOLAW: Enhancing Legal Compliance in Large Language Models via Case Law Generation and Jury-Inspired Deliberation

Title: From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora

Title: Improved Methods for Model Pruning and Knowledge Distillation

Title: Enhancing LLMs via High-Knowledge Data Selection

Title: BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks

Title: Gender Trouble in Language Models: An Empirical Audit Guided by Gender Performativity Theory

Title: Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering

Title: MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations

Title: Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents

Title: A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

Title: DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models

Title: Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking

Title: Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst

Title: Texts or Images? A Fine-grained Analysis on the Effectiveness of Input Representations and Models for Table Question Answering

Title: Prior Prompt Engineering for Reinforcement Fine-Tuning

Title: Temporal Alignment of Time Sensitive Facts with Activation Engineering

Title: Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models

Title: PL-FGSA: A Prompt Learning Framework for Fine-Grained Sentiment Analysis Based on MindSpore

Title: The Strawberry Problem: Emergence of Character-level Understanding in Tokenized Language Models

Title: Cheaper, Better, Faster, Stronger: Robust Text-to-SQL without Chain-of-Thought or Fine-Tuning

Title: Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits

Title: SlangDIT: Benchmarking LLMs in Interpretative Slang Translation

Title: ThinkSwitcher: When to Think Hard, When to Think Fast

Title: Unraveling Interwoven Roles of Large Language Models in Authorship Privacy: Obfuscation, Mimicking, and Verification

Title: Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks

Title: "Haet Bhasha aur Diskrimineshun": Phonetic Perturbations in Code-Mixed Hinglish to Red-Team LLMs

Title: Mechanistic Fine-tuning for In-context Learning

Title: ABBA: Highly Expressive Hadamard Product Adaptation for Large Language Models

Title: TransBench: Benchmarking Machine Translation for Industrial-Scale Applications

Title: FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation

Title: Think-J: Learning to Think for Generative LLM-as-a-Judge

Title: YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering

Title: Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs

Title: Cross-Lingual Optimization for Language Transfer in Large Language Models

Title: JOLT-SQL: Joint Loss Tuning of Text-to-SQL with Confusion-aware Noisy Schema Sampling

Title: Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency

Title: HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing

Title: A MIND for Reasoning: Meta-learning for In-context Deduction

Title: QA-prompting: Improving Summarization with Large Language Models using Question-Answering

Title: OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation

Title: WirelessMathBench: A Mathematical Modeling Benchmark for LLMs in Wireless Communications

Title: Dual Decomposition of Weights and Singular Value Low Rank Adaptation

Title: AutoRev: Automatic Peer Review System for Academic Research Papers

Title: Editing Across Languages: A Survey of Multilingual Knowledge Editing

Title: MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language

Title: Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation

Title: Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis

Title: Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents

Title: SAE-FiRE: Enhancing Earnings Surprise Predictions Through Sparse Autoencoder Feature Selection

Title: Scaling Low-Resource MT via Synthetic Data Generation with LLMs

Title: From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning

Title: Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models

Title: Creative Preference Optimization

Title: CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation

Title: Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Title: Void in Language Models

Title: Attributional Safety Failures in Large Language Models under Code-Mixed Perturbations

Title: Adapting Pretrained Language Models for Citation Classification via Self-Supervised Contrastive Learning

Title: PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models

Title: MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance

Title: Enhanced Multimodal Aspect-Based Sentiment Analysis by LLM-Generated Rationales

Title: ModRWKV: Transformer Multimodality in Linear Time

Title: Exploring Graph Representations of Logical Forms for Language Modeling

Title: Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs

Title: Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders

Title: KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

Title: TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring

Title: Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning

Title: Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning

Title: MCIP: Protecting MCP Safety via Model Contextual Integrity Protocol

Title: Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals

Title: Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models

Title: sudoLLM: On Multi-role Alignment of Language Models

Title: Language Models Optimized to Fool Detectors Still Have a Distinct Style (And How to Change It)

Title: Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models

Title: Think Only When You Need with Large Hybrid-Reasoning Models

Title: General-Reasoner: Advancing LLM Reasoning Across All Domains

Title: Reward Reasoning Model

Title: UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models

Title: Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Title: Language Models use Lookbacks to Track Beliefs