2025-05-29

Title: More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models

Title: Rethinking Data Mixture for Large Language Models: A Comprehensive Survey and New Perspectives

Title: R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Title: How does Misinformation Affect Large Language Model Behaviors and Preferences?

Title: Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations

Title: Rethinking the Outlier Distribution in Large Language Models: An In-depth Study

Title: LLMPR: A Novel LLM-Driven Transfer Learning based Petition Ranking Model

Title: MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs

Title: Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Title: Assessing and Refining ChatGPT's Performance in Identifying Targeting and Inappropriate Language: A Comparative Study

Title: Counterfactual Simulatability of LLM Explanations for Generation Tasks

Title: BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum

Title: Calibrating LLM Confidence by Probing Perturbed Representation Stability

Title: VeriTrail: Closed-Domain Hallucination Detection with Traceability

Title: Principled Content Selection to Generate Diverse and Personalized Multi-Document Summaries

Title: Evaluating the Retrieval Robustness of Large Language Models

Title: EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse

Title: Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development

Title: RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

Title: Graph-Assisted Culturally Adaptable Idiomatic Translation for Indic Languages

Title: RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering

Title: Test-Time Scaling with Repeated Sampling Improves Multilingual Text Generation

Title: Resolving Knowledge Conflicts in Domain-specific Data Selection: A Case Study on Medical Instruction-tuning

Title: LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents

Title: Seeing the Threat: Vulnerabilities in Vision-Language Models to Adversarial Attack

Title: Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset

Title: Leveraging Interview-Informed LLMs to Model Survey Responses: Comparative Insights from AI-Generated and Human Data

Title: Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate

Title: Legal Assist AI: Leveraging Transformer-Based Model for Effective Legal Assistance

Title: CoThink: Token-Efficient Reasoning via Instruct Models Guiding Reasoning Models

Title: Jailbreak Distillation: Renewable Safety Benchmarking

Title: Safeguarding Privacy of Retrieval Data against Membership Inference Attacks: Is This Query Too Close to Home?

Title: Beyond path selection: Better LLMs for Scientific Information Extraction with MimicSFT and Relevance and Rule-induced(R$^2$)GRPO

Title: ArgInstruct: Specialized Instruction Fine-Tuning for Computational Argumentation

Title: Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning

Title: Knowledge Base Construction for Knowledge-Augmented Text-to-SQL

Title: MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models

Title: Curse of High Dimensionality Issue in Transformer for Long-context Modeling

Title: THINK-Bench: Evaluating Thinking Efficiency and Chain-of-Thought Quality of Large Reasoning Models

Title: Multimodal Forecasting of Sparse Intraoperative Hypotension Events Powered by Language Model

Title: Multilingual vs Crosslingual Retrieval of Fact-Checked Claims: A Tale of Two Approaches

Title: LoKI: Low-damage Knowledge Implanting of Large Language Models

Title: EULER: Enhancing the Reasoning Ability of Large Language Models through Error-Induced Learning

Title: InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

Title: Stratified Selective Sampling for Instruction Tuning with Dedicated Scoring Strategy

Title: ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments

Title: Reverse Preference Optimization for Complex Instruction Following

Title: Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design

Title: Breaking the Cloak! Unveiling Chinese Cloaked Toxicity with Homophone Graph and Toxic Lexicon

Title: Let's Predict Sentence by Sentence

Title: Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

Title: BioHopR: A Benchmark for Multi-Hop, Multi-Answer Reasoning in Biomedical Domain

Title: MRT at SemEval-2025 Task 8: Maximizing Recovery from Tables with Multiple Steps

Title: Compensating for Data with Reasoning: Low-Resource Machine Translation with LLMs

Title: Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing

Title: If Pigs Could Fly... Can LLMs Logically Reason Through Counterfactuals?

Title: Advancing Expert Specialization for Better MoE

Title: NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment

Title: Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Title: Text2Grad: Reinforcement Learning from Natural Language Feedback

Title: LLMs Struggle to Reject False Presuppositions when Misinformation Stakes are High

Title: Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

Title: RAG-Zeval: Towards Robust and Interpretable Evaluation on RAG Responses through End-to-End Rule-Guided Reasoning

Title: Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Title: EvolveSearch: An Iterative Self-Evolving Search Agent

Title: Multi-MLLM Knowledge Distillation for Out-of-Context News Detection

Title: Emotion-o1: Adaptive Long Reasoning for Emotion Understanding in LLMs

Title: ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM

Title: Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings

Title: Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems

Title: Fusion Steering: Prompt-Specific Activation Control

Title: Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts

Title: Precise In-Parameter Concept Erasure in Large Language Models

Title: Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

Title: Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Title: Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs

Title: Spatial Knowledge Graph-Guided Multimodal Synthesis

Title: Learning Composable Chains-of-Thought

Title: Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese

Title: WebDancer: Towards Autonomous Information Seeking Agency

Title: The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Title: GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning

Title: AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models