2025-04-17

Title: SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Title: ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Title: Higher-Order Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions

Title: Enhancing Web Agents with Explicit Rollback Mechanisms

Title: Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification

Title: Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture

Title: ARWI: Arabic Write and Improve

Title: Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation

Title: Could Thinking Multilingually Empower LLM Reasoning?

Title: FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations

Title: Finding Flawed Fictions: Evaluating Complex Reasoning in Language Models via Plot Hole Detection

Title: An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation

Title: Robust and Fine-Grained Detection of AI Generated Texts

Title: LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA

Title: SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes

Title: Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems

Title: Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection

Title: Gauging Overprecision in LLMs: An Empirical Study

Title: Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation

Title: Multilingual Contextualization of Large Language Models for Document-Level Machine Translation

Title: Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification

Title: SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data

Title: What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure

Title: d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Title: BitNet b1.58 2B4T Technical Report