2025-06-18

Title: ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries

Title: Investigating the interaction of linguistic and mathematical reasoning in language models using multilingual number puzzles

Title: VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training

Title: EmoNews: A Spoken Dialogue System for Expressive News Conversations

Title: Alignment Quality Index (AQI): Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations

Title: ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection

Title: Are manual annotations necessary for statutory interpretations retrieval?

Title: AI shares emotion with humans across languages and cultures

Title: Lost in the Mix: Evaluating LLM Understanding of Code-Switched Text

Title: MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Title: Ace-CEFR -- A Dataset for Automated Evaluation of the Linguistic Difficulty of Conversational Texts for LLM Applications

Title: Abstract Meaning Representation for Hospital Discharge Summarization

Title: Essential-Web v1.0: 24T tokens of organized web data

Title: Sampling from Your Language Model One Byte at a Time

Title: DCRM: A Heuristic to Measure Response Pair Quality in Preference Optimization

Title: S$^4$C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models

Title: MIST: Towards Multi-dimensional Implicit Bias and Stereotype Evaluation of LLMs via Theory of Mind

Title: GRAM: A Generative Foundation Reward Model for Reward Generalization

Title: MAS-LitEval: Multi-Agent System for Literary Translation Quality Assessment

Title: ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations

Title: Intended Target Identification for Anomia Patients with Gradient-based Selective Augmentation

Title: AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

Title: Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation

Title: Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team

Title: Re-Initialization Token Learning for Tool-Augmented Large Language Models

Title: From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents

Title: Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent

Title: Evaluation Should Not Ignore Variation: On the Impact of Reference Set Choice on Summarization Metrics

Title: A Vision for Geo-Temporal Deep Research Systems: Towards Comprehensive, Transparent, and Reproducible Geo-Temporal Information Synthesis

Title: ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection

Title: Thunder-NUBench: A Benchmark for LLMs' Sentence-Level Negation Understanding

Title: ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge

Title: LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Title: How Far Can LLMs Improve from Experience? Measuring Test-Time Learning Ability in LLMs with Human Comparison

Title: LexiMark: Robust Watermarking via Lexical Substitutions to Enhance Membership Verification of an LLM's Textual Training Data

Title: LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops

Title: M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models

Title: AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs

Title: GenerationPrograms: Fine-grained Attribution with Executable Programs

Title: Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees

Title: When Does Meaning Backfire? Investigating the Role of AMRs in NLI

Title: Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models

Title: AIn't Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation

Title: Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot

Title: Passing the Turing Test in Political Discourse: Fine-Tuning LLMs to Mimic Polarized Social Media Comments

Title: GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors

Title: Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality

Title: Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Title: Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Title: Reasoning with Exploration: An Entropy Perspective

Title: From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

Title: A Variational Framework for Improving Naturalness in Generative Spoken Language Models