2025-10-15

Title: PHANTOM RECALL: When Familiar Puzzles Fool Smart Models

Title: R-WoM: Retrieval-augmented World Model For Computer-use Agents

Title: LLM Knowledge is Brittle: Truthfulness Representations Rely on Superficial Resemblance

Title: LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

Title: TopoAlign: A Framework for Aligning Code to Math via Topological Decomposition

Title: GRAVITY: A Framework for Personalized Text Generation via Profile-Grounded Synthetic Preferences

Title: Evaluating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries

Title: Direct Multi-Token Decoding

Title: Scaling Long-Horizon LLM Agent via Context-Folding

Title: Conjecturing: An Overlooked Step in Formal Mathematical Reasoning

Title: SAGE: A Top-Down Bottom-Up Knowledge-Grounded User Simulator for Multi-turn AGent Evaluation

Title: Generate Logical Equivalence Questions

Title: Information Extraction from Conversation Transcripts: Neuro-Symbolic vs. LLM

Title: CPR: Mitigating Large Language Model Hallucinations with Curative Prompt Refinement

Title: Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models

Title: Uncertainty Quantification for Hallucination Detection in Large Language Models: Foundations, Methodology, and Future Directions

Title: Improving Text-to-Image Generation with Input-Side Inference-Time Scaling

Title: Hierarchical Alignment: Surgical Fine-Tuning via Functional Layer Specialization in Large Language Models

Title: APCE: Adaptive Progressive Context Expansion for Long Context Processing

Title: An AI-Based Behavioral Health Safety Filter and Dataset for Identifying Mental Health Crises in Text-Based Conversations

Title: Deep Associations, High Creativity: A Simple yet Effective Metric for Evaluating Large Language Models

Title: Tracing Multilingual Knowledge Acquisition Dynamics in Domain Adaptation: A Case Study of English-Japanese Biomedical Adaptation

Title: Understanding the Modality Gap: An Empirical Study on the Speech-Text Alignment Mechanism of Large Speech Language Models

Title: SafeMT: Multi-turn Safety for Multimodal Language Models

Title: Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models

Title: A Survey on Parallel Reasoning

Title: Towards Inference-time Scaling for Continuous Space Reasoning

Title: From Knowledge to Treatment: Large Language Model Assisted Biomedical Concept Representation for Drug Repurposing

Title: Not in Sync: Unveiling Temporal Bias in Audio Chat Models

Title: DPO-Tuned Large Language Models for Segmentation in Simultaneous Speech Translation

Title: HALF: Harm-Aware LLM Fairness Evaluation Aligned with Deployment

Title: Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability

Title: DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering

Title: Shallow Robustness, Deep Vulnerabilities: Multi-Turn Evaluation of Medical LLMs

Title: A large-scale, unsupervised pipeline for automatic corpus annotation using LLMs: variation and change in the English consider construction

Title: Beating Harmful Stereotypes Through Facts: RAG-based Counter-speech Generation

Title: Fine-grained Analysis of Brain-LLM Alignment through Input Attribution

Title: LLM-REVal: Can We Trust LLM Reviewers Yet?

Title: Tokenization Disparities as Infrastructure Bias: How Subword Systems Create Inequities in LLM Access and Efficiency

Title: PRoH: Dynamic Planning and Reasoning over Knowledge Hypergraphs for Retrieval-Augmented Generation

Title: Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation

Title: Resource-sensitive but language-blind: Community size and not grammatical complexity better predicts the accuracy of Large Language Models in a novel Wug Test

Title: SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression

Title: When Personalization Tricks Detectors: The Feature-Inversion Trap in Machine-Generated Text Detection

Title: BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)

Title: VISaGE: Understanding Visual Generics and Exceptions

Title: Teaching Language Models to Faithfully Express their Uncertainty

Title: StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis

Title: ACADATA: Parallel Dataset of Academic Data for Machine Translation

Title: COSTAR-A: A prompting framework for enhancing Large Language Model performance on Point-of-View questions

Title: Reasoning Pattern Matters: Learning to Reason without Human Rationales

Title: Generation Space Size: Understanding and Calibrating Open-Endedness of LLM Generations

Title: Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception

Title: Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages

Title: Hey, wait a minute: on at-issue sensitivity in Language Models

Title: Language Models Model Language

Title: Dr.LLM: Dynamic Layer Routing in LLMs