2025-07-09

Title: TokenShapley: Token Level Context Attribution with Shapley Value

Title: User Behavior Prediction as a Generic, Robust, Scalable, and Low-Cost Evaluation Strategy for Estimating Generalization in LLMs

Title: Beyond classical and contemporary models: a transformative AI framework for student dropout prediction in distance learning using RAG, prompt engineering, and cross-modal fusion

Title: LCDS: A Logic-Controlled Discharge Summary Generation System Supporting Source Attribution and Expert Review

Title: MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents

Title: LoRA-Augmented Generation (LAG) for Knowledge-Intensive Language Tasks

Title: On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study

Title: The Generalization Ridge: Information Flow in Natural Language Generation

Title: Controlling What You Share: Assessing Language Model Adherence to Privacy Preferences

Title: Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning

Title: "Lost-in-the-Later": Framework for Quantifying Contextual Grounding in Large Language Models

Title: PhoniTale: Phonologically Grounded Mnemonic Generation for Typologically Distant Language Pairs

Title: On the Semantics of Large Language Models

Title: ModelCitizens: Representing Community Voices in Online Safety

Title: Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications

Title: Enhancing Test-Time Scaling of Large Language Models with Hierarchical Retrieval-Augmented MCTS

Title: Self-Review Framework for Enhancing Instruction Following Capability of LLM

Title: Flipping Knowledge Distillation: Leveraging Small Models' Expertise to Enhance LLMs in Text Matching

Title: SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression

Title: ECom-Bench: Can LLM Agent Resolve Real-World E-commerce Customer Support Issues?

Title: Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs

Title: Agentic-R1: Distilled Dual-Strategy Reasoning

Title: DRAGON: Dynamic RAG Benchmark On News

Title: HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation

Title: Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition

Title: GPTKB v1.5: A Massive Knowledge Base for Exploring Factual LLM Knowledge

Title: DocTalk: Scalable Graph-based Dialogue Synthesis for Enhancing LLM Conversational Capabilities

Title: Flippi: End To End GenAI Assistant for E-Commerce

Title: Bridging Perception and Language: A Systematic Benchmark for LVLMs' Understanding of Amodal Completion Reports

Title: Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

Title: Few-shot text-based emotion detection

Title: Chat-Ghosting: A Comparative Study of Methods for Auto-Completion in Dialog Systems

Title: OpenFActScore: Open-Source Atomic Evaluation of Factuality in Text Generation

Title: RabakBench: Scaling Human Annotations to Construct Localized Multilingual Safety Benchmarks for Low-Resource Languages

Title: Evolution without Large Models: Training Language Model with Task Principles

Title: DocIE@XLLM25: In-Context Learning for Information Extraction using Fully Synthetic Demonstrations

Title: Conditional Multi-Stage Failure Recovery for Embodied Agents

Title: Entropy-Memorization Law: Evaluating Memorization Difficulty of Data in LLMs

Title: A Survey on Prompt Tuning

Title: NeoBabel: A Multilingual Open Tower for Visual Generation

Title: Coding Triangle: How Does Large Language Model Understand Code?

Title: Skywork-R1V3 Technical Report

Title: CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Title: DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data Augmentation

Title: UQLM: A Python Package for Uncertainty Quantification in Large Language Models

Title: A Survey on Latent Reasoning

Title: DS@GT at CheckThat! 2025: Ensemble Methods for Detection of Scientific Discourse on Social Media

Title: Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers

Title: Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving