2025-08-07

Title: How Deep Is Representational Bias in LLMs? The Cases of Caste and Religion

Title: FeynTune: Large Language Models for High-Energy Theory

Title: Intent Aware Context Retrieval for Multi-Turn Agricultural Question Answering

Title: Hierarchical Verification of Speculative Beams for Accelerating LLM Inference

Title: WINELL: Wikipedia Never-Ending Updating with LLM Agents

Title: GanitBench: A bi-lingual benchmark for evaluating mathematical reasoning in Vision Language Models

Title: AttnTrace: Attention-based Context Traceback for Long-Context LLMs

Title: Majority Bit-Aware Watermarking For Large Language Models

Title: Hallucination to Truth: A Review of Fact-Checking and Factuality Evaluation in Large Language Models

Title: An Entity Linking Agent for Question Answering

Title: Sotopia-RL: Reward Design for Social Intelligence

Title: CoAct-1: Computer-using Agents with Coding as Actions

Title: CAP-LLM: Context-Augmented Personalized Large Language Models for News Headline Generation

Title: Data and AI governance: Promoting equity, ethics, and fairness in large language models

Title: Confidence-Weighted Token Set Cover for Early Hypothesis Pruning in Self-Consistency

Title: Are Today's LLMs Ready to Explain Well-Being Concepts?

Title: Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models

Title: HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

Title: Step More: Going Beyond Single Backpropagation in Meta Learning Based Model Editing

Title: ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents

Title: Large Reasoning Models Are Autonomous Jailbreak Agents

Title: DTPA: Dynamic Token-level Prefix Augmentation for Controllable Text Generation

Title: PAIRS: Parametric-Verified Adaptive Information Retrieval and Selection for Efficient RAG

Title: Efficient Strategy for Improving Large Language Model (LLM) Capabilities

Title: ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"

Title: GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning

Title: Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks

Title: Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap

Title: Hacking Hallucinations of MLLMs with Causal Sufficiency and Necessity

Title: Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models

Title: Reasoning Beyond Labels: Measuring LLM Sentiment in Low-Resource, Culturally Nuanced Contexts

Title: Hierarchical Text Classification Using Black Box Large Language Models

Title: DP-GPT4MTS: Dual-Prompt Large Language Model for Textual-Numerical Time Series Forecasting

Title: TalkDep: Clinically Grounded LLM Personas for Conversation-Centric Depression Screening

Title: KVSink: Understanding and Enhancing the Preservation of Attention Sinks in KV Cache Quantization for LLMs

Title: ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents

Title: A Few Words Can Distort Graphs: Knowledge Poisoning Attacks on Graph-based Retrieval-Augmented Generation of Large Language Models

Title: Beyond the Leaderboard: Rethinking Medical Benchmarks for Large Language Models

Title: Modelling and Classifying the Components of a Literature Review

Title: GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy

Title: Chain of Questions: Guiding Multimodal Curiosity in Language Models

Title: AIC CTU@FEVER 8: On-premise fact checking through long context RAG

Title: Improving Crash Data Quality with Large Language Models: Evidence from Secondary Crash Narratives in Kentucky

Title: Why are LLMs' abilities emergent?

Title: Dialogue Response Prefetching Based on Semantic Similarity and Prediction Confidence of Language Model

Title: Evaluating, Synthesizing, and Enhancing for Customer Support Conversation

Title: StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion

Title: Automated Generation of Curriculum-Aligned Multiple-Choice Questions for Malaysian Secondary Mathematics Using Generative AI

Title: CALE : Concept-Aligned Embeddings for Both Within-Lemma and Inter-Lemma Sense Differentiation

Title: StyliTruth : Unlocking Stylized yet Truthful LLM Generation via Disentangled Steering

Title: Unveiling the Landscape of Clinical Depression Assessment: From Behavioral Signatures to Psychiatric Reasoning

Title: Beyond Brainstorming: What Drives High-Quality Scientific Ideas? Lessons from Multi-Agent Collaboration

Title: Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

Title: TURA: Tool-Augmented Unified Retrieval Agent for AI Search

Title: Lightweight Transformers for Zero-Shot and Fine-Tuned Text-to-SQL Generation Using Spider

Title: P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis

Title: IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Title: Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

Title: Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Title: GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay

Title: FaST: Feature-aware Sampling and Tuning for Personalized Preference Alignment with Limited Data

Title: Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis