2025-08-01

Title: Large Language Models in the Travel Domain: An Industrial Experience

Title: ElectriQ: A Benchmark for Assessing the Response Capability of Large Language Models in Power Marketing

Title: A Language Model-Driven Semi-Supervised Ensemble Framework for Illicit Market Detection Across Deep/Dark Web and Social Platforms

Title: A Hybrid Framework for Subject Analysis: Integrating Embedding-Based Regression Models with Large Language Models

Title: Theoretical Foundations and Mitigation of Hallucination in Large Language Models

Title: Reading Between the Timelines: RAG for Answering Diachronic Questions

Title: Semantic Convergence: Investigating Shared Representations Across Scaled LLMs

Title: A novel language model for predicting serious adverse event results in clinical trials from their prospective registrations

Title: Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey

Title: Fast and Accurate Contextual Knowledge Extraction Using Cascading Language Model Chains and Candidate Answers

Title: Predicting stock prices with ChatGPT-annotated Reddit sentiment

Title: How and Where to Translate? The Impact of Translation Strategies in Cross-lingual LLM Prompting

Title: Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents

Title: PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation

Title: How does Chain of Thought Think? Mechanistic Interpretability of Chain-of-Thought Reasoning with Sparse Autoencoding

Title: EH-Benchmark Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow

Title: Protecting Vulnerable Voices: Synthetic Dataset Generation for Self-Disclosure Detection

Title: Enhancing RAG Efficiency with Adaptive Context Compression

Title: FinMarBa: A Market-Informed Dataset for Financial Sentiment Classification

Title: Augmented Vision-Language Models: A Systematic Review

Title: Trusted Knowledge Extraction for Operations and Maintenance Intelligence

Title: Evaluating Large Language Models (LLMs) in Financial NLP: A Comparative Study on Financial Report Analysis

Title: CoE-Ops: Collaboration of LLM-based Experts for AIOps Question-Answering

Title: A Graph-based Approach for Multi-Modal Question Answering from Flowcharts in Telecom Documents

Title: Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes

Title: C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations

Title: Math Natural Language Inference: this should be easy!

Title: Exploring In-Context Learning for Frame-Semantic Parsing

Title: Context-aware Rotary Position Embedding

Title: SMART-Editor: A Multi-Agent Framework for Human-Like Design Editing with Structural Integrity

Title: RASL: Retrieval Augmented Schema Linking for Massive Database Text-to-SQL

Title: Uncovering the Fragility of Trustworthy LLMs through Chinese Textual Ambiguity

Title: ISO-Bench: Benchmarking Multimodal Causal Reasoning in Visual-Language Models through Procedural Plans

Title: User Feedback in Human-LLM Dialogues: A Lens to Understand Users But Noisy as a Learning Signal

Title: LENS: Learning Ensemble Confidence from Neural States for Multi-LLM Answer Integration

Title: Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks

Title: Failures Are the Stepping Stones to Success: Enhancing Few-Shot In-Context Learning by Leveraging Negative Samples

Title: Model Directions, Not Words: Mechanistic Topic Models Using Sparse Autoencoders

Title: Enabling Few-Shot Alzheimer's Disease Diagnosis on Tabular Biomarker Data with LLMs

Title: P-ReMIS: Pragmatic Reasoning in Mental Health and a Social Implication

Title: Evaluating LLMs' Multilingual Capabilities for Bengali: Benchmark Creation and Performance Analysis

Title: Unveiling Super Experts in Mixture-of-Experts Large Language Models

Title: What's Taboo for You? - An Empirical Evaluation of LLMs Behavior Toward Sensitive Content

Title: MUST-RAG: MUSical Text Question Answering with Retrieval Augmented Generation

Title: Text-to-SQL Task-oriented Dialogue Ontology Construction

Title: MPCC: A Novel Benchmark for Multimodal Planning with Complex Constraints in Multimodal Large Language Models

Title: Causal2Vec: Improving Decoder-only LLMs as Versatile Embedding Models

Title: Beyond the Cloud: Assessing the Benefits and Drawbacks of Local LLM Deployment for Translators

Title: MRGSEM-Sum: An Unsupervised Multi-document Summarization Framework based on Multi-Relational Graphs and Structural Entropy Minimization

Title: Enhanced Arabic Text Retrieval with Attentive Relevance Scoring

Title: Role-Aware Language Models for Secure and Contextualized Access Control in Organizations

Title: A Novel Evaluation Benchmark for Medical LLMs: Illuminating Safety and Effectiveness in Clinical Domains

Title: Med-R$^3$: Enhancing Medical Retrieval-Augmented Reasoning of LLMs via Progressive Reinforcement Learning

Title: DiffLoRA: Differential Low-Rank Adapters for Large Language Models

Title: Rule2Text: Natural Language Explanation of Logical Rules in Knowledge Graphs

Title: Cascaded Information Disclosure for Generalized Evaluation of Problem Solving Capabilities