2025-03-10

Title: Leveraging Large Language Models For Optimized Item Categorization using UNSPSC Taxonomy

Title: WinClick: GUI Grounding with Multimodal Large Language Models

Title: DiMA: An LLM-Powered Ride-Hailing Assistant at DiDi

Title: Invisible Walls in Cities: Leveraging Large Language Models to Predict Urban Segregation Experience with Social Media Content

Title: MV-CLAM: Multi-View Molecular Interpretation with Cross-Modal Projection via Language Model

Title: Comparative Analysis Based on DeepSeek, ChatGPT, and Google Gemini: Features, Techniques, Performance, Future Prospects

Title: KunlunBaize: LLM with Multi-Scale Convolution and Multi-Token Prediction Under TransformerX Framework

Title: Mapping Trustworthiness in Large Language Models: A Bibliometric Analysis Bridging Theory to Practice

Title: Towards Anthropomorphic Conversational AI Part I: A Practical Framework

Title: AgroLLM: Connecting Farmers and Agricultural Practices through Large Language Models for Enhanced Knowledge Transfer and Practical Application

Title: Ext2Gen: Alignment through Unified Extraction and Generation for Robust Retrieval-Augmented Generation

Title: Cross-linguistic disagreement as a conflict of semantic alignment norms in multilingual AI: Linguistic Diversity as a Problem for Philosophy, Cognitive Science, and AI

Title: Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference

Title: Cyber for AI at SemEval-2025 Task 4: Forgotten but Not Lost: The Balancing Act of Selective Unlearning in Large Language Models

Title: Optimizing Multi-Hop Document Retrieval Through Intermediate Representations

Title: HoH: A Dynamic Benchmark for Evaluating the Impact of Outdated Information on Retrieval-Augmented Generation

Title: Exploring and Evaluating Multimodal Knowledge Reasoning Consistency of Multimodal Large Language Models

Title: Call for Rigor in Reporting Quality of Instruction Tuning Data

Title: Learning from Failures in Multi-Attempt Reinforcement Learning

Title: PanguIR Technical Report for NTCIR-18 AEOLLM Task

Title: Multi-Agent System for AI-Assisted Extraction of Narrative Arcs in TV Series

Title: Prompting Science Report 1: Prompt Engineering is Complicated and Contingent

Title: HeTGB: A Comprehensive Benchmark for Heterophilic Text-Attributed Graphs

Title: Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems

Title: Beyond Next Word Prediction: Developing Comprehensive Evaluation Frameworks for measuring LLM performance on real world applications

Title: Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

Title: "Only ChatGPT gets me": An Empirical Analysis of GPT versus other Large Language Models for Emotion Detection in Text

Title: Extrapolation Merging: Keep Improving With Extrapolation and Merging

Title: Framing the Game: How Context Shapes LLM Decision-Making

Title: Three tiers of computation in transformers and in brain architectures

Title: Enhancing Collective Intelligence in Large Language Models Through Emotional Integration

Title: One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs

Title: Codebook Reduction and Saturation: Novel observations on Inductive Thematic Saturation for Large Language Models and initial coding in Thematic Analysis

Title: TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Title: Are Large Language Models Good In-context Learners for Financial Sentiment Analysis?

Title: Memory Is All You Need: Testing How Model Memory Affects LLM Performance in Annotation Tasks

Title: Architecture for a Trustworthy Quantum Chatbot

Title: Maximizing Signal in Human-Model Preference Alignment

Title: HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models

Title: VQEL: Enabling Self-Developed Symbolic Language in Agents through Vector Quantization in Emergent Language Games

Title: Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems

Title: DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL

Title: Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Title: Application of integrated gradients explainability to sociopsychological semantic markers

Title: DP-GTR: Differentially Private Prompt Protection via Group Text Rewriting

Title: HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model

Title: Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models

Title: Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation

Title: Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety

Title: Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence

Title: Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference

Title: Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets

Title: A Unified Framework with Novel Metrics for Evaluating the Effectiveness of XAI Techniques in LLMs

Title: ModernBERT is More Efficient than Conventional BERT for Chest CT Findings Classification in Japanese Radiology Reports

Title: No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding

Title: S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Title: SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding

Title: RocketEval: Efficient Automated LLM Evaluation via Grading Checklist

Title: Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History

Title: Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy

Title: Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Title: Rewarding Curse: Analyze and Mitigate Reward Modeling Issues for LLM Reasoning

Title: Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning

Title: ORANSight-2.0: Foundational LLMs for O-RAN

Title: Knowledge Updating? No More Model Editing! Just Selective Contextual Reasoning

Title: Personalized Text Generation with Contrastive Activation Steering

Title: MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio

Title: ZOGRASCOPE: A New Benchmark for Property Graphs

Title: Revealing Hidden Mechanisms of Cross-Country Content Moderation with Natural Language Processing

Title: Similarity-Based Domain Adaptation with LLMs

Title: Coreference as an indicator of context scope in multimodal narrative

Title: Uncertainty-Aware Decoding with Minimum Bayes Risk

Title: Fine-Grained Evaluation for Implicit Discourse Relation Recognition

Title: Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models

Title: AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications

Title: GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation

Title: Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter

Title: An Empirical Study of Conformal Prediction in LLM with ASP Scaffolds for Robust Reasoning

Title: Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering

Title: Quantifying the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data

Title: AceWGS: An LLM-Aided Framework to Accelerate Catalyst Design for Water-Gas Shift Reactions

Title: Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings

Title: Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning

Title: Understanding the Limits of Lifelong Knowledge Editing in LLMs