2025-11-04

Title: PlotCraft: Pushing the Limits of LLMs for Complex and Interactive Data Visualization

Title: Cognitive Alignment in Personality Reasoning: Leveraging Prototype Theory for MBTI Inference

Title: ParaScopes: What do Language Models Activations Encode About Future Text?

Title: Training LLMs Beyond Next Token Prediction - Filling the Mutual Information Gap

Title: Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning

Title: AgentBnB: A Browser-Based Cybersecurity Tabletop Exercise with Large Language Model Support and Retrieval-Aligned Scaffolding

Title: IL-PCSR: Legal Corpus for Prior Case and Statute Retrieval

Title: Language Modeling With Factorization Memory

Title: Reversal Invariance in Autoregressive Language Models

Title: LingGym: How Far Are LLMs from Thinking Like Field Linguists?

Title: Reasoning Trajectories for Socratic Debugging of Student Code: From Misconceptions to Contradictions and Updated Beliefs

Title: PADBen: A Comprehensive Benchmark for Evaluating AI Text Detectors Against Paraphrase Attacks

Title: MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts

Title: G2: Guided Generation for Enhanced Output Diversity in LLMs

Title: Remembering Unequally: Global and Disciplinary Bias in LLM-Generated Co-Authorship Networks

Title: Leveraging the Cross-Domain & Cross-Linguistic Corpus for Low Resource NMT: A Case Study On Bhili-Hindi-English Parallel Corpus

Title: ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models

Title: Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge

Title: Fine-Tuning DialoGPT on Common Diseases in Rural Nepal for Medical Conversations

Title: Exploring and Mitigating Gender Bias in Encoder-Based Transformer Models

Title: Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly

Title: Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction

Title: Friend or Foe: How LLMs' Safety Mind Gets Fooled by Intent Shift Attack

Title: FlashEVA: Accelerating LLM inference via Efficient Attention

Title: OpenSIR: Open-Ended Self-Improving Reasoner

Title: SpecDiff-2: Scaling Diffusion Drafter Alignment For Faster Speculative Decoding

Title: Certain but not Probable? Differentiating Certainty from Probability in LLM Token Outputs for Probabilistic Scenarios

Title: Do You Know About My Nation? Investigating Multilingual Language Models' Cultural Literacy Through Factual Knowledge

Title: Do Methods to Jailbreak and Defend LLMs Generalize Across Languages?

Title: TriCon-Fair: Triplet Contrastive Learning for Mitigating Social Bias in Pre-trained Language Models

Title: Assessing LLM Reasoning Steps via Principal Knowledge Grounding

Title: ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval

Title: The Biased Oracle: Assessing LLMs' Understandability and Empathy in Medical Diagnoses

Title: The Riddle of Reflection: Evaluating Reasoning and Self-Awareness in Multilingual LLMs using Indian Riddles

Title: Advancing Machine-Generated Text Detection from an Easy to Hard Supervision Perspective

Title: MARS-SQL: A multi-agent reinforcement learning framework for Text-to-SQL

Title: IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation

Title: Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning

Title: OceanAI: A Conversational Platform for Accurate, Transparent, Near-Real-Time Oceanographic Insights

Title: VayuChat: An LLM-Powered Conversational Interface for Air Quality Data Analytics

Title: Building a Silver-Standard Dataset from NICE Guidelines for Clinical LLMs

Title: HPLT~3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models

Title: Improving Romanian LLM Pretraining Data using Diversity and Quality Filtering

Title: TSVer: A Benchmark for Fact Verification Against Time-Series Evidence

Title: MicroRemed: Benchmarking LLMs in Microservices Remediation

Title: Learning When to Quit in Sales Conversations

Title: Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs

Title: ZoFia: Zero-Shot Fake News Detection with Entity-Guided Retrieval and Multi-LLM Interaction

Title: DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection

Title: AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMs

Title: When, What, and How: Rethinking Retrieval-Enhanced Speculative Decoding

Title: "Give a Positive Review Only": An Early Investigation Into In-Paper Prompt Injection Attacks and Defenses for AI Reviewers

Title: FirstAidQA: A Synthetic Dataset for First Aid and Emergency Response in Low-Connectivity Settings

Title: DeepSpecs: Expert-Level Questions Answering in 5G

Title: DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness

Title: PrefixNLI: Detecting Factual Inconsistencies as Soon as They Arise

Title: Safer in Translation? Presupposition Robustness in Indic Languages

Title: The Ouroboros of Benchmarking: Reasoning Evaluation in an Era of Saturation

Title: Confounding Factors in Relating Model Performance to Morphology

Title: RAGSmith: A Framework for Finding the Optimal Composition of Retrieval-Augmented Generation Methods Across Datasets

Title: LiveSearchBench: An Automatically Constructed Benchmark for Retrieval and Reasoning over Dynamic Knowledge

Title: "Don't Teach Minerva": Guiding LLMs Through Complex Syntax for Faithful Latin Translation with RAG

Title: BARD: budget-aware reasoning distillation

Title: Towards Consistent Detection of Cognitive Distortions: LLM-Based Annotation and Dataset-Agnostic Evaluation

Title: Synthetic Eggs in Many Baskets: The Impact of Synthetic Data Diversity on LLM Fine-Tuning

Title: BanglaNirTox: A Large-scale Parallel Corpus for Explainable AI in Bengali Text Detoxification

Title: Difficulty-Controllable Cloze Question Distractor Generation

Title: Math anxiety and associative knowledge structure are entwined in psychology students but not in Large Language Models like GPT-3.5 and GPT-4o

Title: ECO Decoding: Entropy-Based Control for Controllability and Fluency in Controllable Dialogue Generation

Title: BIRD: Bronze Inscription Restoration and Dating

Title: Imperfect Language, Artificial Intelligence, and the Human Mind: An Interdisciplinary Approach to Linguistic Errors in Native Spanish Speakers

Title: A Graph-based RAG for Energy Efficiency Question Answering

Title: Evaluating Cultural Knowledge Processing in Large Language Models: A Cognitive Benchmarking Framework Integrating Retrieval-Augmented Generation

Title: EngChain: A Symbolic Benchmark for Verifiable Multi-Step Reasoning in Engineering

Title: SeaLLMs-Audio: Large Audio-Language Models for Southeast Asia

Title: Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI

Title: Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement

Title: Efficient Tool-Calling Multi-Expert NPC Agent for Commonsense Persona-Grounded Dialogue

Title: Accumulating Context Changes the Beliefs of Language Models

Title: Plan-and-Write: Structure-Guided Length Control for LLMs without Model Retraining

Title: KV Cache Transform Coding for Compact Storage in LLM Inference

Title: Tool-to-Agent Retrieval: Bridging Tools and Agents for Scalable LLM Multi-Agent Systems