2025-09-03

Title: MultiStream-LLM: Bridging Modalities for Robust Sign Language Translation

Title: Compiling Prompts, Not Crafting Them: A Reproducible Workflow for AI-Assisted Evidence Synthesis

Title: Explainable Chain-of-Thought Reasoning: An Empirical Analysis on State-Aware Reasoning Dynamics

Title: The Rarity Blind Spot: A Framework for Evaluating Statistical Reasoning in LLMs

Title: The Temporal Game: A New Perspective on Temporal Relation Extraction

Title: Exploring Reasoning-Infused Text Embedding with Large Language Models for Zero-Shot Dense Retrieval

Title: OpinioRAG: Towards Generating User-Centric Opinion Highlights from Large-scale Online Reviews

Title: Wage Sentiment Indices Derived from Survey Comments via Large Language Models

Title: Balanced Actor Initialization: Stable RLHF Training of Distillation-Based Reasoning Models

Title: GIER: Gap-Driven Self-Refinement for Large Language Models

Title: Open Data Synthesis For Deep Research

Title: GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction

Title: The Resurgence of GCG Adversarial Attacks on Large Language Models

Title: MedSEBA: Synthesizing Evidence-Based Answers Grounded in Evolving Medical Literature

Title: The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang

Title: GOSU: Retrieval-Augmented Generation with Global-Level Optimized Semantic Unit-Centric Framework

Title: CVPD at QIAS 2025 Shared Task: An Efficient Encoder-Based Approach for Islamic Inheritance Reasoning

Title: TECP: Token-Entropy Conformal Prediction for LLMs

Title: Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting

Title: ResearchQA: Evaluating Scholarly Question Answering at Scale Across 75 Fields with Survey-Mined Questions and Rubrics

Title: Entropy-based Coarse and Compressed Semantic Speech Representation Learning

Title: Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization

Title: Thinking Hard, Going Misaligned: Emergent Misalignment in LLMs

Title: StealthEval: A Probe-Rewrite-Evaluate Workflow for Reliable Benchmarks

Title: Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling

Title: Can Multi-turn Self-refined Single Agent LMs with Retrieval Solve Hard Coding Problems?

Title: Confident, Calibrated, or Complicit: Probing the Trade-offs between Safety Alignment and Ideological Bias in Language Models in Detecting Hate Speech

Title: Do small language models generate realistic variable-quality fake news headlines?

Title: CE-Bench: Towards a Reliable Contrastive Evaluation Benchmark of Interpretability of Sparse Autoencoders

Title: Learning to Shop Like Humans: A Review-driven Retrieval-Augmented Recommendation Framework with LLMs

Title: Reward-Weighted Sampling: Enhancing Non-Autoregressive Characteristics in Masked Diffusion LLMs

Title: Designing LMS and Instructional Strategies for Integrating Generative-Conversational AI

Title: LLM Encoder vs. Decoder: Robust Detection of Chinese AI-Generated Text with LoRA

Title: Decomposing and Revising What Language Models Generate

Title: CaresAI at BioCreative IX Track 1 -- LLM for Biomedical QA

Title: Neural Models and Language Model Prompting for the Multidimensional Evaluation of Open-Ended Conversations

Title: Negative Matters: Multi-Granularity Hard-Negative Synthesis and Anchor-Token-Aware Pooling for Enhanced Text Embeddings

Title: Prompting Away Stereotypes? Evaluating Bias in Text-to-Image Models for Occupations

Title: Exploring and Mitigating Fawning Hallucinations in Large Language Models

Title: EviNote-RAG: Enhancing RAG Models via Answer-Supportive Evidence Notes

Title: SeLeRoSa: Sentence-Level Romanian Satire Detection Dataset

Title: Supervised In-Context Fine-Tuning for Generative Sequence Labeling

Title: MedCOD: Enhancing English-to-Spanish Medical Translation of Large Language Models Using Enriched Chain-of-Dictionary Framework

Title: Structure and Destructure: Dual Forces in the Making of Knowledge Engines

Title: RPRO:Ranked Preference Reinforcement Optimization for Enhancing Medical QA and Diagnostic Reasoning

Title: We Politely Insist: Your LLM Must Learn the Persian Art of Taarof

Title: A Dynamic Fusion Model for Consistent Crisis Response

Title: Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL

Title: Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation

Title: Privacy-Preserving Reasoning with Knowledge-Distilled Parametric Retrieval Augmented Generation

Title: REFRAG: Rethinking RAG based Decoding

Title: Natural Context Drift Undermines the Natural Language Understanding of Large Language Models

Title: Dream-Coder 7B: An Open Diffusion Language Model for Code

Title: Zero-shot Cross-lingual NER via Mitigating Language Difference: An Entity-aligned Translation Perspective

Title: Enhancing Large Language Model for Knowledge Graph Completion via Structure-Aware Alignment-Tuning

Title: Modular Techniques for Synthetic Long-Context Data Generation in Language Model Training and Evaluation

Title: Statutory Construction and Interpretation for Artificial Intelligence

Title: Efficient Large Language Models with Zero-Shot Adjustable Acceleration

Title: Mitigating Catastrophic Forgetting in Continual Learning through Model Growth

Title: DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Taks Based on Data and Model Compression

Title: Rethinking the Chain-of-Thought: The Roles of In-Context Learning and Pre-trained Priors

Title: Annotation and modeling of emotions in a textual corpus: an evaluative approach

Title: Culture is Everywhere: A Call for Intentionally Cultural Evaluation

Title: TableZoomer: A Collaborative Agent Framework for Large-scale Table Question Answering

Title: Can Smaller LLMs do better? Unlocking Cross-Domain Potential through Parameter-Efficient Fine-Tuning for Text Summarization

Title: LongCat-Flash Technical Report

Title: KoBLEX: Open Legal Question Answering with Multi-hop Reasoning

Title: Can Large Language Models Master Complex Card Games?

Title: Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Title: WATCHED: A Web AI Agent Tool for Combating Hate Speech by Expanding Data

Title: ABCD-LINK: Annotation Bootstrapping for Cross-Document Fine-Grained Links

Title: LLMs cannot spot math errors, even when allowed to peek into the solution

Title: Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning

Title: On the Alignment of Large Language Models with Global Human Opinion

Title: Trusted Uncertainty in Large Language Models: A Unified Framework for Confidence Calibration and Risk-Controlled Refusal

Title: Robust Knowledge Editing via Explicit Reasoning Chains for Distractor-Resilient Multi-Hop QA

Title: Do Retrieval Augmented Language Models Know When They Don't Know?

Title: MeVe: A Modular System for Memory Verification and Effective Context Control in Language Models

Title: CAT: Causal Attention Tuning For Injecting Fine-grained Causal Knowledge into Large Language Models

Title: In-N-Out: A Parameter-Level API Graph Dataset for Tool Agents

Title: Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief

Title: Benchmarking the Detection of LLMs-Generated Modern Chinese Poetry

Title: chDzDT: Word-level morphology-aware language model for Algerian social media text

Title: Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs

Title: Mic Drop or Data Flop? Evaluating the Fitness for Purpose of AI Voice Interviewers for Data Collection within Quantitative & Qualitative Research Contexts

Title: Extracting OPQRST in Electronic Health Records using Large Language Models with Reasoning

Title: DRAssist: Dispute Resolution Assistance using Large Language Models

Title: StructCoh: Structured Contrastive Learning for Context-Aware Text Semantic Matching

Title: DeepSeek performs better than other Large Language Models in Dental Cases

Title: Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Title: How Instruction-Tuning Imparts Length Control: A Cross-Lingual Mechanistic Analysis

Title: Better by Comparison: Retrieval-Augmented Contrastive Reasoning for Automatic Prompt Optimization

Title: JudgeAgent: Dynamically Evaluate LLMs with Agent-as-Interviewer

Title: CMRAG: Co-modality-based document retrieval and visual question answering

Title: AMBEDKAR-A Multi-level Bias Elimination through a Decoding Approach with Knowledge Augmentation for Robust Constitutional Alignment of Language Models

Title: Avoidance Decoding for Diverse Multi-Branch Story Generation

Title: FActBench: A Benchmark for Fine-grained Automatic Evaluation of LLM-Generated Text in the Medical Domain

Title: Towards Fundamental Language Models: Does Linguistic Competence Scale with Model Size?

Title: LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue

Title: DCPO: Dynamic Clipping Policy Optimization

Title: Implicit Reasoning in Large Language Models: A Comprehensive Survey

Title: Towards Temporal Knowledge-Base Creation for Fine-Grained Opinion Analysis with Language Models

Title: An Ensemble Classification Approach in A Multi-Layered Large Language Model Framework for Disease Prediction

Title: Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions

Title: SpecEval: Evaluating Model Adherence to Behavior Specifications

Title: MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds

Title: L3Cube-IndicHeadline-ID: A Dataset for Headline Identification and Semantic Evaluation in Low-Resource Indian Languages

Title: Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation

Title: Comparative Study of Pre-Trained BERT and Large Language Models for Code-Mixed Named Entity Recognition

Title: Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

Title: Jointly Reinforcing Diversity and Quality in Language Model Generations

Title: PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture