2025-11-12

Title: A Preliminary Study of RAG for Taiwanese Historical Archives

Title: Large Language Models for Scientific Idea Generation: A Creativity-Centered Survey

Title: GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models

Title: REFLEX: Reference-Free Evaluation of Log Summarization via Large Language Model Judgment

Title: It Takes Two: A Dual Stage Approach for Terminology-Aware Translation

Title: Motif 2 12.7B technical report

Title: Focusing on Language: Revealing and Exploiting Language Attention Heads in Multilingual Large Language Models

Title: LLM Optimization Unlocks Real-Time Pairwise Reranking

Title: LLMs vs. Traditional Sentiment Tools in Psychology: An Evaluation on Belgian-Dutch Narratives

Title: Revisiting NLI: Towards Cost-Effective and Human-Aligned Metrics for Evaluating LLMs in Question Answering

Title: CAPO: Confidence Aware Preference Optimization Learning for Multilingual Preferences

Title: Critical Confabulation: Can LLMs Hallucinate for Social Good?

Title: Back to the Future: The Role of Past and Future Context Predictability in Incremental Language Production

Title: Design, Results and Industry Implications of the World's First Insurance Large Language Model Evaluation Benchmark

Title: From Experience to Strategy: Empowering LLM Agents with Trainable Graph Memory

Title: AlignSurvey: A Comprehensive Benchmark for Human Preferences Alignment in Social Surveys

Title: Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning

Title: Unified Work Embeddings: Contrastive Learning of a Bidirectional Multi-task Ranker

Title: NOTAM-Evolve: A Knowledge-Guided Self-Evolving Optimization Framework with LLMs for NOTAM Interpretation

Title: State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

Title: Self-Correction Distillation for Structured Data Question Answering

Title: HyCoRA: Hyper-Contrastive Role-Adaptive Learning for Role-Playing

Title: Estranged Predictions: Measuring Semantic Category Disruption with Masked Language Modelling

Title: Multimodal LLMs Do Not Compose Skills Optimally Across Modalities

Title: Quantification and object perception in Multimodal Large Language Models deviate from human linguistic cognition

Title: Sentence-Anchored Gist Compression for Long-Context LLMs

Title: On the Interplay between Positional Encodings, Morphological Complexity, and Word Order Flexibility

Title: Relation as a Prior: A Novel Paradigm for LLM-based Document-level Relation Extraction

Title: Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?

Title: Do Syntactic Categories Help in Developmentally Motivated Curriculum Learning for Language Models?

Title: Encoder Fine-tuning with Stochastic Sampling Outperforms Open-weight GPT in Astronomy Knowledge Extraction

Title: Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

Title: VocalBench-zh: Decomposing and Benchmarking the Speech Conversational Abilities in Mandarin Context

Title: Prompt Tuning for Natural Language to SQL with Embedding Fine-Tuning and RAG

Title: ParliaBench: An Evaluation and Benchmarking Framework for LLM-Generated Parliamentary Speech

Title: Hierarchical structure understanding in complex tables with VLLMs: a benchmark and experiments

Title: Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates

Title: Adaptive Multi-Agent Response Refinement in Conversational Systems

Title: AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress

Title: DPRM: A Dual Implicit Process Reward Model in Multi-Hop Question Answering

Title: PCRLLM: Proof-Carrying Reasoning with Large Language Models under Stepwise Logical Constraints

Title: Interaction Dynamics as a Reward Signal for LLMs

Title: Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios?

Title: SPEAR-MM: Selective Parameter Evaluation and Restoration via Model Merging for Efficient Financial LLM Adaptation

Title: Structured RAG for Answering Aggregative Questions

Title: Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research

Title: AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Title: Investigating CoT Monitorability in Large Reasoning Models

Title: From Semantic Roles to Opinion Roles: SRL Data Extraction for Multi-Task and Transfer Learning in Low-Resource ORL

Title: Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models

Title: Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Title: Training Language Models to Explain Their Own Computations