2025-10-24

Title: DeBERTa-KC: A Transformer-Based Classifier for Knowledge Construction in Online Learning Discourse

Title: An Evaluation of the Pedagogical Soundness and Usability of AI-Generated Lesson Plans Across Different Models and Prompt Frameworks in High-School Physics

Title: From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Title: Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention

Title: Automated HIV Screening on Dutch EHR with Large Language Models

Title: An Expert-grounded benchmark of General Purpose LLMs in LCA

Title: Can They Dixit? Yes they Can! Dixit as a Playground for Multimodal Language Model Capabilities

Title: Large Language Model enabled Mathematical Modeling

Title: Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation

Title: LLM-Augmented Symbolic NLU System for More Reliable Continuous Causal Statement Interpretation

Title: Beyond MedQA: Towards Real-world Clinical Decision Making in the Era of LLMs

Title: Improving Transfer Learning for Sequence Labeling Tasks by Adapting Pre-trained Neural Language Models

Title: ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering

Title: From Facts to Folklore: Evaluating Large Language Models on Bengali Cultural Knowledge

Title: Enhancing Reasoning Skills in Small Persian Medical Language Models Can Outperform Large-Scale Data Training

Title: CreativityPrism: A Holistic Benchmark for Large Language Model Creativity

Title: Leveraging the Power of Large Language Models in Entity Linking via Adaptive Routing and Targeted Reasoning

Title: BoundRL: Efficient Structured Text Segmentation through Reinforced Boundary Generation

Title: Are Stereotypes Leading LLMs' Zero-Shot Stance Detection ?

Title: DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking

Title: Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding

Title: Stuck in the Matrix: Probing Spatial Reasoning in Large Language Models

Title: Decoding-Free Sampling Strategies for LLM Marginalization

Title: Context-level Language Modeling by Learning Predictive Context Embeddings

Title: Citation Failure: Definition, Analysis and Efficient Mitigation

Title: Exploring Generative Process Reward Modeling for Semi-Structured Data: A Case Study of Table Question Answering

Title: Teaching Language Models to Reason with Tools

Title: Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

Title: FreeChunker: A Cross-Granularity Chunking Framework

Title: Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning)

Title: The Impact of Negated Text on Hallucination with Large Language Models

Title: Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction

Title: LM-mixup: Text Data Augmentation via Language Model based Mixup

Title: Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models

Title: Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs

Title: RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging

Title: Steering Evaluation-Aware Language Models To Act Like They Are Deployed

Title: Robust Preference Alignment via Directional Neighborhood Consensus

Title: Hierarchical Sequence Iteration for Heterogeneous Question Answering

Title: Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset

Title: ARC-Encoder: learning compressed text representations for large language models

Title: The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts

Title: GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning

Title: Beyond Retrieval-Ranking: A Multi-Agent Cognitive Decision Framework for E-Commerce Search

Title: Can ChatGPT Code Communication Data Fairly?: Empirical Evidence from Multiple Collaborative Tasks

Title: Why Did Apple Fall To The Ground: Evaluating Curiosity In Large Language Model

Title: Neural Diversity Regularizes Hallucinations in Small Models

Title: Structure-Conditional Minimum Bayes Risk Decoding

Title: User Perceptions of Privacy and Helpfulness in LLM Responses to Privacy-Sensitive Scenarios

Title: Automated Extraction of Fluoropyrimidine Treatment and Treatment-Related Toxicities from Clinical Notes Using Natural Language Processing

Title: A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text

Title: Simple Context Compression: Mean-Pooling and Multi-Ratio Training

Title: On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?