2025-12-29

Title: Teaching People LLM's Errors and Getting it Right

Title: Morality is Contextual: Learning Interpretable Moral Contexts from Human Data with Probabilistic Clustering and Large Language Models

Title: Oogiri-Master: Benchmarking Humor Understanding via Oogiri

Title: Beyond Heuristics: A Decision-Theoretic Framework for Agent Memory Management

Title: A Unified Definition of Hallucination, Or: It's the World Model, Stupid

Title: Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM

Title: Heaven-Sent or Hell-Bent? Benchmarking the Intelligence and Defectiveness of LLM Hallucinations

Title: MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles

Title: Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers

Title: Do Latent Tokens Think? A Causal and Adversarial Analysis of Chain-of-Continuous-Thought

Title: CATCH: A Controllable Theme Detection Framework with Contextualized Clustering and Hierarchical Generation

Title: Ara-HOPE: Human-Centric Post-Editing Evaluation for Dialectal Arabic to Modern Standard Arabic Translation

Title: Five Years of SciCap: What We Learned and Future Directions for Scientific Figure Captioning

Title: On The Conceptualization and Societal Impact of Cross-Cultural Bias

Title: Method Decoration (DeMe): A Framework for LLM-Driven Adaptive Method Generation in Dynamic IoT Environments

Title: Knowledge Reasoning of Large Language Models Integrating Graph-Structured Information for Pest and Disease Control in Tobacco

Title: AlignAR: Generative Sentence Alignment for Arabic-English Parallel Corpora of Legal and Literary Texts

Title: HeartBench: Probing Core Dimensions of Anthropomorphic Intelligence in LLMs

Title: TimeBill: Time-Budgeted Inference for Large Language Models

Title: Bridging the Copyright Gap: Do Large Vision-Language Models Recognize and Respect Copyrighted Content?

Title: CricBench: A Multilingual Benchmark for Evaluating LLMs in Cricket Analytics

Title: Explainable Statute Prediction via Attention-based Model and LLM Prompting

Title: Accelerate Speculative Decoding with Sparse Computation in Verification

Title: SWE-RM: Execution-free Feedback For Software Engineering Agents

Title: Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs

Title: Context as a Tool: Context Management for Long-Horizon SWE-Agents

Title: Introducing TrGLUE and SentiTurca: A Comprehensive Benchmark for Turkish General Language Understanding and Sentiment Analysis