2025-04-21

Title: Benchmarking Large Language Models for Calculus Problem-Solving: A Comparative Analysis

Title: BASIR: Budget-Assisted Sectoral Impact Ranking -- A Dataset for Sector Identification and Performance Prediction Using Language Models

Title: KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding

Title: Sustainability via LLM Right-sizing

Title: DIDS: Domain Impact-aware Data Sampling for Large Language Model Training

Title: ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs

Title: CPG-EVAL: A Multi-Tiered Benchmark for Evaluating the Chinese Pedagogical Grammar Competence of Large Language Models

Title: THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Title: Secure Multifaceted-RAG for Enterprise: Hybrid Knowledge Retrieval with Security Filtering

Title: From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs

Title: LLM Sensitivity Evaluation Framework for Clinical Diagnosis

Title: Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning

Title: CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models

Title: DETAM: Defending LLMs Against Jailbreak Attacks via Targeted Attention Modification

Title: Improving Generalization in Intent Detection: GRPO with Reward-Based Curriculum Sampling

Title: Continual Pre-Training is (not) What You Need in Domain Adaption

Title: Long-context Non-factoid Question Answering in Indic Languages

Title: Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing

Title: Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling

Title: Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning

Title: Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

Title: Deep literature reviews: an application of fine-tuned language models to migration research

Title: Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence

Title: BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models

Title: Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Title: Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models

Title: Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Title: Science Hierarchography: Hierarchical Organization of Science Literature