2025-11-05

Title: Multi-Personality Generation of LLMs at Decoding-time

Title: Rethinking LLM Human Simulation: When a Graph is What You Need

Title: IG-Pruning: Input-Guided Block Pruning for Large Language Models

Title: Demo: Statistically Significant Results On Biases and Errors of LLMs Do Not Guarantee Generalizable Results

Title: LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Title: Let Multimodal Embedders Learn When to Augment Query via Adaptive Query Augmentation

Title: LiveSecBench: A Dynamic and Culturally-Relevant AI Safety Benchmark for LLMs in Chinese Context

Title: AyurParam: A State-of-the-Art Bilingual Language Model for Ayurveda

Title: AutoAdv: Automated Adversarial Prompting for Multi-Turn Jailbreaking of Large Language Models

Title: Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance

Title: Prompting for Policy: Forecasting Macroeconomic Scenarios with Synthetic LLM Personas

Title: Next Token Knowledge Tracing: Exploiting Pretrained LLM Representations to Decode Student Behaviour

Title: CGES: Confidence-Guided Early Stopping for Efficient and Accurate Self-Consistency

Title: The Realignment Problem: When Right becomes Wrong in LLMs

Title: Understanding New-Knowledge-Induced Factual Hallucinations in LLMs: Analysis, Solution, and Interpretation

Title: Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes

Title: AI Diffusion in Low Resource Language Countries

Title: Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning

Title: MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

Title: Oolong: Evaluating Long Context Reasoning and Aggregation Capabilities