2025-04-16

Title: LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections

Title: Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models

Title: Better Estimation of the KL Divergence Between Language Models

Title: Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning

Title: Improving In-Context Learning with Reasoning Distillation

Title: LITERA: An LLM Based Approach to Latin-to-English Translation

Title: Keyword Extraction, and Aspect Classification in Sinhala, English, and Code-Mixed Content

Title: EMAFusion: A Self-Optimizing System for Seamless LLM Selection and Integration

Title: HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving

Title: The Art of Audience Engagement: LLM-Based Thin-Slicing of Scientific Talks

Title: GUM-SAGE: A Novel Dataset and Approach for Graded Entity Salience Prediction

Title: Name of Thrones: Evaluating How LLMs Rank Student Names, Race, and Gender in Status Hierarchies

Title: CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Title: Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators

Title: Ai2 Scholar QA: Organized Literature Synthesis with Attribution

Title: Efficient Reasoning Models: A Survey

Title: Understanding LLMs' Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From

Title: Exploring the Role of KG-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs

Title: ReZero: Enhancing LLM search ability by trying one-more-time

Title: Dynamic Compressing Prompts for Efficient Inference of Large Language Models

Title: LazyReview: A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews

Title: DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis

Title: Using LLMs as prompt modifier to avoid biases in AI image generators

Title: Benchmarking Vision Language Models on German Factual Data

Title: MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos

Title: Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting

Title: Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items

Title: From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs

Title: Automated Python Translation

Title: REWARD CONSISTENCY: Improving Multi-Objective Alignment from a Data-Centric Perspective

Title: OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution

Title: Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions

Title: RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models

Title: Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Title: Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts

Title: A Dual-Space Framework for General Knowledge Distillation of Large Language Models

Title: Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models

Title: TextArena

Title: DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning