2025-05-28

Title: Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs

Title: Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

Title: Beyond Demonstrations: Dynamic Vector Construction from Latent Representations

Title: Less Context, Same Performance: A RAG Framework for Resource-Efficient LLM-Based Clinical NLP

Title: BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases

Title: Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms

Title: PMOA-TTS: Introducing the PubMed Open Access Textual Times Series Corpus

Title: Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

Title: Multi-Scale Manifold Alignment: A Unified Framework for Enhanced Explainability of Large Language Models

Title: Lookahead Q-Cache: Achieving More Consistent KV Cache Eviction via Pseudo Query

Title: Language Model Distillation: A Temporal Difference Imitation Learning Perspective

Title: MOSLIM:Align with diverse preferences in prompts through reward classification

Title: Assessing the Capability of LLMs in Solving POSCOMP Questions

Title: Dynamic Manifold Evolution Theory: Modeling and Stability Analysis of Latent Representations in Large Language Models

Title: Do LLMs have a Gender (Entropy) Bias?

Title: SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

Title: Rethinking Text-based Protein Understanding: Retrieval or LLM?

Title: Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision

Title: GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Title: SEMMA: A Semantic Aware Knowledge Graph Foundation Model

Title: HAMburger: Accelerating LLM Inference via Token Smashing

Title: In-context Language Learning for Endangered Languages in Speech Recognition

Title: Amulet: Putting Complex Multi-Turn Conversations on the Stand with LLM Juries

Title: Conversation Kernels: A Flexible Mechanism to Learn Relevant Context for Online Conversation Understanding

Title: InFact: Informativeness Alignment for Improved LLM Factuality

Title: Beyond Keywords: Evaluating Large Language Model Classification of Nuanced Ableism

Title: Gatsby Without the 'E': Crafting Lipograms with LLMs

Title: Large Language Models for IT Automation Tasks: Are We There Yet?

Title: AstroVisBench: A Code Benchmark for Scientific Computing and Visualization in Astronomy

Title: Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline

Title: Effectiveness of Prompt Optimization in NL2SQL Systems

Title: REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning

Title: SeqPO-SiMT: Sequential Policy Optimization for Simultaneous Machine Translation

Title: POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization

Title: Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration

Title: Test-Time Learning for Large Language Models

Title: STEER-BENCH: A Benchmark for Evaluating the Steerability of Large Language Models

Title: FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information

Title: Enhancing Transformation from Natural Language to Signal Temporal Logic Using LLMs with Diverse External Knowledge

Title: BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

Title: Self-Route: Automatic Mode Switching via Capability Estimation for Efficient Reasoning

Title: Pretraining Language Models to Ponder in Continuous Space

Title: SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations

Title: Beyond Templates: Dynamic Adaptation of Reasoning Demonstrations via Feasibility-Aware Exploration

Title: Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective

Title: SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution

Title: Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator

Title: CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models

Title: SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences

Title: CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature

Title: Improved Representation Steering for Language Models

Title: Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective

Title: Tracing and Reversing Rank-One Model Edits

Title: Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

Title: AdParaphrase v2.0: Generating Attractive Ad Texts Using a Preference-Annotated Paraphrase Dataset

Title: Concealment of Intent: A Game-Theoretic Analysis

Title: Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG

Title: Can LLMs Learn to Map the World from Local Descriptions?

Title: Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties

Title: MSA at SemEval-2025 Task 3: High Quality Weak Labeling and LLM Ensemble Verification for Multilingual Hallucination Detection

Title: EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models

Title: A Stereotype Content Analysis on Color-related Social Bias in Large Vision Language Models

Title: Towards Objective Fine-tuning: How LLMs' Prior Knowledge Causes Potential Poor Calibration?

Title: Automated Privacy Information Annotation in Large Language Model Interactions

Title: Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models

Title: Multi-objective Large Language Model Alignment with Hierarchical Experts

Title: Information-Theoretic Complementary Prompts for Improved Continual Text Classification

Title: On VLMs for Diverse Tasks in Multimodal Meme Classification

Title: Research Community Perspectives on "Intelligence" and Large Language Models

Title: Context-Aware Content Moderation for German Newspaper Comments

Title: Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA

Title: Contrastive Learning on LLM Back Generation Treebank for Cross-domain Constituency Parsing

Title: Evaluating and Steering Modality Preferences in Multimodal Large Language Model

Title: Who Reasons in the Large Language Models?

Title: Uncertainty Unveiled: Can Exposure to More In-context Examples Mitigate Uncertainty for Large Language Models?

Title: LLMs are Frequency Pattern Learners in Natural Language Inference

Title: Def-DTS: Deductive Reasoning for Open-domain Dialogue Topic Segmentation

Title: FCKT: Fine-Grained Cross-Task Knowledge Transfer with Semantic Contrastive Learning for Targeted Sentiment Analysis

Title: Predicting Implicit Arguments in Procedural Video Instructions

Title: Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation

Title: LLMs Think, But Not In Your Flow: Reasoning-Level Personalization for Black-Box Large Language Models

Title: BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge

Title: Thinker: Learning to Think Fast and Slow

Title: A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction

Title: Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Title: Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction

Title: Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis

Title: Assessment of L2 Oral Proficiency using Speech Large Language Models

Title: M-Wanda: Improving One-Shot Pruning for Multilingual LLMs

Title: TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Title: Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning

Title: Exploring the Latent Capacity of LLMs for One-Step Text Generation

Title: Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

Title: Pretrained LLMs Learn Multiple Types of Uncertainty

Title: LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners

Title: Evaluation of LLMs in Medical Text Summarization: The Role of Vocabulary Adaptation in High OOV Settings

Title: ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision

Title: Multilingual Pretraining for Pixel Language Models

Title: rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset

Title: How Humans and LLMs Organize Conceptual Knowledge: Exploring Subordinate Categories in Italian

Title: Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead

Title: Leveraging large language models and traditional machine learning ensembles for ADHD detection from narrative transcripts

Title: PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims

Title: Leveraging Large Language Models for Bengali Math Word Problem Solving with Chain of Thought Reasoning

Title: Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History

Title: Analyzing values about gendered language reform in LLMs' revisions

Title: AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs

Title: Improving Research Idea Generation Through Data: An Empirical Investigation in Social Science

Title: DecisionFlow: Advancing Large Language Model as Principled Decision Maker

Title: Factual Self-Awareness in Language Models: Representation, Robustness, and Scaling

Title: RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models

Title: Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Title: RefTool: Enhancing Model Reasoning with Reference-Guided Tool Creation

Title: Towards Better Instruction Following Retrieval Models

Title: Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication

Title: Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance

Title: Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion

Title: Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration

Title: Are Language Models Consequentialist or Deontological Moral Reasoners?

Title: UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Title: Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making

Title: How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective