2025-04-15

Title: Layers at Similar Depths Generate Similar Activations Across LLM Architectures

Title: From Tokens to Lattices: Emergent Lattice Structures in Language Models

Title: Can AI Master Construction Management (CM)? Benchmarking State-of-the-Art Large Language Models on CM Certification Exams

Title: Efficient Evaluation of Large Language Models via Collaborative Filtering

Title: Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation

Title: Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks

Title: Exploring the Effectiveness and Interpretability of Texts in LLM-based Time Series Models

Title: CAReDiO: Cultural Alignment of LLM via Representativeness and Distinctiveness Guided Data Optimization

Title: SD$^2$: Self-Distilled Sparse Drafters

Title: Forecasting Communication Derailments Through Conversation Generation

Title: Generating Planning Feedback for Open-Ended Programming Exercises with LLMs

Title: A Fully Automated Pipeline for Conversational Discourse Annotation: Tree Scheme Generation and Labeling with Large Language Models

Title: From Punchlines to Predictions: A Metric to Assess LLM Performance in Identifying Humor in Stand-Up Comedy

Title: Exploration of Plan-Guided Summarization for Narrative Texts: the Case of Small Language Models

Title: A Multi-view Discourse Framework for Integrating Semantic and Syntactic Features in Dialog Agents

Title: Enhancing Dialogue Systems with Discourse-Level Understanding Using Deep Canonical Correlation Analysis

Title: VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Title: Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models

Title: Can postgraduate translation students identify machine-generated text?

Title: Langformers: Unified NLP Pipelines for Language Models

Title: Parameterized Synthetic Text Generation with SimpleStories

Title: Feature-Aware Malicious Output Detection and Mitigation

Title: Enhancing Contrastive Demonstration Selection with Semantic Diversity for Robust In-Context Machine Translation

Title: Improving the Accuracy and Efficiency of Legal Document Tagging with Large Language Models and Instruction Prompts

Title: QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

Title: Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs

Title: On Language Models' Sensitivity to Suspicious Coincidences

Title: Beyond Memorization: Mapping the Originality-Quality Frontier of Language Models

Title: Evaluation Under Imperfect Benchmarks and Ratings: A Case Study in Text Simplification

Title: Question Tokens Deserve More Attention: Enhancing Large Language Models without Training through Step-by-Step Reading and Question Attention Recalibration

Title: UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Title: SaRO: Enhancing LLM Safety through Reasoning-based Alignment

Title: ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model

Title: HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs

Title: Kongzi: A Historical Large Language Model with Fact Enhancement

Title: MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs

Title: How new data permeates LLM knowledge and how to dilute it

Title: Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution

Title: LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline

Title: Short-Path Prompting in LLMs: Analyzing Reasoning Instability and Solutions for Robust Performance

Title: Metropolis-Hastings Captioning Game: Knowledge Fusion of Vision Language Models via Decentralized Bayesian Inference

Title: Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability

Title: Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Title: Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar

Title: CLEAR-KGQA: Clarification-Enhanced Ambiguity Resolution for Knowledge Graph Question Answering

Title: Domain-Adaptive Continued Pre-Training of Small Language Models

Title: GRPO-LEAD: A Difficulty-Aware Reinforcement Learning Approach for Concise Mathematical Reasoning in Language Models

Title: Evaluating the Quality of Benchmark Datasets for Low-Resource Languages: A Case Study on Turkish

Title: Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance

Title: Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Title: Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning

Title: VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

Title: Training Small Reasoning LLMs with Cognitive Preference Alignment

Title: Transferable text data distillation by trajectory matching

Title: Investigating Syntactic Biases in Multilingual Transformers with RC Attachment Ambiguities in Italian and English

Title: Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data

Title: TWSSenti: A Novel Hybrid Framework for Topic-Wise Sentiment Analysis on Social Media Using Transformer Models

Title: Refining Financial Consumer Complaints through Multi-Scale Model Interaction

Title: Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models

Title: Guiding Reasoning in Small Language Models with LLM Assistance

Title: C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset

Title: The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination

Title: DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify

Title: Hallucination Detection in LLMs via Topological Divergence on Attention Graphs

Title: Towards Quantifying Commonsense Reasoning with Mechanistic Insights

Title: SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users

Title: MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning

Title: C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation

Title: HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection

Title: LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks

Title: Deep Reasoning Translation via Reinforcement Learning

Title: Localized Cultural Knowledge is Conserved and Controllable in Large Language Models

Title: DioR: Adaptive Cognitive Detection and Contextual Retrieval Optimization for Dynamic Retrieval-Augmented Generation

Title: Probing then Editing Response Personality of Large Language Models

Title: Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol

Title: MorphTok: Morphologically Grounded Tokenization for Indian Languages

Title: Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families

Title: VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Title: MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages

Title: DICE: A Framework for Dimensional and Contextual Evaluation of Language Models

Title: S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Title: LLM-driven Constrained Copy Generation through Iterative Refinement

Title: Performance of Large Language Models in Supporting Medical Diagnosis and Treatment

Title: LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Title: CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation

Title: Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

Title: Can We Edit LLMs for Long-Tail Biomedical Knowledge?

Title: LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models

Title: xVerify: Efficient Answer Verifier for Reasoning Model Evaluations