2025-05-20

Title: A Data Synthesis Method Driven by Large Language Models for Proactive Mining of Implicit User Intentions in Tourism

Title: AI-generated Text Detection: A Multifaceted Approach to Binary and Multiclass Classification

Title: Assessing Collective Reasoning in Multi-Agent LLMs via Hidden Profile Tasks

Title: Talk to Your Slides: Efficient Slide Editing Agent with Large Language Models

Title: MedGUIDE: Benchmarking Clinical Decision-Making in Large Language Models

Title: Steering Risk Preferences in Large Language Models by Aligning Behavioral and Neural Representations

Title: THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering

Title: Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation

Title: Can an Easy-to-Hard Curriculum Make Reasoning Emerge in Small Language Models? Evidence from a Four-Stage Curriculum on GPT-2

Title: Multilingual Prompt Engineering in Large Language Models: A Survey Across NLP Tasks

Title: Ambiguity Resolution in Text-to-Structured Data Mapping

Title: MedCaseReasoning: Evaluating and learning diagnostic reasoning from clinical case reports

Title: ZeroTuning: Unlocking the Initial Token's Power to Enhance Large Language Models Without Training

Title: Masking in Multi-hop QA: An Analysis of How Language Models Perform with Context Permutation

Title: Towards Universal Semantics With Large Language Models

Title: Retrospex: Language Agent Meets Offline Reinforcement Learning Critic

Title: Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model

Title: BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering

Title: Chain-of-Model Learning for Language Model

Title: Not All Thoughts are Generated Equal: Efficient LLM Reasoning via Multi-Turn Reinforcement Learning

Title: Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks

Title: Multilingual Collaborative Defense for Large Language Models

Title: When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Title: NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization

Title: AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation

Title: Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents

Title: RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving

Title: Recursive Question Understanding for Complex Question Answering over Heterogeneous Personal Data

Title: ELITE: Embedding-Less retrieval with Iterative Text Exploration

Title: Enhancing Complex Instruction Following for Large Language Models with Mixture-of-Contexts Fine-tuning

Title: An Explanation of Intrinsic Self-Correction via Linear Representations and Latent Concepts

Title: Neuro-Symbolic Query Compiler

Title: ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMs' Capability via Chart Editing

Title: CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation

Title: Unveiling Knowledge Utilization Mechanisms in LLM-based Retrieval-Augmented Generation

Title: Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method

Title: MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities

Title: ABoN: Adaptive Best-of-N Alignment

Title: GenderBench: Evaluation Suite for Gender Biases in LLMs

Title: Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement

Title: Do different prompting methods yield a common task representation in language models?

Title: Model Merging in Pre-training of Large Language Models

Title: Personalized Author Obfuscation with Large Language Models

Title: Improving Fairness in LLMs Through Testing-Time Adversaries

Title: The AI Gap: How Socioeconomic Status Affects Language Technology Interactions

Title: Truth Neurons

Title: Decoding the Mind of Large Language Models: A Quantitative Evaluation of Ideology and Biases

Title: Vectors from Larger Language Models Predict Human Reading Time and fMRI Data More Poorly when Dimensionality Expansion is Controlled

Title: How Reliable is Multilingual LLM-as-a-Judge?

Title: Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Title: GMSA: Enhancing Context Compression via Group Merging and Layer Semantic Alignment

Title: One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models

Title: Examining Linguistic Shifts in Academic Writing Before and After the Launch of ChatGPT: A Study on Preprint Papers

Title: Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training

Title: PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMs

Title: Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce

Title: Not All Documents Are What You Need for Extracting Instruction Tuning Data

Title: Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches

Title: Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation

Title: $K$-MSHC: Unmasking Minimally Sufficient Head Circuits in Large Language Models with Experiments on Syntactic Classification Tasks

Title: LLM-Based Evaluation of Low-Resource Machine Translation: A Reference-less Dialect Guided Approach with a Refined Sylheti-English Benchmark

Title: The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models

Title: Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

Title: HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models

Title: Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection

Title: ExpertSteer: Intervening in LLMs through Expert Knowledge

Title: LLMSR@XLLM25: An Empirical Study of LLM for Structural Reasoning

Title: UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models

Title: Wisdom from Diversity: Bias Mitigation Through Hybrid Human-LLM Crowds

Title: CAPTURE: Context-Aware Prompt Injection Testing and Robustness Enhancement

Title: From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling

Title: SLOT: Sample-specific Language Model Optimization at Test-time

Title: Traversal Verification for Speculative Tree Decoding

Title: The power of text similarity in identifying AI-LLM paraphrased documents: The case of BBC news articles and ChatGPT

Title: Table-R1: Region-based Reinforcement Learning for Table Understanding

Title: PSC: Extending Context Window of Large Language Models via Phase Shift Calibration

Title: Learning to Play Like Humans: A Framework for LLM Adaptation in Interactive Fiction Games

Title: Introspective Growth: Automatically Advancing LLM Expertise in Technology Judgment

Title: Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations

Title: What are they talking about? Benchmarking Large Language Models for Knowledge-Grounded Discussion Summarization

Title: Enhancing Large Language Models with Reward-guided Tree Search for Knowledge Graph Question and Answering

Title: KG-QAGen: A Knowledge-Graph-Based Framework for Systematic Question Generation and Long-Context LLM Evaluation

Title: LM$^2$otifs : An Explainable Framework for Machine-Generated Texts Detection

Title: DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design

Title: ESC-Judge: A Framework for Comparing Emotional Support Conversational Agents

Title: Relation Extraction or Pattern Matching? Unravelling the Generalisation Limits of Language Models for Biographical RE

Title: Disambiguation in Conversational Question Answering in the Era of LLM: A Survey

Title: Towards Reliable and Interpretable Traffic Crash Pattern Prediction and Safety Interventions Using Customized Large Language Models

Title: Extracting memorized pieces of (copyrighted) books from open-weight language models

Title: Enriching Patent Claim Generation with European Patent Dataset

Title: Measuring Information Distortion in Hierarchical Ultra long Novel Generation:The Optimal Expansion Ratio

Title: Improving Multilingual Language Models by Aligning Representations through Steering

Title: CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling

Title: PromptPrism: A Linguistically-Inspired Taxonomy for Prompts

Title: AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection

Title: Think Before You Attribute: Improving the Performance of LLMs Attribution Systems

Title: R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model

Title: Revealing the Deceptiveness of Knowledge Editing: A Mechanistic Analysis of Superficial Editing

Title: Know3-RAG: A Knowledge-aware RAG Framework with Adaptive Retrieval, Generation, and Filtering

Title: Shadow-FT: Tuning Instruct via Base

Title: ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving

Title: Automated Bias Assessment in AI-Generated Educational Content Using CEAT Framework

Title: On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding

Title: What is Stigma Attributed to? A Theory-Grounded, Expert-Annotated Interview Corpus for Demystifying Mental-Health Stigma

Title: ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL

Title: A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone

Title: EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs

Title: Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models

Title: PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs

Title: SynDec: A Synthesize-then-Decode Approach for Arbitrary Textual Style Transfer via Large Language Models

Title: Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering

Title: FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models

Title: The Hidden Structure -- Improving Legal Document Understanding Through Explicit Text Formatting

Title: LEXam: Benchmarking Legal Reasoning on 340 Law Exams

Title: GAP: Graph-Assisted Prompts for Dialogue-based Medication Recommendation

Title: On the Thinking-Language Modeling Gap in Large Language Models

Title: PyFCG: Fluid Construction Grammar in Python

Title: Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs

Title: A3 : an Analytical Low-Rank Approximation Framework for Attention

Title: Neural Morphological Tagging for Nguni Languages

Title: GuRE:Generative Query REwriter for Legal Passage Retrieval

Title: MA-COIR: Leveraging Semantic Search Index and Generative Models for Ontology-Driven Biomedical Concept Recognition

Title: Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down

Title: A Structured Literature Review on Traditional Approaches in Current Natural Language Processing

Title: An Empirical Study of Many-to-Many Summarization with Large Language Models

Title: EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Title: Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain

Title: KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025

Title: Advancing Sequential Numerical Prediction in Autoregressive Models

Title: Systematic Generalization in Language Models Scales with Information Entropy

Title: The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation

Title: Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning

Title: ModernGBERT: German-only 1B Encoder Model Trained from Scratch

Title: Understanding Cross-Lingual Inconsistency in Large Language Models

Title: What if Deception Cannot be Detected? A Cross-Linguistic Study on the Limits of Deception Detection from Text

Title: Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice

Title: Role-Playing Evaluation for Large Language Models

Title: Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks

Title: A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs

Title: ToolSpectrum : Towards Personalized Tool Utilization for Large Language Models

Title: Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space

Title: Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification

Title: Picturized and Recited with Dialects: A Multimodal Chinese Representation Framework for Sentiment Analysis of Classical Chinese Poetry

Title: SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science

Title: JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models

Title: Natural Language Planning via Coding and Inference Scaling

Title: HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding

Title: WikiPersonas: What Can We Learn From Personalized Alignment to Famous People?

Title: Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability

Title: From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

Title: CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning

Title: I'll believe it when I see it: Images increase misinformation sharing in Vision-Language Models

Title: RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning

Title: GUARD: Generation-time LLM Unlearning via Adaptive Restriction and Detection

Title: Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges

Title: Contextual Paralinguistic Data Creation for Multi-Modal Speech-LLM: Data Condensation and Spoken QA Generation

Title: J4R: Learning to Judge with Equivalent Initial State Group Relative Preference Optimization

Title: Investigating the Vulnerability of LLM-as-a-Judge Architectures to Prompt-Injection Attacks

Title: Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning

Title: What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts

Title: Thinkless: LLM Learns When to Think

Title: R3: Robust Rubric-Agnostic Reward Models

Title: MR. Judge: Multimodal Reasoner as a Judge

Title: Granary: Speech Recognition and Translation Dataset in 25 European Languages

Title: AdaptThink: Reasoning Models Can Learn When to Think

Title: Dementia Through Different Eyes: Explainable Modeling of Human and LLM Perceptions for Early Awareness

Title: SMOTExT: SMOTE meets Large Language Models

Title: ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Title: CIE: Controlling Language Model Text Generations Using Continuous Signals