2025-08-13

Title: Argument Quality Annotation and Gender Bias Detection in Financial Communication through Large Language Models

Title: TurQUaz at CheckThat! 2025: Debating Large Language Models for Scientific Web Discourse Detection

Title: Heartificial Intelligence: Exploring Empathy in Language Models

Title: TT-XAI: Trustworthy Clinical Text Explanations via Keyword Distillation and LLM Reasoning

Title: Distilling Knowledge from Large Language Models: A Concept Bottleneck Model for Hate and Counter Speech Recognition

Title: MLLM-CBench: A Comprehensive Benchmark for Continual Instruction Tuning of Multimodal LLMs with Chain-of-Thought Reasoning Analysis

Title: Evaluating Contrast Localizer for Identifying Causal Units in Social & Mathematical Tasks in Language Models

Title: Objective Metrics for Evaluating Large Language Models Using External Data Sources

Title: MinionsLLM: a Task-adaptive Framework For The Training and Control of Multi-Agent Systems Through Natural Language

Title: The Illusion of Progress: Re-evaluating Hallucination Detection in LLMs

Title: Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions

Title: Putnam-AXIOM: A Functional and Static Benchmark

Title: CoDAE: Adapting Large Language Models for Education via Chain-of-Thought Data Augmentation

Title: Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Title: Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment

Title: Enhancing Small LLM Alignment through Margin-Based Objective Modifications under Resource Constraints

Title: Momentum Point-Perplexity Mechanics in Large Language Models

Title: Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression

Title: DeCAL Tokenwise Compression

Title: DepressLLM: Interpretable domain-adapted language model for depression detection from real-world narratives

Title: Optimizing Retrieval-Augmented Generation (RAG) for Colloquial Cantonese: A LoRA-Based Systematic Review

Title: InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

Title: Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents

Title: LLaMA-Based Models for Aspect-Based Sentiment Analysis

Title: UWB at WASSA-2024 Shared Task 2: Cross-lingual Emotion Detection

Title: Prompt-Based Approach for Czech Sentiment Analysis

Title: LLM driven Text-to-Table Generation through Sub-Tasks Guidance and Iterative Refinement

Title: TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation

Title: Out of the Box, into the Clinic? Evaluating State-of-the-Art ASR for Clinical Applications for Older Adults

Title: A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models

Title: IROTE: Human-like Traits Elicitation of Large Language Model via In-Context Self-Reflective Optimization

Title: Magical: Medical Lay Language Generation via Semantic Invariance and Layperson-tailored Adaptation

Title: SciRerankBench: Benchmarking Rerankers Towards Scientific Retrieval-Augmented Generated LLMs

Title: DevNous: An LLM-Based Multi-Agent System for Grounding IT Project Management in Unstructured Conversation

Title: Privacy-protected Retrieval-Augmented Generation for Knowledge Graph Question Answering

Title: Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Title: TiMoE: Time-Aware Mixture of Language Experts

Title: An Investigation of Robustness of LLMs in Mathematical Reasoning: Benchmarking with Mathematically-Equivalent Transformation of Advanced Mathematical Problems

Title: Steering Towards Fairness: Mitigating Political Bias in LLMs

Title: BiasGym: Fantastic Biases and How to Find (and Remove) Them

Title: Entangled in Representations: Mechanistic Investigation of Cultural Biases in Large Language Models

Title: ASPD: Unlocking Adaptive Serial-Parallel Decoding by Exploring Intrinsic Parallelism in LLMs

Title: Reveal-Bangla: A Dataset for Cross-Lingual Multi-Step Reasoning Evaluation

Title: Train Long, Think Short: Curriculum Learning for Efficient Reasoning

Title: Jointly Generating and Attributing Answers using Logits of Document-Identifier Tokens

Title: Retrospective Sparse Attention for Efficient Long-Context Generation

Title: LyS at SemEval 2025 Task 8: Zero-Shot Code Generation for Tabular QA

Title: A Survey on Training-free Alignment of Large Language Models

Title: LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback

Title: MVISU-Bench: Benchmarking Mobile Agents for Real-World Tasks by Multi-App, Vague, Interactive, Single-App and Unethical Instructions

Title: READER: Retrieval-Assisted Drafter for Efficient LLM Inference

Title: Utilizing Multilingual Encoders to Improve Large Language Models for Low-Resource Languages

Title: AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

Title: SinLlama - A Large Language Model for Sinhala

Title: OdysseyBench: Evaluating LLM Agents on Long-Horizon Complex Office Application Workflows

Title: Complex Logical Instruction Generation

Title: Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models