2025-08-12

Title: Semi-automated Fact-checking in Portuguese: Corpora Enrichment using Retrieval with Claim extraction

Title: Retrieval augmented generation based dynamic prompting for few-shot biomedical named entity recognition using large language models

Title: CarbonScaling: Extending Neural Scaling Laws for Carbon Footprint in Large Language Models

Title: The Art of Breaking Words: Rethinking Multilingual Tokenizer Design

Title: Factor Augmented Supervised Learning with Text Embeddings

Title: Discerning minds or generic tutors? Evaluating instructional guidance capabilities in Socratic LLMs

Title: LLM Unlearning Without an Expert Curated Dataset

Title: BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Title: Train It and Forget It: Merge Lists are Unnecessary for BPE Inference in Language Models

Title: Measuring Stereotype and Deviation Biases in Large Language Models

Title: Testing the Limits of Machine Translation from One Book

Title: Do Biased Models Have Biased Thoughts?

Title: Play Favorites: A Statistical Method to Measure Self-Bias in LLM-as-a-Judge

Title: Large Language Models for Oral History Understanding with Text Classification and Sentiment Analysis

Title: Many-Turn Jailbreaking

Title: SEVADE: Self-Evolving Multi-Agent Analysis with Decoupled Evaluation for Hallucination-Resistant Irony Detection

Title: Annotating Errors in English Learners' Written Language Production: Advancing Automated Written Feedback Systems

Title: The ReQAP System for Question Answering over Personal Information

Title: Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Title: Model-Agnostic Sentiment Distribution Stability Analysis for Robust LLM-Generated Texts Detection

Title: Two-Stage Quranic QA via Ensemble Retrieval and Instruction-Tuned Answer Extraction

Title: Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models

Title: Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings

Title: SEADialogues: A Multilingual Culturally Grounded Multi-turn Dialogue Dataset on Southeast Asian Languages

Title: BharatBBQ: A Multilingual Bias Benchmark for Question Answering in the Indian Context

Title: Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

Title: Investigating Intersectional Bias in Large Language Models using Confidence Disparities in Coreference Resolution

Title: Gradient Surgery for Safe LLM Fine-Tuning

Title: Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models

Title: Schema Lineage Extraction at Scale: Multilingual Pipelines, Composite Evaluation, and Language-Model Benchmarks

Title: DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention

Title: Adapting LLMs to Time Series Forecasting via Temporal Heterogeneity Modeling and Semantic Alignment

Title: Enhancing Rumor Detection Methods with Propagation Structure Infused Language Model

Title: How Does a Deep Neural Network Look at Lexical Stress?

Title: Prompt Tuning for Few-Shot Continual Learning Named Entity Recognition

Title: Incorporating Contextual Paralinguistic Understanding in Large Speech-Language Models

Title: MAQuA: Adaptive Question-Asking for Multidimensional Mental Health Screening using Item Response Theory

Title: "Pull or Not to Pull?'': Investigating Moral Biases in Leading Large Language Models Across Ethical Dilemmas

Title: Arce: Augmented Roberta with Contextualized Elucidations for Ner in Automated Rule Checking

Title: CCFQA: A Benchmark for Cross-Lingual and Cross-Modal Speech and Text Factuality Evaluation

Title: HealthBranches: Synthesizing Clinically-Grounded Question Answering Datasets via Decision Pathways

Title: ObfusQAte: A Proposed Framework to Evaluate LLM Robustness on Obfuscated Factual Question Answering

Title: Strategies of Code-switching in Human-Machine Dialogs

Title: Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Title: Grounding Multilingual Multimodal LLMs With Cultural Knowledge

Title: Let's Revise Step-by-Step: A Unified Local Search Framework for Code Generation with LLMs

Title: Positional Biases Shift as Inputs Approach Context Window Limits

Title: ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models

Title: Augmenting Bias Detection in LLMs Using Topological Data Analysis

Title: Word Clouds as Common Voices: LLM-Assisted Visualization of Participant-Weighted Themes in Qualitative Interviews

Title: From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

Title: IBPS: Indian Bail Prediction System

Title: Keyword-Centric Prompting for One-Shot Event Detection with Self-Generated Rationale Enhancements

Title: InterChart: Benchmarking Visual Reasoning Across Decomposed and Distributed Chart Information

Title: LoSemB: Logic-Guided Semantic Bridging for Inductive Tool Retrieval

Title: What am I missing here?: Evaluating Large Language Models for Masked Sentence Prediction

Title: Exploring Causal Effect of Social Bias on Faithfulness Hallucinations in Large Language Models

Title: SASST: Leveraging Syntax-Aware Chunking and LLMs for Simultaneous Speech Translation

Title: Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Title: Can You Trick the Grader? Adversarial Persuasion of LLM Judges

Title: Evaluating Large Language Models as Expert Annotators

Title: LLMs for Law: Evaluating Legal-Specific LLMs on Contract Understanding

Title: Large Language Models for Czech Aspect-Based Sentiment Analysis

Title: Tailored Emotional LLM-Supporter: Enhancing Cultural Sensitivity

Title: Expert Preference-based Evaluation of Automated Related Work Generation

Title: Large Language Models for Subjective Language Understanding: A Survey

Title: Understanding Syntactic Generalization in Structure-inducing Language Models

Title: Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Title: The Medical Metaphors Corpus (MCC)

Title: WideSearch: Benchmarking Agentic Broad Info-Seeking

Title: Progressive Depth Up-scaling via Optimal Transport

Title: 9th Workshop on Sign Language Translation and Avatar Technologies (SLTAT 2025)

Title: Dual Information Speech Language Models for Emotional Conversations

Title: Assessing LLM Text Detection in Educational Contexts: Does Human Contribution Affect Detection?

Title: Optimal Transport Regularization for Speech Text Alignment in Spoken Language Models

Title: Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models

Title: Data-Efficient Biomedical In-Context Learning: A Diversity-Enhanced Submodular Perspective

Title: REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation

Title: Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions

Title: Human-Alignment and Calibration of Inference-Time Uncertainty in Large Language Models

Title: SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling

Title: Capabilities of GPT-5 on Multimodal Medical Reasoning

Title: Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge

Title: Jinx: Unlimited LLMs for Probing Alignment Failures