2025-10-23

Title: Contextual Augmentation for Entity Linking using Large Language Models

Title: Small Language Models Offer Significant Potential for Science Community

Title: When Models Can't Follow: Testing Instruction Adherence Across 256 LLMs

Title: Transformer-Based Low-Resource Language Translation: A Study on Standard Bengali to Sylheti

Title: DuoLens: A Framework for Robust Detection of Machine-Generated Multilingual Text and Code

Title: Improving Topic Modeling of Social Media Short Texts with Rephrasing: A Case Study of COVID-19 Related Tweets

Title: Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection

Title: Context-aware Fairness Evaluation and Mitigation in LLMs

Title: Misinformation Detection using Large Language Models with Explainability

Title: Evaluating LLM Story Generation through Large-scale Network Analysis of Social Structures

Title: Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search

Title: ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Title: Dynamic Evaluation for Oversensitivity in LLMs

Title: Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues

Title: When Can We Trust LLMs in Mental Health? Large-Scale Benchmarks for Reliable LLM Evaluation

Title: From Memorization to Generalization: Fine-Tuning Large Language Models for Biomedical Term-to-Identifier Normalization

Title: That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation

Title: A Graph Signal Processing Framework for Hallucination Detection in Large Language Models

Title: Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges

Title: "You Are Rejected!": An Empirical Study of Large Language Models Taking Hiring Evaluations

Title: Think Straight, Stop Smart: Structured Reasoning for Efficient Multi-Hop RAG

Title: When Facts Change: Probing LLMs on Evolving Knowledge with evolveQA

Title: Interpretable Question Answering with Knowledge Graphs

Title: Multi-Faceted Evaluation of Tool-Augmented Dialogue Systems

Title: DiSRouter: Distributed Self-Routing for LLM Selections

Title: SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets

Title: Difficulty-Controllable Multiple-Choice Question Generation Using Large Language Models and Direct Preference Optimization

Title: TheMCPCompany: Creating General-purpose Agents with Task-specific Tools

Title: JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation

Title: HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

Title: Balancing Rewards in Text Summarization: Multi-Objective Reinforcement Learning via HyperVolume Optimization

Title: Slot Filling as a Reasoning Task for SpeechLLMs

Title: Algorithmic Fairness in NLP: Persona-Infused LLMs for Human-Centric Hate Speech Detection

Title: Local Obfuscation by GLINER for Impartial Context Aware Lineage: Development and evaluation of PII Removal system

Title: M3-SLU: Evaluating Speaker-Attributed Reasoning in Multimodal Large Language Models

Title: AgenticMath: Enhancing LLM Reasoning via Agentic-based Math Data Generation

Title: LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

Title: ToMMeR -- Efficient Entity Mention Detection from Large Language Models

Title: BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models

Title: VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Title: Machine Text Detectors are Membership Inference Attacks

Title: What is the Best Sequence Length for BABYLM?

Title: Lookahead Routing for Large Language Models

Title: Detecting Latin in Historical Books with Large Language Models: A Multimodal Benchmark

Title: PBBQ: A Persian Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models

Title: CrossNews-UA: A Cross-lingual News Semantic Similarity Benchmark for Ukrainian, Polish, Russian, and English

Title: Style Attack Disguise: When Fonts Become a Camouflage for Adversarial Intent

Title: LLavaCode: Compressed Code Representations for Retrieval-Augmented Code Generation

Title: Unraveling Emotions with Pre-Trained Models

Title: DiffAdapt: Difficulty-Adaptive Reasoning for Token-Efficient LLM Inference

Title: CoSense-LLM: Semantics at the Edge with Cost- and Uncertainty-Aware Cloud-Edge Cooperation

Title: Are Large Language Models Sensitive to the Motives Behind Communication?

Title: Do Prompts Reshape Representations? An Empirical Study of Prompting Effects on Embeddings

Title: From Answers to Guidance: A Proactive Dialogue System for Legal Documents

Title: Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning

Title: SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration

Title: AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Title: Adapting Multilingual Models to Code-Mixed Tasks via Model Merging

Title: ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers

Title: The Art of Asking: Multilingual Prompt Optimization for Synthetic Data

Title: Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning

Title: Hubble: a Model Suite to Advance the Study of LLM Memorization