2025-07-23

Title: eSapiens's DEREK Module: Deep Extraction & Reasoning Engine for Knowledge with LLMs

Title: Small Edits, Big Consequences: Telling Good from Bad Robustness in Large Language Models

Title: Enhancing Hindi NER in Low Context: A Comparative study of Transformer-based models with vs. without Retrieval Augmentation

Title: Learning without training: The implicit dynamics of in-context learning

Title: Help Me Write a Story: Evaluating LLMs' Ability to Generate Writing Feedback

Title: mRAKL: Multilingual Retrieval-Augmented Knowledge Graph Construction for Low-Resourced Languages

Title: AutoMeet: a proof-of-concept study of genAI to automate meetings in automotive engineering

Title: Deep Researcher with Test-Time Diffusion

Title: The Prompt Makes the Person(a): A Systematic Evaluation of Sociodemographic Persona Prompting for Large Language Models

Title: Efficient Compositional Multi-tasking for On-device Large Language Models

Title: Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task

Title: WakenLLM: A Fine-Grained Benchmark for Evaluating LLM Reasoning Potential and Reasoning Process Stability

Title: Towards Compute-Optimal Many-Shot In-Context Learning

Title: FinResearchBench: A Logic Tree based Agent-as-a-Judge Evaluation Framework for Financial Research Agents

Title: Efficient RL for optimizing conversation level outcomes with an LLM-based tutor

Title: iShumei-Chinchunmei at SemEval-2025 Task 4: A balanced forgetting and retention multi-task framework using effective unlearning loss

Title: Beyond Isolated Dots: Benchmarking Structured Table Construction as Deep Knowledge Extraction

Title: Language Detection by Means of the Minkowski Norm: Identification Through Character Bigrams and Frequency Analysis

Title: SpeLLM: Character-Level Multi-Head Decoding

Title: Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

Title: GG-BBQ: German Gender Bias Benchmark for Question Answering

Title: PromptAL: Sample-Aware Dynamic Soft Prompts for Few-Shot Active Learning

Title: Dutch CrowS-Pairs: Adapting a Challenge Dataset for Measuring Social Biases in Language Models for Dutch

Title: Towards Enforcing Company Policy Adherence in Agentic Workflows

Title: ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs

Title: Combining Language and Topic Models for Hierarchical Text Classification

Title: Learning Text Styles: A Study on Transfer, Attribution, and Verification

Title: Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language

Title: Pixels to Principles: Probing Intuitive Physics Understanding in Multimodal Language Models

Title: Step-Audio 2 Technical Report

Title: Towards Automated Regulatory Compliance Verification in Financial Auditing with Large Language Models

Title: P-CoT: A Pedagogically-motivated Participatory Chain-of-Thought Prompting for Phonological Reasoning in LLMs

Title: Self-Contradiction as Self-Improvement: Mitigating the Generation-Understanding Gap in MLLMs

Title: PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization

Title: Advancing Risk and Quality Assurance: A RAG Chatbot for Improved Regulatory Compliance

Title: RAVine: Reality-Aligned Evaluation for Agentic Search

Title: Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Title: Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent

Title: Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning

Title: LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs