2025-08-18

Title: A2HCoder: An LLM-Driven Coding Agent for Hierarchical Algorithm-to-HDL Translation

Title: PersonaTwin: A Multi-Tier Prompt Conditioning Framework for Generating and Evaluating Personalized Digital Twins

Title: gpt-oss-120b & gpt-oss-20b Model Card

Title: Modeling and Detecting Company Risks from News: A Case Study in Bloomberg News

Title: Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

Title: Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling

Title: SproutBench: A Benchmark for Safe and Ethical Large Language Models for Youth

Title: Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics

Title: Hell or High Water: Evaluating Agentic Recovery from External Failures

Title: BIPOLAR: Polarization-based granular framework for LLM bias evaluation

Title: Approaching the Source of Symbol Grounding with Confluent Reductions of Abstract Meaning Representation Directed Graphs

Title: Towards Reliable Multi-Agent Systems for Marketing Applications via Reflection, Memory, and Planning

Title: MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents

Title: MobQA: A Benchmark Dataset for Semantic Understanding of Human Mobility Data through Question Answering

Title: Personalized Distractor Generation via MCTS-Guided Reasoning Reconstruction

Title: Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering

Title: UNVEILING: What Makes Linguistics Olympiad Puzzles Tricky for LLMs?

Title: LETToT: Label-Free Evaluation of Large Language Models On Tourism Using Expert Tree-of-Thought

Title: ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection

Title: AI in Mental Health: Emotional and Sentiment Analysis of Large Language Models' Responses to Depression, Anxiety, and Stress Queries

Title: SafeConstellations: Steering LLM Safety to Reduce Over-Refusals Through Task-Specific Trajectory

Title: SGSimEval: A Comprehensive Multifaceted and Similarity-Enhanced Benchmark for Automatic Survey Generation Systems

Title: LLM Compression: How Far Can We Go in Balancing Size and Performance?

Title: SpecDetect: Simple, Fast, and Training-Free Detection of LLM-Generated Text via Spectral Analysis

Title: Feedback Indicators: The Alignment between Llama and a Teacher in Language Learning

Title: When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Title: Retrieval-augmented reasoning with lean language models

Title: Survey-to-Behavior: Downstream Alignment of Human Values in LLMs via Survey Questions

Title: HumorPlanSearch: Structured Planning and HuCoT for Contextual AI Humor

Title: Online Anti-sexist Speech: Identifying Resistance to Gender Bias in Political Discourse

Title: Reference Points in LLM Sentiment Analysis: The Role of Structured Context

Title: Speciesism in AI: Evaluating Discrimination Against Animals in Large Language Models

Title: Language models align with brain regions that represent concepts across modalities

Title: AgentMental: An Interactive Multi-Agent Framework for Explainable and Adaptive Mental Health Assessment

Title: Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models

Title: Dataset Creation for Visual Entailment using Generative AI

Title: TinyTim: A Family of Language Models for Divergent Generation