2025-07-10

Title: Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Title: Humans overrely on overconfident language models, across languages

Title: ETT: Expanding the Long Context Understanding Capability of LLMs at Test-Time

Title: Could the Road to Grounded, Neuro-symbolic AI be Paved with Words-as-Classifiers?

Title: Evaluating Morphological Alignment of Tokenizers in 70 Languages

Title: PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning

Title: Reward Models Can Improve Themselves: Reward-Guided Adversarial Failure Mode Discovery for Robust Reward Modeling

Title: Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders

Title: Perception-Aware Policy Optimization for Multimodal Reasoning

Title: A Semantic Parsing Framework for End-to-End Time Normalization

Title: A Systematic Analysis of Hybrid Linear Attention

Title: On the Robustness of Verbal Confidence of LLMs in Adversarial Attacks

Title: Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings

Title: SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers

Title: InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior

Title: Large Language Model for Extracting Complex Contract Information in Industrial Scenes

Title: The Flaws of Others: An LLM-driven Framework for Scientific Knowledge Production

Title: Enhancing Food-Domain Question Answering with a Multimodal Knowledge Graph: Hybrid QA Generation and Diversity Analysis

Title: Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Title: FuDoBa: Fusing Document and Knowledge Graph-based Representations with Bayesian Optimisation

Title: Expediting data extraction using a large language model (LLM) and scoping review protocol: a methodological study within a complex scoping review

Title: Elite Polarization in European Parliamentary Speeches: a Novel Measurement Approach Using Large Language Models

Title: CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs

Title: On the Effect of Uncertainty on Layer-wise Inference Dynamics

Title: Checklist Engineering Empowers Multilingual LLM Judges

Title: Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining: Method, Evaluation and Applications

Title: Text to model via SysML: Automated generation of dynamical system computational models from unstructured natural language text via enhanced System Modeling Language diagrams

Title: Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework

Title: Shifting from Ranking to Set Selection for Retrieval Augmented Generation

Title: Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights

Title: SCoRE: Streamlined Corpus-based Relation Extraction using Multi-Label Contrastive Learning and Bayesian kNN

Title: VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation

Title: MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection

Title: MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction

Title: Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues

Title: Rethinking Verification for LLM Code Generation: From Generation to Testing

Title: Investigating the Robustness of Retrieval-Augmented Generation at the Query Level

Title: FlexOlmo: Open Language Models for Flexible Data Use

Title: UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations

Title: Discrete Diffusion Models for Language Generation