2025-07-23

Title: Decentralized AI-driven IoT Architecture for Privacy-Preserving and Latency-Optimized Healthcare in Pandemic and Critical Care Scenarios

Title: Quantifying Holistic Review: A Multi-Modal Approach to College Admissions Prediction

Title: eSapiens's DEREK Module: Deep Extraction & Reasoning Engine for Knowledge with LLMs

Title: RDMA: Cost Effective Agent-Driven Rare Disease Discovery within Electronic Health Record Systems

Title: Small Edits, Big Consequences: Telling Good from Bad Robustness in Large Language Models

Title: Document Haystack: A Long Context Multimodal Image/Document Understanding Vision LLM Benchmark

Title: Prompt Smart, Pay Less: Cost-Aware APO for Real-World Applications

Title: PAT++: a cautionary tale about generative visual augmentation for Object Re-identification

Title: ReDi: Rectified Discrete Flow

Title: Towards Mitigation of Hallucination for LLM-empowered Agents: Progressive Generalization Bound Exploration and Watchdog Monitor

Title: Foundation Models and Transformers for Anomaly Detection: A Survey

Title: Towards Reliable, Uncertainty-Aware Alignment

Title: Dual Turing Test: A Framework for Detecting and Mitigating Undetectable AI

Title: An empirical study for the early detection of Mpox from skin lesion images using pretrained CNN models leveraging XAI technique

Title: HyDRA: A Hybrid-Driven Reasoning Architecture for Verifiable Knowledge Graphs

Title: On the transferability of Sparse Autoencoders for interpreting compressed models

Title: BACFuzz: Exposing the Silence on Broken Access Control Vulnerabilities in Web Applications

Title: Semantic-Aware Gaussian Process Calibration with Structured Layerwise Kernels for Deep Neural Networks

Title: "We Need a Standard": Toward an Expert-Informed Privacy Label for Differential Privacy

Title: Enhancing Hindi NER in Low Context: A Comparative study of Transformer-based models with vs. without Retrieval Augmentation

Title: Learning without training: The implicit dynamics of in-context learning

Title: FW-VTON: Flattening-and-Warping for Person-to-Person Virtual Try-on

Title: Is Tracking really more challenging in First Person Egocentric Vision?

Title: Artifacts and Attention Sinks: Structured Approximations for Efficient Vision Transformers

Title: Disrupting Semantic and Abstract Features for Better Adversarial Transferability

Title: AutoMeet: a proof-of-concept study of genAI to automate meetings in automotive engineering

Title: MFAz: Historical Access Based Multi-Factor Authorization

Title: Deep Researcher with Test-Time Diffusion

Title: The Prompt Makes the Person(a): A Systematic Evaluation of Sociodemographic Persona Prompting for Large Language Models

Title: Efficient Compositional Multi-tasking for On-device Large Language Models

Title: Improving Personalized Image Generation through Social Context Feedback

Title: Stop-band Energy Constraint for Orthogonal Tunable Wavelet Units in Convolutional Neural Networks for Computer Vision problems

Title: PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation

Title: Universal Wavelet Units in 3D Retinal Layer Segmentation

Title: DP2Guard: A Lightweight and Byzantine-Robust Privacy-Preserving Federated Learning Scheme for Industrial IoT

Title: Learning Patient-Specific Spatial Biomarker Dynamics via Operator Learning for Alzheimer's Disease Progression

Title: LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation

Title: AMMNet: An Asymmetric Multi-Modal Network for Remote Sensing Semantic Segmentation

Title: Attacking interpretable NLP systems

Title: AtrousMamaba: An Atrous-Window Scanning Visual State Space Model for Remote Sensing Change Detection

Title: LLM Data Selection and Utilization via Dynamic Bi-level Optimization

Title: EBaReT: Expert-guided Bag Reward Transformer for Auto Bidding

Title: Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task

Title: WakenLLM: A Fine-Grained Benchmark for Evaluating LLM Reasoning Potential and Reasoning Process Stability

Title: RealBench: Benchmarking Verilog Generation Models with Real-World IP Designs

Title: SVAgent: AI Agent for Hardware Security Verification Assertion

Title: METER: Multi-modal Evidence-based Thinking and Explainable Reasoning -- Algorithm and Benchmark

Title: Advancing Visual Large Language Model for Multi-granular Versatile Perception

Title: Towards Compute-Optimal Many-Shot In-Context Learning

Title: Positive Style Accumulation: A Style Screening and Continuous Utilization Framework for Federated DG-ReID

Title: eX-NIDS: A Framework for Explainable Network Intrusion Detection Leveraging Large Language Models

Title: FinResearchBench: A Logic Tree based Agent-as-a-Judge Evaluation Framework for Financial Research Agents

Title: HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery

Title: Efficient RL for optimizing conversation level outcomes with an LLM-based tutor

Title: Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective

Title: Quality Text, Robust Vision: The Role of Language in Enhancing Visual Robustness of Vision-Language Models

Title: ToFe: Lagged Token Freezing and Reusing for Efficient Vision Transformer Inference

Title: iShumei-Chinchunmei at SemEval-2025 Task 4: A balanced forgetting and retention multi-task framework using effective unlearning loss

Title: Beyond Isolated Dots: Benchmarking Structured Table Construction as Deep Knowledge Extraction

Title: Reducing GPU Memory Fragmentation via Spatio-Temporal Planning for Efficient Large-Scale Model Training

Title: From Contracts to Code: Automating Smart Contract Generation with Multi-Level Finite State Machines

Title: Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks

Title: MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks

Title: Language Detection by Means of the Minkowski Norm: Identification Through Character Bigrams and Frequency Analysis

Title: Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition

Title: Dens3R: A Foundation Model for 3D Geometry Prediction

Title: Talking Like a Phisher: LLM-Based Attacks on Voice Phishing Classifiers

Title: Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning

Title: Perovskite-R1: A Domain-Specialized LLM for Intelligent Discovery of Precursor Additives and Experimental Design

Title: M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision

Title: DREAM: Scalable Red Teaming for Text-to-Image Generative Systems via Distribution Modeling

Title: Scene Text Detection and Recognition "in light of" Challenging Environmental Conditions using Aria Glasses Egocentric Vision Cameras

Title: Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

Title: One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Iterative Prompt Evolution

Title: Navigating Large-Pose Challenge for High-Fidelity Face Reenactment with Video Diffusion Model

Title: Mamba-OTR: a Mamba-based Solution for Online Take and Release Detection from Untrimmed Egocentric Video

Title: The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation

Title: Leveraging Personalized PageRank and Higher-Order Topological Structures for Heterophily Mitigation in Graph Neural Networks

Title: Bipartite Patient-Modality Graph Learning with Event-Conditional Modelling of Censoring for Cancer Survival Prediction

Title: Depth Gives a False Sense of Privacy: LLM Internal States Inversion

Title: From Flat to Round: Redefining Brain Decoding with Surface-Based fMRI and Cortex Structure

Title: Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?

Title: ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement

Title: Sparse-View 3D Reconstruction: Recent Advances and Open Challenges

Title: GG-BBQ: German Gender Bias Benchmark for Question Answering

Title: Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox

Title: Combined Image Data Augmentations diminish the benefits of Adaptive Label Smoothing

Title: Robust Noisy Pseudo-label Learning for Semi-supervised Medical Image Segmentation Using Diffusion Model

Title: Dutch CrowS-Pairs: Adapting a Challenge Dataset for Measuring Social Biases in Language Models for Dutch

Title: Towards Enforcing Company Policy Adherence in Agentic Workflows

Title: ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs

Title: Canonical Correlation Patterns for Validating Clustering of Multivariate Time Series

Title: PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens

Title: The Ever-Evolving Science Exam

Title: C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning

Title: Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models

Title: Learning Text Styles: A Study on Transfer, Attribution, and Verification

Title: confopt: A Library for Implementation and Evaluation of Gradient-based One-Shot NAS Methods

Title: EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Title: Symbolic Graph Intelligence: Hypervector Message Passing for Learning Graph-Level Patterns with Tsetlin Machines

Title: Explainable Vulnerability Detection in C/C++ Using Edge-Aware Graph Attention Networks

Title: A Comprehensive Data-centric Overview of Federated Graph Learning

Title: Optimization of DNN-based HSI Segmentation FPGA-based SoC for ADS: A Practical Approach

Title: Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language

Title: Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge

Title: Pixels to Principles: Probing Intuitive Physics Understanding in Multimodal Language Models

Title: From Text to Actionable Intelligence: Automating STIX Entity and Relationship Extraction

Title: Scaling Linear Attention with Sparse State Expansion

Title: LLMxCPG: Context-Aware Vulnerability Detection Through Code Property Graph-Guided Large Language Models

Title: CTSL: Codebook-based Temporal-Spatial Learning for Accurate Non-Contrast Cardiac Risk Prediction Using Cine MRIs

Title: Automatic Fine-grained Segmentation-assisted Report Generation

Title: A2Mamba: Attention-augmented State Space Models for Visual Recognition

Title: Step-Audio 2 Technical Report

Title: Towards Automated Regulatory Compliance Verification in Financial Auditing with Large Language Models

Title: P-CoT: A Pedagogically-motivated Participatory Chain-of-Thought Prompting for Phonological Reasoning in LLMs

Title: Synthetic Data Matters: Re-training with Geo-typical Synthetic Labels for Building Detection

Title: Meta-Learning for Cold-Start Personalization in Prompt-Tuned LLMs

Title: GASPnet: Global Agreement to Synchronize Phases

Title: Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers

Title: PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization

Title: Interpretable Topic Extraction and Word Embedding Learning using row-stochastic DEDICOM

Title: Advancing Risk and Quality Assurance: A RAG Chatbot for Improved Regulatory Compliance

Title: Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation

Title: Temporally-Constrained Video Reasoning Segmentation and Automated Benchmark Construction

Title: HarmonPaint: Harmonized Training-Free Diffusion Inpainting

Title: DFR: A Decompose-Fuse-Reconstruct Framework for Multi-Modal Few-Shot Segmentation

Title: Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption

Title: CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation

Title: Faithful, Interpretable Chest X-ray Diagnosis with Anti-Aliased B-cos Networks

Title: When LLMs Copy to Think: Uncovering Copy-Guided Attacks in Reasoning LLMs

Title: Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Title: Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Title: AUTOPSY: A Framework for Tackling Privacy Challenges in the Automotive Industry

Title: Enhancing Domain Diversity in Synthetic Data Face Recognition with Dataset Fusion

Title: Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning

Title: Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent

Title: Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning

Title: LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs

Title: MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Title: ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning