2026-03-25

Title: Founder effects shape the evolutionary dynamics of multimodality in open LLM families

Title: Evaluating Prompting Strategies for Chart Question Answering with Large Language Models

Title: MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

Title: Less is More: Adapting Text Embeddings for Low-Resource Languages with Small Scale Noisy Synthetic Data

Title: Evaluating Large Language Models' Responses to Sexual and Reproductive Health Queries in Nepali

Title: TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

Title: Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks

Title: Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs

Title: Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores

Title: Scaling Attention via Feature Sparsity

Title: Latent Semantic Manifolds in Large Language Models

Title: Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models

Title: Mitigating Premature Discretization with Progressive Quantization for Robust Vector Tokenization

Title: CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News

Title: Full waveform inversion method based on diffusion model

Title: UniFluids: Unified Neural Operator Learning with Conditional Flow-matching

Title: Emergency Preemption Without Online Exploration: A Decision Transformer Approach

Title: ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography

Title: A graph neural network based chemical mechanism reduction method for combustion applications

Title: Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge

Title: From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs

Title: AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations

Title: A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life

Title: DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression

Title: Hybrid Associative Memories

Title: Beyond the Mean: Distribution-Aware Loss Functions for Bimodal Regression

Title: Trained Persistent Memory for Frozen Decoder-Only LLMs

Title: Large Language Models for Missing Data Imputation: Understanding Behavior, Hallucination Effects, and Control Mechanisms

Title: T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Title: Cloud-Edge Collaborative Large Models for Robust Photovoltaic Power Forecasting

Title: COMPASS-Hedge: Learning Safely Without Knowing the World

Title: Unveiling the Mechanism of Continuous Representation Full-Waveform Inversion: A Wave Based Neural Tangent Kernel Framework

Title: MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives

Title: Q-AGNN: Quantum-Enhanced Attentive Graph Neural Network for Intrusion Detection

Title: FAAR: Format-Aware Adaptive Rounding for NVFP4

Title: Three Creates All: You Only Sample 3 Steps

Title: Symbolic Graph Networks for Robust PDE Discovery from Noisy Sparse Data

Title: Spatially-Aware Evaluation Framework for Aerial LiDAR Point Cloud Semantic Segmentation: Distance-Based Metrics on Challenging Regions

Title: OsteoFlow: Lyapunov-Guided Flow Distillation for Predicting Bone Remodeling after Mandibular Reconstruction

Title: Neural Structure Embedding for Symbolic Regression via Continuous Structure Search and Coefficient Optimization

Title: Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

Title: mmFHE: mmWave Sensing with End-to-End Fully Homomorphic Encryption

Title: Architecture-Derived CBOMs for Cryptographic Migration: A Security-Aware Architecture Tradeoff Method

Title: Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Title: Static Scene Reconstruction from Dynamic Egocentric Videos

Title: MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Title: LLM-guided headline rewriting for clickability enhancement without clickbait

Title: A Theoretical Framework for Energy-Aware Gradient Pruning in Federated Learning

Title: Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures

Title: Model Context Protocol Threat Modeling and Analyzing Vulnerabilities to Prompt Injection with Tool Poisoning

Title: Tiny Inference-Time Scaling with Latent Verifiers

Title: Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning

Title: OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection

Title: Sketch2CT: Multimodal Diffusion for Structure-Aware 3D Medical Volume Generation

Title: CTF as a Service: A reproducible and scalable infrastructure for cybersecurity training

Title: Adversarial Vulnerabilities in Neural Operator Digital Twins: Gradient-Free Attacks on Nuclear Thermal-Hydraulic Surrogates

Title: UrbanVGGT: Scalable Sidewalk Width Estimation from Street View Images

Title: Generalized multi-object classification and tracking with sparse feature resonator networks

Title: MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data

Title: CanViT: Toward Active-Vision Foundation Models

Title: FullCircle: Effortless 3D Reconstruction from Casual 360$^\circ$ Captures

Title: CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context

Title: STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving

Title: Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

Title: A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Title: Precision-Varying Prediction (PVP): Robustifying ASR systems against adversarial attacks

Title: Language Models Can Explain Visual Features via Steering

Title: Semi-Automated Threat Modeling of Cloud-Based Systems Through Extracting Software Architecture from Configuration and Network Flow

Title: Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off

Title: BioShield: A Context-Aware Firewall for Securing Bio-LLMs

Title: A Vision Language Model for Generating Procedural Plant Architecture Representations from Simulated Images

Title: To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models

Title: Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion

Title: PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis

Title: LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation

Title: Multi-Method Validation of Large Language Model Medical Translation Across High- and Low-Resource Languages

Title: Pretext Matters: An Empirical Study of SSL Methods in Medical Imaging

Title: Bounding Box Anomaly Scoring for simple and efficient Out-of-Distribution detection

Title: Improving LLM Predictions via Inter-Layer Structural Encoders

Title: GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning

Title: WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment

Title: TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation

Title: Synthetic or Authentic? Building Mental Patient Simulators from Longitudinal Evidence

Title: Detecting Non-Membership in LLM Training Data via Rank Correlations

Title: Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics

Title: Does Teaming-Up LLMs Improve Secure Code Generation? A Comprehensive Evaluation with Multi-LLMSecCodeEval

Title: Spiking Personalized Federated Learning for Brain-Computer Interface-Enabled Immersive Communication

Title: How Utilitarian Are OpenAI's Models Really? Replicating and Reinterpreting Pfeffer, Krügel, and Uhl (2025)

Title: SOUPLE: Enhancing Audio-Visual Localization and Segmentation with Learnable Prompt Contexts

Title: Explanation Generation for Contradiction Reconciliation with LLMs

Title: Multitask-Informed Prior for In-Context Learning on Tabular Data: Application to Steel Property Prediction

Title: CIPL: A Target-Independent Framework for Channel-Inversion Privacy Leakage in Agents

Title: PRISM: A Dual View of LLM Reasoning through Semantic Flow and Latent Computation

Title: MVPBench: A Multi-Video Perception Evaluation Benchmark for Multi-Modal Video Understanding

Title: Multimodal Industrial Anomaly Detection via Geometric Prior

Title: ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding

Title: DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona

Title: From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery

Title: From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips

Title: Explainable Threat Attribution for IoT Networks Using Conditional SHAP and Flow Behavior Modelling

Title: Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems

Title: Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models

Title: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

Title: It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal

Title: Span Modeling for Idiomaticity and Figurative Language Detection with Span Contrastive Loss

Title: Transformers Trained via Gradient Descent Can Provably Learn a Class of Teacher Models

Title: Combinatorial Privacy: Private Multi-Party Bitstream Grand Sum by Hiding in Birkhoff Polytopes

Title: Universal and efficient graph neural networks with dynamic attention for machine learning interatomic potentials

Title: Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration

Title: Focus, Don't Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding

Title: TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Title: MVRD-Bench: Multi-View Learning and Benchmarking for Dynamic Remote Photoplethysmography under Occlusion

Title: Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts

Title: UAV-DETR: DETR for Anti-Drone Target Detection

Title: Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Title: Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction

Title: Agent Audit: A Security Analysis System for LLM Agent Applications

Title: Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer

Title: The Coordinate System Problem in Persistent Structural Memory for Neural Architectures

Title: Agent-Sentry: Bounding LLM Agents via Execution Provenance

Title: ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance

Title: TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration

Title: Balancing Safety and Efficiency in Aircraft Health Diagnosis: A Task Decomposition Framework with Heterogeneous Long-Micro Scale Cascading and Knowledge Distillation-based Interpretability

Title: VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

Title: SLARM: Streaming and Language-Aligned Reconstruction Model for Dynamic Scenes

Title: EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction

Title: ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

Title: When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse

Title: EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Title: Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion

Title: SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

Title: FixationFormer: Direct Utilization of Expert Gaze Trajectories for Chest X-Ray Classification

Title: Caption Generation for Dongba Paintings via Prompt Learning and Semantic Fusion

Title: Weak-PDE-Net: Discovering Open-Form PDEs via Differentiable Symbolic Networks and Weak Formulation

Title: Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report

Title: Asymptotic Learning Curves for Diffusion Models with Random Features Score and Manifold Data

Title: Few-Shot Generative Model Adaption via Identity Injection and Preservation

Title: Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees

Title: Beyond Theoretical Bounds: Empirical Privacy Loss Calibration for Text Rewriting Under Local Differential Privacy

Title: WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion

Title: How Far Should We Need to Go: Evaluate Provenance-based Intrusion Detection Systems in Industrial Scenarios

Title: Can Graph Foundation Models Generalize Over Architecture?

Title: Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation

Title: A Critical Review on the Effectiveness and Privacy Threats of Membership Inference Attacks

Title: Robustness Quantification and Uncertainty Quantification: Comparing Two Methods for Assessing the Reliability of Classifier Predictions

Title: VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models

Title: Multi-User Multi-Key Image Steganography with Key Isolation

Title: AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

Title: Zero-Shot Personalization of Objects via Textual Inversion

Title: RTS-ABAC: Real-Time Server-Aided Attribute-Based Authorization & Access Control for Substation Automation Systems

Title: A Sobering Look at Tabular Data Generation via Probabilistic Circuits

Title: Concept-based explanations of Segmentation and Detection models in Natural Disaster Management

Title: Cog3DMap: Multi-View Vision-Language Reasoning with 3D Cognitive Maps

Title: Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation

Title: Generative Event Pretraining with Foundation Model Alignment

Title: Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment

Title: YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception

Title: HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling

Title: Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

Title: Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution

Title: MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding

Title: MsFormer: Enabling Robust Predictive Maintenance Services for Industrial Devices

Title: Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards

Title: When Language Models Lose Their Mind: The Consequences of Brain Misalignment

Title: SpecXMaster Technical Report

Title: NeuroSeg Meets DINOv3: Transferring 2D Self-Supervised Visual Priors to 3D Neuron Segmentation via DINOv3 Initialization

Title: AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection

Title: Automatic Segmentation of 3D CT scans with SAM2 using a zero-shot approach

Title: TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches

Title: SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions

Title: PiCo: Active Manifold Canonicalization for Robust Robotic Visual Anomaly Detection

Title: 3rd Place of MeViS-Audio Track of the 5th PVUW: VIRST-Audio

Title: InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Title: A Bayesian Learning Approach for Drone Coverage Network: A Case Study on Cardiac Arrest in Scotland

Title: HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature

Title: DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Title: Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy

Title: VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution

Title: GSwap: Realistic Head Swapping with Dynamic Neural Gaussian Field

Title: Robust Safety Monitoring of Language Models via Activation Watermarking

Title: From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service

Title: Gimbal360: Differentiable Auto-Leveling for Canonicalized $360^\circ$ Panoramic Image Completion

Title: ViKey: Enhancing Temporal Understanding in Videos via Visual Prompting

Title: Gaze-Regularized VLMs for Ego-Centric Behavior Understanding

Title: Privacy-Aware Smart Cameras: View Coverage via Socially Responsible Coordination

Title: Sparser, Faster, Lighter Transformer Language Models

Title: FDIF: Formula-Driven supervised Learning with Implicit Functions for 3D Medical Image Segmentation

Title: Gaze-Regularized Vision-Language-Action Models for Robotic Manipulation

Title: Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

Title: General Machine Learning: Theory for Learning Under Variable Regimes

Title: PRETTINESS -- Privacy pResErving aTTrIbute maNagEment SyStem

Title: Gyokuro: Source-assisted Private Membership Testing using Trusted Execution Environments

Title: I Came, I Saw, I Explained: Benchmarking Multimodal LLMs on Figurative Meaning in Memes

Title: The Power of Power Codes: New Classes of Easy Instances for the Linear Equivalence Problem

Title: GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL

Title: GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models

Title: Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguistic Style in English and Arabic with Six Large Language Models

Title: On the Vulnerability of FHE Computation to Silent Data Corruption

Title: Permutation-Symmetrized Diffusion for Unconditional Molecular Generation

Title: SynForceNet: A Force-Driven Global-Local Latent Representation Framework for Lithium-Ion Battery Fault Diagnosis

Title: SafeSeek: Universal Attribution of Safety Circuits in Language Models

Title: Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

Title: Multi-Modal Image Fusion via Intervention-Stable Feature Learning

Title: CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection

Title: A Comparative Study of Machine Learning Models for Hourly Forecasting of Air Temperature and Relative Humidity

Title: Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning

Title: Steering LLMs for Culturally Localized Generation

Title: Security Barriers to Trustworthy AI-Driven Cyber Threat Intelligence in Finance: Evidence from Practitioners

Title: Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression

Title: Robustness Quantification for Discriminative Models: a New Robustness Metric and its Application to Dynamic Classifier Selection

Title: WISTERIA: Weak Implicit Signal-based Temporal Relation Extraction with Attention

Title: ViBe: Ultra-High-Resolution Video Synthesis Born from Pure Images

Title: An Explainable AI-Driven Framework for Automated Brain Tumor Segmentation Using an Attention-Enhanced U-Net

Title: FHAvatar: Fast and High-Fidelity Reconstruction of Face-and-Hair Composable 3D Head Avatar from Few Casual Captures

Title: What a Mesh: Formal Security Analysis of WPA3 SAE Wireless Authentication

Title: Off-Policy Value-Based Reinforcement Learning for Large Language Models

Title: Central Dogma Transformer III: Interpretable AI Across DNA, RNA, and Protein

Title: Object Pose Transformer: Unifying Unseen Object Pose Estimation

Title: ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Title: FG-Portrait: 3D Flow Guided Editable Portrait Animation

Title: From Feature Learning to Spectral Basis Learning: A Unifying and Flexible Framework for Efficient and Robust Shape Matching

Title: Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation

Title: Graph Energy Matching: Transport-Aligned Energy-Based Modeling for Graph Generation

Title: Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning

Title: GeoSANE: Learning Geospatial Representations from Models, Not Data

Title: I3DM: Implicit 3D-aware Memory Retrieval and Injection for Consistent Video Scene Generation

Title: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Title: An Experimental Study of Machine Learning-Based Intrusion Detection for OPC UA over Industrial Private 5G Networks

Title: Targeted Adversarial Traffic Generation: Black-box Approach to Evade Intrusion Detection Systems in IoT Networks

Title: 3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding

Title: CSTS: A Canonical Security Telemetry Substrate for AI-Native Cyber Detection

Title: RealMaster: Lifting Rendered Scenes into Photorealistic Video

Title: InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting

Title: Byzantine-Robust and Differentially Private Federated Optimization under Weaker Assumptions

Title: UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation

Title: SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Title: Failure of contextual invariance in gender inference with large language models

Title: TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation

Title: AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation

Title: Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation

Title: Estimating Flow Velocity and Vehicle Angle-of-Attack from Non-invasive Piezoelectric Structural Measurements Using Deep Learning

Title: WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Title: DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Title: UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Title: MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage

Title: OccAny: Generalized Unconstrained Urban 3D Occupancy