2026-03-19

Title: A foundation model for electrodermal activity data

Title: What on Earth is AlphaEarth? Hierarchical structure and functional interpretability for global land cover

Title: Leveraging Large Vision Model for Multi-UAV Co-perception in Low-Altitude Wireless Networks

Title: AgriChat: A Multimodal Large Language Model for Agriculture Image Understanding

Title: TDMM-LM: Bridging Facial Understanding and Animation via Language Models

Title: KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition

Title: SCE-LITE-HQ: Smooth visual counterfactual explanations with generative foundation models

Title: Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

Title: An End-to-End Framework for Functionality-Embedded Provenance Graph Construction and Threat Interpretation

Title: SENSE: Efficient EEG-to-Text via Privacy-Preserving Semantic Retrieval

Title: Pixel-level Counterfactual Contrastive Learning for Medical Image Segmentation

Title: MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

Title: SMAL-pets: SMAL Based Avatars of Pets from Single Image

Title: BEV-SLD: Self-Supervised Scene Landmark Detection for Global Localization with LiDAR Bird's-Eye View Images

Title: Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video

Title: Self-Conditioned Denoising for Atomistic Representation Learning

Title: TharuChat: Bootstrapping Large Language Models for a Low-Resource Language via Synthetic Data and Human Validation

Title: Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models

Title: WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation

Title: MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation

Title: Learning Permutation Distributions via Reflected Diffusion on Ranks

Title: Shot-Aware Frame Sampling for Video Understanding

Title: SCALE:Scalable Conditional Atlas-Level Endpoint transport for virtual cell perturbation prediction

Title: VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm

Title: Cohomological Obstructions to Global Counterfactuals: A Sheaf-Theoretic Foundation for Generative Causal Models

Title: The Causal Uncertainty Principle: Manifold Tearing and the Topological Limits of Counterfactual Interventions

Title: Toward Phonology-Guided Sign Language Motion Generation: A Diffusion Baseline and Conditioning Analysis

Title: Harnessing the Power of Foundation Models for Accurate Material Classification

Title: Motion-Adaptive Temporal Attention for Lightweight Video Generation with Stable Diffusion

Title: Large-Scale 3D Ground-Motion Synthesis with Physics-Inspired Latent Operator Flow Matching

Title: Joint Degradation-Aware Arbitrary-Scale Super-Resolution for Variable-Rate Extreme Image Compression

Title: SHIFT: Motion Alignment in Video Diffusion Models with Adversarial Hybrid Fine-Tuning

Title: Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates

Title: Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation

Title: Proof-of-Authorship for Diffusion-based AI Generated Content

Title: EI: Early Intervention for Multimodal Imaging based Disease Recognition

Title: MM-OVSeg:Multimodal Optical-SAR Fusion for Open-Vocabulary Segmentation in Remote Sensing

Title: AirDDE: Multifactor Neural Delay Differential Equations for Air Quality Forecasting

Title: AdapTS: Lightweight Teacher-Student Approach for Multi-Class and Continual Visual Anomaly Detection

Title: Rel-Zero: Harnessing Patch-Pair Invariance for Robust Zero-Watermarking Against AI Editing

Title: ProGVC: Progressive-based Generative Video Compression via Auto-Regressive Context Modeling

Title: FrescoDiffusion: 4K Image-to-Video with Prior-Regularized Tiled Diffusion

Title: Face anonymization preserving facial expressions and photometric realism

Title: FoMo X: Modular Explainability Signals for Outlier Detection Foundation Models

Title: Unsupervised Symbolic Anomaly Detection

Title: LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation

Title: Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing

Title: AdaMuS: Adaptive Multi-view Sparsity Learning for Dimensionally Unbalanced Data

Title: S-VGGT: Structure-Aware Subscene Decomposition for Scalable 3D Foundation Models

Title: DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis

Title: Anchoring and Rescaling Attention for Semantically Coherent Inbetweening

Title: Few-Step Diffusion Sampling Through Instance-Aware Discretizations

Title: DeepCORO-CLIP: A Multi-View Foundation Model for Comprehensive Coronary Angiography Video-Text Analysis and External Validation

Title: Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models

Title: Flow Matching Policy with Entropy Regularization

Title: Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic Segmentation

Title: Eye image segmentation using visual and concept prompts with Segment Anything Model 3 (SAM3)

Title: Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation

Title: TAPESTRY: From Geometry to Appearance via Consistent Turntable Videos

Title: Towards Infinitely Long Neural Simulations: Self-Refining Neural Surrogate Models for Dynamical Systems

Title: CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image

Title: Exploring parameter-efficient fine-tuning (PEFT) of billion-parameter vision models with QLoRA and DoRA: insights into generalization for limited-data image classification under a 98:1 test-to-train regime

Title: RangeAD: Fast On-Model Anomaly Detection

Title: ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

Title: M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking

Title: Steering Video Diffusion Transformers with Massive Activations

Title: TINA: Text-Free Inversion Attack for Unlearned Text-to-Image Diffusion Models

Title: Video Understanding: From Geometry and Semantics to Unified Models

Title: Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass

Title: Revisiting foundation models for cell instance segmentation

Title: Physics-Aware Machine Learning for Seismic and Volcanic Signal Interpretation

Title: Edit Spillover as a Probe: Do Image Editing Models Implicitly Understand World Relations?

Title: Differential Attention-Augmented BiomedCLIP with Asymmetric Focal Optimization for Imbalanced Multi-Label Video Capsule Endoscopy Classification

Title: Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs

Title: Noise-Aware Misclassification Attack Detection in Collaborative DNN Inference

Title: SegFly: A 2D-3D-2D Paradigm for Aerial RGB-Thermal Semantic Segmentation at Scale

Title: TransText: Transparency Aware Image-to-Video Typography Animation

Title: LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition

Title: Robust-ComBat: Mitigating Outlier Effects in Diffusion MRI Data Harmonization

Title: AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors

Title: LoST: Level of Semantics Tokenization for 3D Shapes

Title: The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering

Title: Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models