2025-04-07

Title: Comparative Analysis of Deepfake Detection Models: New Approaches and Perspectives

Title: Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments

Title: VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

Title: Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching

Title: VIP: Video Inpainting Pipeline for Real World Human Removal

Title: How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models

Title: SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections

Title: FontGuard: A Robust Font Watermarking Approach Leveraging Deep Font Knowledge

Title: Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

Title: NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving

Title: Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning

Title: REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval

Title: MIMRS: A Survey on Masked Image Modeling in Remote Sensing

Title: Steerable Anatomical Shape Synthesis with Implicit Neural Representations

Title: Optimal Embedding Guided Negative Sample Generation for Knowledge Graph Link Prediction

Title: QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning

Title: BitHEP -- The Limits of Low-Precision ML in HEP

Title: Autonomous state-space segmentation for Deep-RL sparse reward scenarios

Title: D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations

Title: Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis

Title: BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution

Title: Diffusion Active Learning: Towards Data-Driven Experimental Design in Computed Tomography

Title: HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration

Title: Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal

Title: Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution

Title: VISTA-OCR: Towards generative and interactive end to end OCR models

Title: Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions

Title: MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models