2025-04-01

Title: A Novel Chaos-Based Cryptographic Scrambling Technique to Secure Medical Images

Title: A Spatial-temporal Deep Probabilistic Diffusion Model for Reliable Hail Nowcasting with Radar Echo Extrapolation

Title: Reasoning Beyond Limits: Advances and Open Problems for LLMs

Title: Cyborg Data: Merging Human with AI Generated Training Data

Title: ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning

Title: Adaptive State-Space Mamba for Real-Time Sensor Data Anomaly Detection

Title: LeForecast: Enterprise Hybrid Forecast by Time Series Intelligence

Title: Ignite Forecasting with SPARK: An Efficient Generative Framework for Refining LLMs in Temporal Knowledge Graph Forecasting

Title: Patronus: Bringing Transparency to Diffusion Models with Prototypes

Title: DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers

Title: Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study

Title: SIGHT: Single-Image Conditioned Generation of Hand Trajectories for Hand-Object Interaction

Title: Task Tokens: A Flexible Approach to Adapting Behavior Foundation Models

Title: Learning Library Cell Representations in Vector Space

Title: Resona: Improving Context Copying in Linear Recurrence Models with Retrieval

Title: SuperEIO: Self-Supervised Event Feature Learning for Event Inertial Odometry

Title: Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning

Title: MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs

Title: Unsupervised Anomaly Detection in Multivariate Time Series across Heterogeneous Domains

Title: Efficient Adaptation For Remote Sensing Visual Grounding

Title: The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction

Title: Evaluating Compositional Scene Understanding in Multimodal Generative Models

Title: A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery

Title: Enhancing Knowledge Graph Completion with Entity Neighborhood and Relation Context

Title: RECALL-MM: A Multimodal Dataset of Consumer Product Recalls for Risk Analysis using Computational Methods and Large Language Models

Title: Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection

Title: Synthetic Art Generation and DeepFake Detection A Study on Jamini Roy Inspired Dataset

Title: Evaluating how LLM annotations represent diverse views on contentious topics

Title: Learning Predictive Visuomotor Coordination

Title: HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation

Title: TraceMark-LDM: Authenticatable Watermarking for Latent Diffusion Models via Binary-Guided Rearrangement

Title: Object Isolated Attention for Consistent Story Visualization

Title: DSPFusion: Image Fusion via Degradation and Semantic Dual-Prior Guidance

Title: Towards Physically Plausible Video Generation via VLM Planning

Title: Map Feature Perception Metric for Map Generation Quality Assessment and Loss Optimization

Title: JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Title: A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models

Title: Diffusion Meets Few-shot Class Incremental Learning

Title: GMapLatent: Geometric Mapping in Latent Space

Title: Towards Trustworthy GUI Agents: A Survey

Title: AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection

Title: Beyond Academic Benchmarks: Critical Analysis and Best Practices for Visual Industrial Anomaly Detection

Title: TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Title: Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model

Title: Federated Self-Supervised Learning for One-Shot Cross-Modal and Cross-Imaging Technique Segmentation

Title: Enhancing Creative Generation on Stable Diffusion-based Models

Title: DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution

Title: Make Autoregressive Great Again: Diffusion-Free Graph Generation with Next-Scale Prediction

Title: Graph-Eq: Discovering Mathematical Equations using Graph Generative Models

Title: Leveraging Vision-Language Foundation Models to Reveal Hidden Image-Attribute Relationships in Medical Imaging

Title: Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation

Title: Expanding-and-Shrinking Binary Neural Networks

Title: Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space

Title: KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language

Title: Time-Series Forecasting via Topological Information Supervised Framework with Efficient Topological Feature Learning

Title: Accelerating High-Efficiency Organic Photovoltaic Discovery via Pretrained Graph Neural Networks and Generative Reinforcement Learning

Title: WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization

Title: MGD-SAM2: Multi-view Guided Detail-enhanced Segment Anything Model 2 for High-Resolution Class-agnostic Segmentation

Title: On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices

Title: An extension of linear self-attention for in-context learning

Title: Conformal uncertainty quantification to evaluate predictive fairness of foundation AI model for skin lesion classes across patient demographics

Title: Expanding RL with Verifiable Rewards Across Diverse Domains

Title: FlexiMo: A Flexible Remote Sensing Foundation Model

Title: Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation

Title: ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image

Title: MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach

Title: DiffScale: Continuous Downscaling and Bias Correction of Subseasonal Wind Speed Forecasts using Diffusion Models

Title: Training-Free Text-Guided Image Editing with Visual Autoregressive Model

Title: HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment

Title: Green MLOps to Green GenOps: An Empirical Study of Energy Consumption in Discriminative and Generative AI Operations

Title: JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation

Title: SALT: A Flexible Semi-Automatic Labeling Tool for General LiDAR Point Clouds with Cross-Scene Adaptability and 4D Consistency

Title: Federated Structured Sparse PCA for Anomaly Detection in IoT Networks

Title: DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model

Title: AMMSM: Adaptive Motion Magnification and Sparse Mamba for Micro-Expression Recognition

Title: A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing

Title: PolypSegTrack: Unified Foundation Model for Colonoscopy Video Analysis

Title: It's a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data

Title: Learning a Canonical Basis of Human Preferences from Binary Ratings

Title: Foundation Models For Seismic Data Processing: An Extensive Review

Title: Implicit In-Context Learning: Evidence from Artificial Language Experiments

Title: DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting

Title: Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes

Title: Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation

Title: Beyond a Single Mode: GAN Ensembles for Diverse Medical Data Generation

Title: FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics

Title: Visual Acoustic Fields

Title: Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction

Title: Style Quantization for Data-Efficient GAN Training

Title: Can Test-Time Scaling Improve World Foundation Model?

Title: NoProp: Training Neural Networks without Back-propagation or Forward-propagation

Title: Self-Supervised Pretraining for Aerial Road Extraction

Title: PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks

Title: ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Title: InstructRestore: Region-Customized Image Restoration with Human Instructions

Title: Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation

Title: Consistent Subject Generation via Contrastive Instantiated Concepts