2024-05-16

Title: Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the Video

Title: CLIP with Quality Captions: A Strong Pretraining for Vision Tasks

Title: Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysis

Title: Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis

Title: Deep Learning in Earthquake Engineering: A Comprehensive Review

Title: SMART: Towards Pre-trained Missing-Aware Model for Patient Health Status Prediction

Title: CTS: A Consistency-Based Medical Image Segmentation Model

Title: Response Matching for generating materials and molecules

Title: RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing

Title: Towards Next-Generation Steganalysis: LLMs Unleash the Power of Detecting Steganography

Title: SOEDiff: Efficient Distillation for Small Object Editing

Title: A Hierarchically Feature Reconstructed Autoencoder for Unsupervised Anomaly Detection

Title: QMedShield: A Novel Quantum Chaos-based Image Encryption Scheme for Secure Medical Image Storage in the Cloud

Title: Dance Any Beat: Blending Beats with Visuals in Dance Video Generation

Title: DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations

Title: SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition

Title: Time-Equivariant Contrastive Learning for Degenerative Disease Progression in Retinal OCT

Title: Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images

Title: A Survey On Text-to-3D Contents Generation In The Wild