2024-07-09

Title: MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models

Title: SPINEX: Similarity-based Predictions with Explainable Neighbors Exploration for Anomaly and Outlier Detection

Title: Revealing the Utilized Rank of Subspaces of Learning in Neural Networks

Title: Segmentation-Free Guidance for Text-to-Image Diffusion Models

Title: NSD-DIL: Null-Shot Deblurring Using Deep Identity Learning

Title: MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Title: SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation

Title: FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Title: Entropy-Informed Weighting Channel Normalizing Flow

Title: How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Title: Robust Skin Color Driven Privacy Preserving Face Recognition via Function Secret Sharing

Title: FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning

Title: Synthetic Data Aided Federated Learning Using Foundation Models

Title: VisioBlend: Sketch and Stroke-Guided Denoising Diffusion Probabilistic Model for Realistic Image Generation

Title: Effect of Rotation Angle in Self-Supervised Pre-training is Dataset-Dependent

Title: Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Title: DTR: A Unified Deep Tensor Representation Framework for Multimedia Data Recovery

Title: UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

Title: Gradient Diffusion: A Perturbation-Resilient Gradient Leakage Attack

Title: An Improved Method for Personalizing Diffusion Models

Title: Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?

Title: Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Title: Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition

Title: Image-Conditional Diffusion Transformer for Underwater Image Enhancement

Title: FM-OSD: Foundation Model-Enabled One-Shot Detection of Anatomical Landmarks

Title: Cross Prompting Consistency with Segment Anything Model for Semi-supervised Medical Image Segmentation

Title: See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition

Title: Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness

Title: Just read twice: closing the recall gap for recurrent language models

Title: LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction

Title: Read, Watch and Scream! Sound Generation from Text and Video

Title: Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder

Title: Spatio-Temporal Encoding and Decoding-Based Method for Future Human Activity Skeleton Synthesis

Title: An Experimental Comparison of Transfer Learning against Self-supervised Learning

Title: Generative Debunking of Climate Misinformation

Title: WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering

Title: Deep Learning-based Anomaly Detection and Log Analysis for Computer Networks

Title: BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space

Title: Retrieved In-Context Principles from Previous Mistakes

Title: Sub-SA: Strengthen In-context Learning via Submodular Selective Annotation

Title: Empirical Study of Symmetrical Reasoning in Conversational Chatbots

Title: Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition

Title: 3D Vessel Graph Generation Using Denoising Diffusion

Title: Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning

Title: Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling

Title: Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

Title: Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding

Title: Graph Anomaly Detection with Noisy Labels by Reinforcement Learning

Title: T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

Title: On Bellman equations for continuous-time policy evaluation I: discretization and approximation

Title: LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Title: KidSat: satellite imagery to map childhood poverty dataset and benchmark

Title: Bounding Boxes and Probabilistic Graphical Models: Video Anomaly Detection Simplified

Title: Distilling System 2 into System 1

Title: Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts

Title: LaFAM: Unsupervised Feature Attribution with Label-free Activation Maps

Title: Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis

Title: LLMcap: Large Language Model for Unsupervised PCAP Failure Detection

Title: Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation

Title: PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models

Title: Structured Generations: Using Hierarchical Clusters to guide Diffusion Models

Title: ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

Title: CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators

Title: The Tug-of-War Between Deepfake Generation and Detection

Title: Transfer Learning with Self-Supervised Vision Transformers for Snake Identification

Title: Compositional Video Generation as Flow Equalization

Title: JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation

Title: 4D Contrastive Superflows are Dense 3D Representation Learners

Title: Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images