2024-04-19

Title: Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning

Title: When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery

Title: Tailoring Generative Adversarial Networks for Smooth Airfoil Design

Title: Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement

Title: Hypergraph Self-supervised Learning with Sampling-efficient Signals

Title: From Image to Video, what do we need in multimodal LLMs?

Title: OPTiML: Dense Semantic Invariance Using Optimal Transport for Self-Supervised Medical Image Representation

Title: FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Title: EdgeFusion: On-Device Text-to-Image Generation

Title: LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights

Title: Sketch-guided Image Inpainting with Partial Discrete Diffusion Process

Title: The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models

Title: Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation

Title: Sequential Compositional Generalization in Multimodal Models

Title: S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal

Title: StyleBooth: Image Style Editing with Multimodal Instruction

Title: How to Benchmark Vision Foundation Models for Semantic Segmentation?

Title: Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

Title: Blind Localization and Clustering of Anomalies in Textures

Title: Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models

Title: Physics-integrated generative modeling using attentive planar normalizing flow based variational autoencoder

Title: Guided Discrete Diffusion for Electronic Health Record Generation

Title: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

Title: Measuring Feature Dependency of Neural Networks by Collapsing Feature Dimensions in the Data Manifold

Title: Large Language Models in Targeted Sentiment Analysis

Title: AniClipart: Clipart Animation with Text-to-Video Priors

Title: Point-In-Context: Understanding Point Cloud via In-Context Learning

Title: Towards a Foundation Model for Partial Differential Equation: Multi-Operator Learning and Extrapolation

Title: From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

Title: Inverse Neural Rendering for Explainable Multi-Object Tracking

Title: MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale

Title: Lazy Diffusion Transformer for Interactive Image Editing

Title: G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis

Title: SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Title: VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Title: Moving Object Segmentation: All You Need Is SAM (and Flow)

Title: On the Content Bias in Fréchet Video Distance