2024-04-26

Title: A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming

Title: Quantitative Characterization of Retinal Features in Translated OCTA

Title: Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall

Title: S2DEVFMAP: Self-Supervised Learning Framework with Dual Ensemble Voting Fusion for Maximizing Anomaly Prediction in Timeseries

Title: ABCD: Trust enhanced Attention based Convolutional Autoencoder for Risk Assessment

Title: Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model

Title: An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape

Title: AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models

Title: Reinforcement Learning with Generative Models for Compact Support Sets

Title: CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions

Title: TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Title: Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models

Title: FedStyle: Style-Based Federated Learning Crowdsourcing Framework for Art Commissions

Title: Guarding Graph Neural Networks for Unsupervised Graph Anomaly Detection

Title: U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF

Title: Asking and Answering Questions to Extract Event-Argument Structures

Title: SynCellFactory: Generative Data Augmentation for Cell Tracking

Title: Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud

Title: DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference

Title: 3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior

Title: Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models

Title: MonoPCC: Photometric-invariant Cycle Constraint for Monocular Depth Estimation of Endoscopic Images

Title: MuseumMaker: Continual Style Customization without Catastrophic Forgetting

Title: Denoising: from classical methods to deep CNNs

Title: Zero-Shot Distillation for Image Encoders: How to Make Effective Use of Synthetic Data

Title: Tele-FLM Technical Report

Title: Formal Specification, Assessment, and Enforcement of Fairness for Generative AIs

Title: Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior

Title: NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

Title: Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents

Title: RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis

Title: REBEL: Reinforcement Learning via Regressing Relative Rewards

Title: ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

Title: ConKeD++ -- Improving descriptor learning for retinal image registration: A comprehensive study of contrastive losses

Title: In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter Optimization

Title: Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning

Title: Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals

Title: Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Title: How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Title: Made to Order: Discovering monotonic temporal changes via self-supervised video ordering

Title: Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Title: The Third Monocular Depth Estimation Challenge