2024-09-27

Title: An Art-centric perspective on AI-based content moderation of nudity

Title: Enhancing Guardrails for Safe and Secure Healthcare AI

Title: A random measure approach to reinforcement learning in continuous time

Title: 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation

Title: Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs

Title: Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

Title: Consistent estimation of generative model representations in the data kernel perspective space

Title: KIPPS: Knowledge infusion in Privacy Preserving Synthetic Data Generation

Title: VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection

Title: Block Expanded DINORET: Adapting Natural Domain Foundation Models for Retinal Imaging Without Catastrophic Forgetting

Title: Trading through Earnings Seasons using Self-Supervised Contrastive Representation Learning

Title: Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Title: CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Title: Revisiting Deep Ensemble Uncertainty for Enhanced Medical Anomaly Detection

Title: Learning Quantized Adaptive Conditions for Diffusion Models

Title: Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE

Title: JoyType: A Robust Design for Multilingual Visual Text Creation

Title: A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation

Title: Pixel-Space Post-Training of Latent Diffusion Models

Title: Flexiffusion: Segment-wise Neural Architecture Search for Flexible Denoising Schedule

Title: ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition

Title: RmGPT: Rotating Machinery Generative Pretrained Model

Title: Appearance Blur-driven AutoEncoder and Motion-guided Memory Module for Video Anomaly Detection

Title: ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue

Title: Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation

Title: Dark Miner: Defend against unsafe generation for text-to-image diffusion models

Title: MIO: A Foundation Model on Multimodal Tokens

Title: AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status

Title: Text Image Generation for Low-Resource Languages with Dual Translation Learning

Title: Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs

Title: Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness

Title: Continual learning with task specialist

Title: Ordinary Differential Equations for Enhanced 12-Lead ECG Generation

Title: Machine Learning-based vs Deep Learning-based Anomaly Detection in Multivariate Time Series for Spacecraft Attitude Sensors

Title: Self-supervised Monocular Depth Estimation with Large Kernel Attention

Title: Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect

Title: WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Title: Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Title: Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion

Title: Perturb, Attend, Detect and Localize (PADL): Robust Proactive Image Defense

Title: Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition

Title: CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors

Title: LLM4Brain: Training a Large Language Model for Brain Video Understanding

Title: InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction

Title: Transferring disentangled representations: bridging the gap between synthetic and real images

Title: EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Title: Stable Video Portraits

Title: DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models

Title: Self-supervised Pretraining for Cardiovascular Magnetic Resonance Cine Segmentation

Title: EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation

Title: Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Title: FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner