diffusion

Title: Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation. (arXiv:2308.07929v1 [cs.CV])

Title: YODA: You Only Diffuse Areas. An Area-Masked Diffusion Approach For Image Super-Resolution. (arXiv:2308.07977v1 [cs.CV])

Title: DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory. (arXiv:2308.08089v1 [cs.CV])

Title: Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis. (arXiv:2308.08157v1 [cs.CV])

Title: Dual-Stream Diffusion Net for Text-to-Video Generation. (arXiv:2308.08316v1 [cs.CV])

Title: Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model. (arXiv:2308.08367v1 [cs.CR])

Title: TeCH: Text-guided Reconstruction of Lifelike Clothed Humans. (arXiv:2308.08545v1 [cs.CV])

self-supervised

Title: Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation. (arXiv:2308.07931v1 [cs.CV])

Title: Contrastive Learning for Lane Detection via cross-similarity. (arXiv:2308.08242v1 [cs.CV])

Title: Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations. (arXiv:2308.08321v1 [cs.CV])

Title: Self-Supervised Online Camera Calibration for Automated Driving and Parking Applications. (arXiv:2308.08495v1 [cs.CV])

Title: Is Self-Supervised Pretraining Good for Extrapolation in Molecular Property Prediction?. (arXiv:2308.08129v1 [cs.LG])

foundation model

Title: $A^2$Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models. (arXiv:2308.07997v1 [cs.CV])

Title: LLM4TS: Two-Stage Fine-Tuning for Time-Series Forecasting with Pre-Trained LLMs. (arXiv:2308.08469v1 [cs.LG])

generative

Title: Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment. (arXiv:2308.08525v1 [cs.CV])

Title: Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System. (arXiv:2308.08169v1 [cs.CL])

Title: Deep Generative Imputation Model for Missing Not At Random Data. (arXiv:2308.08158v1 [cs.LG])

Title: It Ain't That Bad: Understanding the Mysterious Performance Drop in OOD Generalization for Generative Transformer Models. (arXiv:2308.08268v1 [cs.LG])

Title: Explainable AI for clinical risk prediction: a survey of concepts, methods, and modalities. (arXiv:2308.08407v1 [cs.LG])

anomaly

in-context

Title: Time Travel in LLMs: Tracing Data Contamination in Large Language Models. (arXiv:2308.08493v1 [cs.CL])