diffusion

Title: U-Turn Diffusion. (arXiv:2308.07421v1 [cs.LG])

Title: UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity. (arXiv:2308.07428v1 [cs.CV])

Title: SGDiff: A Style Guided Diffusion Model for Fashion Synthesis. (arXiv:2308.07605v1 [cs.CV])

Title: Geometry of the Visual Cortex with Applications to Image Inpainting and Enhancement. (arXiv:2308.07652v1 [cs.CV])

Title: Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training. (arXiv:2308.07665v1 [cs.CV])

Title: DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models. (arXiv:2308.07687v1 [cs.CV])

Title: Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model. (arXiv:2308.07749v1 [cs.CV])

Title: CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction. (arXiv:2308.07837v1 [cs.CV])

Title: StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models. (arXiv:2308.07863v1 [cs.CV])

Title: Physics-Informed Deep Learning to Reduce the Bias in Joint Prediction of Nitrogen Oxides. (arXiv:2308.07441v1 [cs.LG])

self-supervised

Title: PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects. (arXiv:2308.07391v1 [cs.CV])

Title: Semantify: Simplifying the Control of 3D Morphable Models using CLIP. (arXiv:2308.07415v1 [cs.CV])

Title: Multi-view 3D Face Reconstruction Based on Flame. (arXiv:2308.07551v1 [cs.CV])

Title: Self-supervised Hypergraphs for Learning Multiple World Interpretations. (arXiv:2308.07615v1 [cs.CV])

foundation model

Title: Self-Prompting Large Vision Models for Few-Shot Medical Image Segmentation. (arXiv:2308.07624v1 [cs.CV])

Title: Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval. (arXiv:2308.07648v1 [cs.CV])

Title: A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision. (arXiv:2308.07898v1 [cs.CV])

generative

Title: Confidence Contours: Uncertainty-Aware Annotation for Medical Semantic Segmentation. (arXiv:2308.07528v1 [cs.CV])

Title: Development and Evaluation of Three Chatbots for Postpartum Mood and Anxiety Disorders. (arXiv:2308.07407v1 [cs.CL])

Title: Playing with Words: Comparing the Vocabulary and Lexical Richness of ChatGPT and Humans. (arXiv:2308.07462v1 [cs.CL])

Title: Informed Named Entity Recognition Decoding for Generative Language Models. (arXiv:2308.07791v1 [cs.CL])

Title: Generating Personas for Games with Multimodal Adversarial Imitation Learning. (arXiv:2308.07598v1 [cs.LG])

anomaly

Title: Future Video Prediction from a Single Frame for Video Anomaly Detection. (arXiv:2308.07783v1 [cs.CV])

Title: ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition. (arXiv:2308.07815v1 [cs.CV])

Title: A Graph Encoder-Decoder Network for Unsupervised Anomaly Detection. (arXiv:2308.07774v1 [cs.LG])

in-context

Title: Link-Context Learning for Multimodal LLMs. (arXiv:2308.07891v1 [cs.CV])

Title: RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models. (arXiv:2308.07922v1 [cs.CL])

Title: Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models. (arXiv:2308.07847v1 [cs.CR])