diffusion

Title: Zero-Shot Object Counting with Language-Vision Models. (arXiv:2309.13097v1 [cs.CV])

Title: GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER. (arXiv:2309.13274v1 [cs.CV])

self-supervised

Title: Understanding Calibration of Deep Neural Networks for Medical Image Classification. (arXiv:2309.13132v1 [cs.CV])

Title: Poster: Self-Supervised Quantization-Aware Knowledge Distillation. (arXiv:2309.13220v1 [cs.CV])

Title: M$^3$CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders. (arXiv:2309.13235v1 [cs.CV])

Title: Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation. (arXiv:2309.13248v1 [cs.CV])

Title: C$^2$VAE: Gaussian Copula-based VAE Differing Disentangled from Coupled Representations with Contrastive Posterior. (arXiv:2309.13303v1 [cs.LG])

foundation model

generative

Title: GAMIX-VAE: A VAE with Gaussian Mixture Based Posterior. (arXiv:2309.13160v1 [cs.LG])

Title: Flow Factorized Representation Learning. (arXiv:2309.13167v1 [cs.LG])

Title: MISFIT-V: Misaligned Image Synthesis and Fusion using Information from Thermal and Visual. (arXiv:2309.13216v1 [cs.CV])

Title: ChEDDAR: Student-ChatGPT Dialogue in EFL Writing Education. (arXiv:2309.13243v1 [cs.CL])

Title: Beyond Fairness: Age-Harmless Parkinson's Detection via Voice. (arXiv:2309.13292v1 [cs.LG])

anomaly

Title: Real3D-AD: A Dataset of Point Cloud Anomaly Detection. (arXiv:2309.13226v1 [cs.CV])

in-context

Title: A Practical Survey on Zero-shot Prompt Design for In-context Learning. (arXiv:2309.13205v1 [cs.CL])

Title: User Simulation with Large Language Models for Evaluating Task-Oriented Dialogue. (arXiv:2309.13233v1 [cs.CL])

Title: Calibrating LLM-Based Evaluator. (arXiv:2309.13308v1 [cs.CL])