2024-08-22

Title: Tabular Transfer Learning via Prompting LLMs

Title: DiffZOO: A Purely Query-Based Black-Box Attack for Red-teaming Text-to-Image Generative Model via Zeroth Order Optimization

Title: GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting

Title: MS$^3$D: A RG Flow-Based Regularization for GAN Training with Limited Data

Title: Total Uncertainty Quantification in Inverse PDE Solutions Obtained with Reduced-Order Deep Learning Surrogate Models

Title: Compress Guidance in Conditional Diffusion Sampling

Title: UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library

Title: PooDLe: Pooled and dense self-supervised learning from naturalistic videos

Title: On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes

Title: CooPre: Cooperative Pretraining for V2X Cooperative Perception

Title: Do Neural Scaling Laws Exist on Graph Self-Supervised Learning?

Title: Taming Generative Diffusion for Universal Blind Image Restoration

Title: UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation

Title: TWLV-I: Analysis and Insights from Holistic Evaluation on Video Foundation Models

Title: HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model

Title: Hypergraph Learning based Recommender System for Anomaly Detection, Control and Optimization

Title: Video Diffusion Models are Strong Video Inpainter

Title: Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection

Title: SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction

Title: Pano2Room: Novel View Synthesis from a Single Indoor Panorama

Title: Towards "Differential AI Psychology" and in-context Value-driven Statement Alignment with Moral Foundations Theory

Title: T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval

Title: GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting

Title: MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation

Title: TrackGo: A Flexible and Efficient Method for Controllable Video Generation

Title: Imagining from Images with an AI Storytelling Tool

Title: Just Project! Multi-Channel Despeckling, the Easy Way

Title: Memorization In In-Context Learning

Title: AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion

Title: Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance

Title: Self-Supervised Iterative Refinement for Anomaly Detection in Industrial Quality Control

Title: AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition

Title: Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors

Title: FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting

Title: Iterative Object Count Optimization for Text-to-image Diffusion Models

Title: Sum of Squares Circuits

Title: Timeline and Boundary Guided Diffusion Network for Video Shadow Detection

Title: Practical token pruning for foundation models in few-shot conversational virtual assistant systems

Title: Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models

Title: EmbodiedSAM: Online Segment Any 3D Thing in Real Time