2025-01-16

Title: SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval

Title: Cross-Modal Transferable Image-to-Video Attack on Video Quality Metrics

Title: Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Title: Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time

Title: Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin

Title: Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition

Title: Yuan: Yielding Unblemished Aesthetics Through A Unified Network for Visual Imperfections Removal in Generated Images

Title: Score-based 3D molecule generation with neural fields

Title: Multimodal Fake News Video Explanation Generation

Title: Comprehensive Subjective and Objective Evaluation Method for Text-generated Video

Title: Molecular Graph Contrastive Learning with Line Graph

Title: Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT)

Title: RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation

Title: CT-PatchTST: Channel-Time Patch Time-Series Transformer for Long-Term Renewable Energy Forecasting

Title: SWSC: Shared Weight for Similar Channel in LLM

Title: Joint Learning of Depth and Appearance for Portrait Image Animation

Title: StereoGen: High-quality Stereo Image Generation from a Single Image

Title: Investigating Parameter-Efficiency of Hybrid QuGANs Based on Geometric Properties of Generated Sea Route Graphs

Title: Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models

Title: Few-Shot Learner Generalizes Across AI-Generated Image Detection

Title: Deep learning for temporal super-resolution 4D Flow MRI

Title: Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Title: ARMOR: Shielding Unlearnable Examples against Data Augmentation

Title: Enhanced Multi-Scale Cross-Attention for Person Image Generation

Title: CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation

Title: CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Title: RepVideo: Rethinking Cross-Layer Representation for Video Generation

Title: VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science

Title: SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation

Title: Multimodal LLMs Can Reason about Aesthetics in Zero-Shot

Title: Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion