2025-01-17

Title: Do generative video models learn physical principles from watching videos?

Title: Generative Visual Commonsense Answering and Explaining with Generative Scene Graph Constructing

Title: CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion

Title: Generating Realistic Synthetic Head Rotation Data for Extended Reality using Deep Learning

Title: SHYI: Action Support for Contrastive Learning in High-Fidelity Text-to-Image Generation

Title: Generative Medical Image Anonymization Based on Latent Code Projection and Optimization

Title: Attention is All You Need Until You Need Retention

Title: Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation

Title: Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures

Title: Knowledge Distillation for Image Restoration: Simultaneous Learning from Degraded and Clean Images

Title: Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding

Title: Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression

Title: SVIA: A Street View Image Anonymization Framework for Self-Driving Applications

Title: FASP: Fast and Accurate Structured Pruning of Large Language Models

Title: Dynamic Neural Style Transfer for Artistic Image Generation using VGG19

Title: CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation

Title: Pruning for Sparse Diffusion Models based on Gradient Flow

Title: VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Title: AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation

Title: Confidence Estimation for Error Detection in Text-to-SQL Systems

Title: Intra-day Solar and Power Forecast for Optimization of Intraday Market Participation

Title: Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis

Title: Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities

Title: WMamba: Wavelet-based Mamba for Face Forgery Detection

Title: Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework

Title: A Simple Aerial Detection Baseline of Multimodal Language Models

Title: Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Title: Learnings from Scaling Visual Tokenizers for Reconstruction and Generation