2025-04-14

Title: Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

Title: Teaching Humans Subtle Differences with DIFFusion

Title: Compositional Flows for 3D Molecule and Synthesis Pathway Co-design

Title: X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization

Title: ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting

Title: POEM: Precise Object-level Editing via MLLM control

Title: Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects

Title: LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution

Title: TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation

Title: RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements

Title: VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions

Title: CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model

Title: DreamFuse: Adaptive Image Fusion with Diffusion Transformer

Title: Generative AI for Film Creation: A Survey of Recent Advances

Title: EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model

Title: LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs

Title: PCA-RAG: Principal Component Analysis for Efficient Retrieval-Augmented Generation

Title: Graph Reduction with Unsupervised Learning in Column Generation: A Routing Application

Title: A Knowledge-guided Adversarial Defense for Resisting Malicious Visual Manipulation

Title: GeoTexBuild: 3D Building Model Generation from Map Footprints

Title: Customizing Spider Silk: Generative Models with Mechanical Property Conditioning for Protein Engineering

Title: Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion

Title: Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation

Title: Discriminator-Free Direct Preference Optimization for Video Diffusion

Title: Slicing the Gaussian Mixture Wasserstein Distance

Title: Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets

Title: ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration

Title: Efficient Mixture of Geographical Species for On Device Wildlife Monitoring

Title: Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging

Title: Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization

Title: Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Title: Hypergraph Vision Transformers: Images are More than Nodes, More than Edges

Title: Generating Fine Details of Entity Interactions

Title: GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation