2024-11-26

Title: Adaptively Controllable Diffusion Model for Efficient Conditional Image Generation

Title: Multimodal large language model for wheat breeding: a new exploration of smart breeding

Title: DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh

Title: S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning

Title: Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Title: Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps

Title: AnyText2: Visual Text Generation and Editing With Customizable Attributes

Title: Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward

Title: LocRef-Diffusion:Tuning-Free Layout and Appearance-Guided Generation

Title: MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation

Title: Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI

Title: EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registration

Title: Foundation Cures Personalization: Recovering Facial Personalized Models' Prompt Consistency

Title: Don't Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation LLM

Title: There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks

Title: Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage

Title: Gradient-Free Classifier Guidance for Diffusion Model Sampling

Title: FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation

Title: Semi-supervised Single-view 3D Reconstruction via Multi Shape Prior Fusion Strategy and Self-Attention

Title: Learning a local trading strategy: deep reinforcement learning for grid-scale renewable energy integration

Title: What Makes a Scene ? Scene Graph-based Evaluation and Feedback for Controllable Generation

Title: ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance

Title: Twin Trigger Generative Networks for Backdoor Attacks against Object Detection

Title: Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy

Title: Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Title: SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion

Title: KinMo: Kinematic-aware Human Motion Understanding and Generation

Title: Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation

Title: AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation

Title: Interactive Visual Assessment for Text-to-Image Generation Models

Title: MUNBa: Machine Unlearning via Nash Bargaining

Title: Large Language Model with Region-guided Referring and Grounding for CT Report Generation

Title: Optical-Flow Guided Prompt Optimization for Coherent Video Generation

Title: Improving Transferable Targeted Attacks with Feature Tuning Mixup

Title: TKG-DM: Training-free Chroma Key Content Generation Diffusion Model

Title: FLD+: Data-efficient Evaluation Metric for Generative Models

Title: Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing

Title: Fixing the Perspective: A Critical Examination of Zero-1-to-3

Title: ROOT: VLM based System for Indoor Scene Understanding and Beyond

Title: Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks

Title: LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration

Title: Beyond Data Scarcity: A Frequency-Driven Framework for Zero-Shot Forecasting

Title: Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Title: PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs

Title: Making Images from Images: Interleaving Denoising and Transformation

Title: Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors

Title: CNNs for Style Transfer of Digital to Film Photography

Title: From Dashcam Videos to Driving Simulations: Stress Testing Automated Vehicles against Rare Events

Title: Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

Title: Boosting 3D Object Generation through PBR Materials

Title: AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity

Title: Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain

Title: TreeFormer: Single-view Plant Skeleton Estimation via Tree-constrained Graph Generation

Title: Context Awareness Gate For Retrieval Augmented Generation

Title: MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

Title: Text-to-Image Synthesis: A Decade Survey

Title: BadSFL: Backdoor Attack against Scaffold Federated Learning

Title: Image Generation Diversity Issues and How to Tame Them

Title: U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields

Title: Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation

Title: VIRES: Video Instance Repainting with Sketch and Text Guidance

Title: Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models

Title: SMGDiff: Soccer Motion Generation using diffusion probabilistic models

Title: Mixed Degradation Image Restoration via Local Dynamic Optimization and Conditional Embedding

Title: Weakly supervised image segmentation for defect-based grading of fresh produce

Title: Diagnosis of diabetic retinopathy using machine learning & deep learning technique

Title: DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation

Title: EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training

Title: One Diffusion to Generate Them All

Title: Luminance Component Analysis for Exposure Correction

Title: CapHDR2IR: Caption-Driven Transfer from Visible Light to Infrared Domain

Title: Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

Title: Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN

Title: TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation

Title: Unsupervised Event Outlier Detection in Continuous Time

Title: SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis

Title: VQ-SGen: A Vector Quantized Stroke Representation for Sketch Generation

Title: Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency

Title: Multi-Resolution Generative Modeling of Human Motion from Limited Data

Title: LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation

Title: Representation Collapsing Problems in Vector Quantization

Title: Enhancing Few-Shot Learning with Integrated Data and GAN Model Approaches

Title: Rethinking Diffusion for Text-Driven Human Motion Generation

Title: Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

Title: Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

Title: Imperceptible Adversarial Examples in the Physical World

Title: Exploring Discrete Flow Matching for 3D De Novo Molecule Generation

Title: DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Title: Factorized Visual Tokenization and Generation

Title: Generative Omnimatte: Learning to Decompose Video into Layers