2024-12-30

Title: ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban Science

Title: Dissecting CLIP: Decomposition with a Schur Complement-based Approach

Title: 1.58-bit FLUX

Title: Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation

Title: Elucidating Flow Matching ODE Dynamics with respect to Data Geometries

Title: Embodied Image Quality Assessment for Robotic Intelligence

Title: ObitoNet: Multimodal High-Resolution Point Cloud Reconstruction

Title: Protective Perturbations against Unauthorized Data Usage in Diffusion-based Image Generation

Title: DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images

Title: DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions

Title: CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection

Title: DiFiC: Your Diffusion Model Holds the Secret to Fine-Grained Clustering

Title: SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation

Title: Computing Approximate Graph Edit Distance via Optimal Transport

Title: Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework

Title: Accelerating Diffusion Transformers with Dual Feature Caching

Title: Generative Face Parsing Map Guided 3D Face Reconstruction Under Occluded Scenes

Title: Exemplar-condensed Federated Class-incremental Learning

Title: UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Title: TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment

Title: ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement

Title: MGAN-CRCM: A Novel Multiple Generative Adversarial Network and Coarse-Refinement Based Cognizant Method for Image Inpainting

Title: FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing

Title: Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation

Title: DAPoinTr: Domain Adaptive Point Transformer for Point Cloud Completion

Title: FFCG: Effective and Fast Family Column Generation for Solving Large-Scale Linear Program

Title: Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation

Title: Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models

Title: Discrete vs. Continuous Trade-offs for Generative Models

Title: Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing

Title: MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo

Title: Generating Editable Head Avatars with 3D Gaussian GANs

Title: Referencing Where to Focus: Improving VisualGrounding with Referential Query

Title: Learning Cross-Domain Representations for Transferable Drug Perturbations on Single-Cell Transcriptional Responses

Title: FineVQ: Fine-Grained User Generated Content Video Quality Assessment

Title: PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing

Title: RAG with Differential Privacy

Title: Manga Generation via Layout-controllable Diffusion

Title: MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios

Title: MINIMA: Modality Invariant Image Matching

Title: Multi-scale Latent Point Consistency Models for 3D Shape Generation

Title: Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning

Title: NijiGAN: Transform What You See into Anime with Contrastive Semi-Supervised Learning and Neural Ordinary Differential Equations

Title: Focusing Image Generation to Mitigate Spurious Correlations

Title: Generative Adversarial Network on Motion-Blur Image Restoration

Title: DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT

Title: Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model

Title: Is Your Text-to-Image Model Robust to Caption Noise?

Title: P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision

Title: Diverse Rare Sample Generation with Pretrained GANs

Title: Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference

Title: ReNeg: Learning Negative Embedding with Reward Guidance

Title: VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Title: From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

Title: Generative Pretrained Embedding and Hierarchical Irregular Time Series Representation for Daily Living Activity Recognition

Title: Generative Video Propagation

Title: Tensor Network Estimation of Distribution Algorithms

Title: InfAlign: Inference-aware language model alignment