2025-01-03

Title: Highly Optimized Kernels and Fine-Grained Codebooks for LLM Inference on Arm CPUs

Title: DDD-GenDT: Dynamic Data-driven Generative Digital Twin Framework

Title: Generative Models for Financial Time Series Data: Enhancing Signal-to-Noise Ratio and Addressing Data Scarcity in A-Share Market

Title: A Novel Framework for Learning Stochastic Representations for Sequence Generation and Recognition

Title: LTX-Video: Realtime Video Latent Diffusion

Title: PQD: Post-training Quantization for Efficient Diffusion Models

Title: TrajLearn: Trajectory Prediction Learning using Deep Generative Models

Title: MLLM-as-a-Judge for Image Safety without Human Labeling

Title: DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion

Title: ReFormer: Generating Radio Fakes for Data Augmentation

Title: Dual Diffusion for Unified Image Generation and Understanding

Title: Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free

Title: Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models

Title: Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning

Title: SAT-LDM: Provably Generalizable Image Watermarking for Latent Diffusion Models with Self-Augmented Training

Title: Dementia Detection using Multi-modal Methods on Audio Data

Title: Probing Visual Language Priors in VLMs

Title: Unbiased GNN Learning via Fairness-Aware Subgraph Diffusion

Title: DreamDrive: Generative 4D Scene Modeling from Street View Images

Title: DiC: Rethinking Conv3x3 Designs in Diffusion Models

Title: SoundBrush: Sound as a Brush for Visual Scene Editing

Title: Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models

Title: Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection

Title: RORem: Training a Robust Object Remover with Human-in-the-Loop

Title: Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation

Title: Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation

Title: Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction

Title: Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts

Title: Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model

Title: Population Aware Diffusion for Time Series Generation

Title: AutoPresent: Designing Structured Visuals from Scratch

Title: Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models

Title: A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset

Title: Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion

Title: OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes

Title: Optimizing Noise Schedules of Generative Models in High Dimensions

Title: State-of-the-art AI-based Learning Approaches for Deepfake Generation and Detection, Analyzing Opportunities, Threading through Pros, Cons, and Future Prospects

Title: Event Masked Autoencoder: Point-wise Action Recognition with Event-Based Cameras

Title: Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches

Title: Graph Generative Pre-trained Transformer

Title: EliGen: Entity-Level Controlled Image Generation with Regional Attention

Title: AIM: Additional Image Guided Generation of Transferable Adversarial Attacks

Title: BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization

Title: HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment

Title: DuMo: Dual Encoder Modulation Network for Precise Concept Erasure

Title: TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions

Title: LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge

Title: TabTreeFormer: Tree Augmented Tabular Data Generation using Transformers

Title: Conditional Consistency Guided Image Translation and Enhancement

Title: SVFR: A Unified Framework for Generalized Video Face Restoration

Title: SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

Title: Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement

Title: On Unifying Video Generation and Camera Pose Estimation

Title: Multi-Modal Video Feature Extraction for Popularity Prediction

Title: Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Title: Object-level Visual Prompts for Compositional Image Generation

Title: Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions

Title: VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control