2024-12-24

Title: A Decade of Deep Learning: A Survey on The Magnificent Seven

Title: AgroXAI: Explainable AI-Driven Crop Recommendation System for Agriculture 4.0

Title: Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study

Title: Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation

Title: ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping

Title: AdvIRL: Reinforcement Learning-Based Adversarial Attacks on 3D NeRF Models

Title: GALOT: Generative Active Learning via Optimizable Zero-shot Text-to-image Generation

Title: Training-free Heterogeneous Graph Condensation via Data Selection

Title: Interactive Scene Authoring with Specialized Generative Primitives

Title: PromptLA: Towards Integrity Verification of Black-box Text-to-Image Diffusion Models

Title: HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

Title: When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization

Title: Improving Equity in Health Modeling with GPT4-Turbo Generated Synthetic Data: A Comparative Study

Title: A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation

Title: Has LLM Reached the Scaling Ceiling Yet? Unified Insights into LLM Regularities and Constraints

Title: Rethinking Model Redundancy for Low-light Image Enhancement

Title: Enhancing Nighttime Vehicle Detection with Day-to-Night Style Transfer and Labeling-Free Augmentation

Title: Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance

Title: Autonomous Crack Detection using Deep Learning on Synthetic Thermogram Datasets

Title: TrojFlow: Flow Models are Natural Targets for Trojan Attacks

Title: Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution

Title: SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis

Title: Learning for Cross-Layer Resource Allocation in MEC-Aided Cell-Free Networks

Title: REO-VLM: Transforming VLM to Meet Regression Challenges in Earth Observation

Title: Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances

Title: OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities

Title: Generalizable Articulated Object Perception with Superpoints

Title: Adversarial Attack Against Images Classification based on Generative Adversarial Networks

Title: Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer

Title: VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation

Title: TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models

Title: GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space

Title: ViM-Disparity: Bridging the Gap of Speed, Accuracy and Memory for Disparity Map Generation

Title: Solving Inverse Problems via Diffusion Optimal Control

Title: Paraformer: Parameterization of Sub-grid Scale Processes Using Transformers

Title: SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization

Title: RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing

Title: Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers

Title: Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets

Title: Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression

Title: Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation

Title: TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction

Title: Image Quality Assessment: Investigating Causal Perceptual Effects with Abductive Counterfactual Inference

Title: DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network

Title: PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask

Title: InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions

Title: Where am I? Cross-View Geo-localization with Natural Language Descriptions

Title: HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories

Title: Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation

Title: DreamOmni: Unified Image Generation and Editing

Title: Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Title: Generative Diffusion Modeling: A Practical Handbook

Title: Enhancing Item Tokenization for Generative Recommendation through Self-Improvement

Title: Foundation Model for Lossy Compression of Spatiotemporal Scientific Data

Title: Discriminative Image Generation with Diffusion Models for Zero-Shot Learning

Title: CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder

Title: OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving

Title: FedMeld: A Model-dispersal Federated Learning Framework for Space-ground Integrated Networks

Title: QTSeg: A Query Token-Based Architecture for Efficient 2D Medical Image Segmentation

Title: Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory

Title: Free-viewpoint Human Animation with Pose-correlated Reference Selection

Title: EcoSearch: A Constant-Delay Best-First Search Algorithm for Program Synthesis

Title: Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition

Title: FFA Sora, video generation as fundus fluorescein angiography simulator

Title: ORIGAMI: A generative transformer architecture for predictions from semi-structured data

Title: A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions

Title: Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement

Title: Multimodal Preference Data Synthetic Alignment with Reward Model

Title: CALLIC: Content Adaptive Learning for Lossless Image Compression

Title: Improving the Noise Estimation of Latent Neural Stochastic Differential Equations

Title: Constructing Fair Latent Space for Intersection of Fairness and Explainability

Title: S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field

Title: HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data

Title: EasyTime: Time Series Forecasting Made Easy

Title: SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images

Title: Personalized Large Vision-Language Models

Title: Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models

Title: SCBench: A Sports Commentary Benchmark for Video LLMs

Title: DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder

Title: Benchmarking Generative AI Models for Deep Learning Test Input Generation

Title: A Bias-Free Training Paradigm for More General AI-generated Image Detection

Title: GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance

Title: VidTwin: Video VAE with Decoupled Structure and Dynamics

Title: Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning

Title: Reasoning to Attend: Try to Understand How Token Works

Title: The Superposition of Diffusion Models Using the It\^o Density Estimator

Title: Large Motion Video Autoencoding with Cross-modal Video VAE

Title: Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Title: ChatGarment: Garment Estimation, Generation and Editing via Large Language Models

Title: FaceLift: Single Image to 3D Head with View Generation and GS-LRM