2024-11-08

Title: DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation

Title: Quantum Diffusion Models for Few-Shot Learning

Title: PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing

Title: Generative Discrete Event Process Simulation for Hidden Markov Models to Predict Competitor Time-to-Market

Title: Enhancing Security Control Production With Generative AI

Title: Efficient Symmetry-Aware Materials Generation via Hierarchical Generative Flow Networks

Title: HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images

Title: GazeGen: Gaze-Driven User Interaction for Visual Content Generation

Title: MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation

Title: Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation

Title: TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask Transformer-Based Spatiotemporal Model

Title: Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method

Title: Bayesian Calibration of Win Rate Estimation with LLM Evaluators

Title: Scaling Laws for Pre-training Agents and World Models

Title: Comparing Fairness of Generative Mobility Models

Title: Series-to-Series Diffusion Bridge Model

Title: Hypercube Policy Regularization Framework for Offline Reinforcement Learning

Title: Meta-Reasoning Improves Tool Use in Large Language Models

Title: Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis

Title: Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning

Title: DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning

Title: Social EgoMesh Estimation

Title: Solar potential analysis over Indian cities using high-resolution satellite imagery and DEM

Title: Brain Tumour Removing and Missing Modality Generation using 3D WDM

Title: DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction

Title: From CNN to ConvRNN: Adapting Visualization Techniques for Time-Series Anomaly Detection

Title: TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Title: SEE-DPO: Self Entropy Enhanced Direct Preference Optimization

Title: Multi-Reward as Condition for Instruction-based Image Editing

Title: Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation

Title: Taming Rectified Flow for Inversion and Editing

Title: Attention Masks Help Adversarial Attacks to Bypass Safety Detectors

Title: Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet

Title: End-to-end Inception-Unet based Generative Adversarial Networks for Snow and Rain Removals

Title: VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models

Title: D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes

Title: OneProt: Towards Multi-Modal Protein Foundation Models

Title: Boosting Latent Diffusion with Perceptual Objectives

Title: In the Era of Prompt Learning with Vision-Language Models

Title: GASE: Generatively Augmented Sentence Encoding

Title: MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views

Title: DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Title: CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM

Title: Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification

Title: VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes

Title: SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation

Title: Clustering in Causal Attention Masking

Title: Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Title: ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Title: Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Title: ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing

Title: SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models