2024-10-04

Title: Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects

Title: PixelBytes: Catching Unified Representation for Multimodal Generation

Title: OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning

Title: Social Media Authentication and Combating Deepfakes using Semi-fragile Invisible Image Watermarking

Title: A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Title: TAEGAN: Generating Synthetic Tabular Data For Data Augmentation

Title: Discrete Copula Diffusion

Title: Score-based pullback Riemannian geometry

Title: Generate then Refine: Data Augmentation for Zero-shot Intent Detection

Title: UlcerGPT: A Multimodal Approach Leveraging Large Language and Vision Models for Diabetic Foot Ulcer Image Transcription

Title: Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models

Title: Semi-Supervised Fine-Tuning of Vision Foundation Models with Content-Style Decomposition

Title: Learning from the Giants: A Practical Approach to Underwater Depth and Surface Normals Estimation

Title: Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Title: Deep Generative Modeling for Identification of Noisy, Non-Stationary Dynamical Systems

Title: EMMA: Efficient Visual Alignment in Multi-Modal LLMs

Title: FARM: Functional Group-Aware Representations for Small Molecules

Title: HyperBrain: Anomaly Detection for Temporal Hypergraph Brain Networks

Title: EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

Title: Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks

Title: C-MELT: Contrastive Enhanced Masked Auto-Encoders for ECG-Language Pre-Training

Title: TrajGPT: Irregular Time-Series Representation Learning for Health Trajectory Analysis

Title: Plug-and-Play Controllable Generation for Discrete Masked Models

Title: Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis

Title: Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting

Title: Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference

Title: Hard Negative Sample Mining for Whole Slide Image Classification

Title: Mitigating Downstream Model Risks via Model Provenance

Title: SCA: Highly Efficient Semantic-Consistent Unrestricted Adversarial Attack

Title: PFGuard: A Generative Framework with Privacy and Fairness Safeguards

Title: Correlation and Navigation in the Vocabulary Key Representation Space of Language Models

Title: Make Compound Sentences Simple to Analyze: Learning to Split Sentences for Aspect-based Sentiment Analysis

Title: Decoupling Layout from Glyph in Online Chinese Handwriting Generation

Title: Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis

Title: Simplicity bias and optimization threshold in two-layer ReLU networks

Title: From Concrete to Abstract: A Multimodal Generative Approach to Abstract Concept Learning

Title: Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation

Title: BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning

Title: SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations

Title: Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

Title: PnP-Flow: Plug-and-Play Image Restoration with Flow Matching

Title: Learning the Latent Rules of a Game from Data: A Chess Story

Title: Personalized Federated Learning for Generative AI-Assisted Semantic Communications

Title: Towards a Theoretical Understanding of Memorization in Diffusion Models

Title: Event-Customized Image Generation

Title: SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation

Title: Learning from Offline Foundation Features with Tensor Augmentations

Title: Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching

Title: Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions

Title: Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Title: Diffusion & Adversarial Schr\"odinger Bridges via Iterative Proportional Markovian Fitting

Title: Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers

Title: Undesirable Memorization in Large Language Models: A Survey

Title: Measuring and Improving Persuasiveness of Generative Models

Title: Scalable Simulation-free Entropic Unbalanced Optimal Transport

Title: GUD: Generation with Unified Diffusion

Title: ControlAR: Controllable Image Generation with Autoregressive Models

Title: SteerDiff: Steering towards Safe Text-to-Image Diffusion Models

Title: NETS: A Non-Equilibrium Transport Sampler

Title: SynthFormer: Equivariant Pharmacophore-based Generation of Molecules for Ligand-Based Drug Design

Title: Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization

Title: Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation

Title: Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Title: Contrastive Localized Language-Image Pre-Training

Title: ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

Title: FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models