2024-11-19

Title: Boundary Attention Constrained Zero-Shot Layout-To-Image Generation

Title: Prompt-Guided Environmentally Consistent Adversarial Patch

Title: FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Title: OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models

Title: Everything is a Video: Unifying Modalities through Next-Frame Prediction

Title: DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration

Title: SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers

Title: On the Privacy Risk of In-context Learning

Title: Any2Any: Incomplete Multimodal Retrieval with Conformal Prediction

Title: "On the goals of linguistic theory": Revisiting Chomskyan theories in the era of AI

Title: Does Prompt Formatting Have Any Impact on LLM Performance?

Title: SoftLMs: Efficient Adaptive Low-Rank Approximation of Language Models using Soft-Thresholding Mechanism

Title: Motion Diffusion-Guided 3D Global HMR from a Dynamic Camera

Title: Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data

Title: IntentGPT: Few-shot Intent Discovery with Large Language Models

Title: From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling

Title: MaskMedPaint: Masked Medical Image Inpainting with Diffusion Models for Mitigation of Spurious Correlations

Title: Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection

Title: A Regularized LSTM Method for Detecting Fake News Articles

Title: Multi Scale Graph Neural Network for Alzheimer's Disease

Title: On-device Anomaly Detection in Conveyor Belt Operations

Title: TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition

Title: Steam Turbine Anomaly Detection: An Unsupervised Learning Approach Using Enhanced Long Short-Term Memory Variational Autoencoder

Title: Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer

Title: C-DiffSET: Leveraging Latent Diffusion for SAR-to-EO Image Translation with Confidence-Guided Reliable Object Generation

Title: Anatomy-Guided Radiology Report Generation with Pathology-Aware Regional Prompts

Title: Test-time Conditional Text-to-Image Synthesis Using Diffusion Models

Title: Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay

Title: Conformation Generation using Transformer Flows

Title: One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Title: Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation

Title: Large Vision-Language Models for Remote Sensing Visual Question Answering

Title: See-Saw Generative Mechanism for Scalable Recursive Code Generation with Generative AI

Title: Improvement in Facial Emotion Recognition using Synthetic Data Generated by Diffusion Model

Title: MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation

Title: Watermarking Generative Categorical Data

Title: SPICA: Retrieving Scenarios for Pluralistic In-Context Alignment

Title: Generating Compositional Scenes via Text-to-image RGBA Instance Generation

Title: LLM-assisted Physical Invariant Extraction for Cyber-Physical Systems Anomaly Detection

Title: Multi-Modal Self-Supervised Learning for Surgical Feedback Effectiveness Assessment

Title: Constrained Diffusion with Trust Sampling

Title: Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion

Title: Anomaly Detection for People with Visual Impairments Using an Egocentric 360-Degree Camera

Title: Direct and Explicit 3D Generation from a Single Image

Title: Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering

Title: TeG: Temporal-Granularity Method for Anomaly Detection with Attention in Smart City Surveillance

Title: Time Step Generating: A Universal Synthesized Deepfake Image Detector

Title: StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

Title: D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification

Title: Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Title: Infinite Width Limits of Self Supervised Neural Networks

Title: Enhanced Anime Image Generation Using USE-CMHSA-GAN

Title: AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers

Title: SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach

Title: Stealing Training Graphs from Graph Neural Networks

Title: Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition

Title: Efficient Transfer Learning for Video-language Foundation Models

Title: MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis

Title: ZeFaV: Boosting Large Language Models for Zero-shot Fact Verification

Title: Effective Predictive Modeling for Emergency Department Visits and Evaluating Exogenous Variables Impact: Using Explainable Meta-learning Gradient Boosting

Title: Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications

Title: SADDE: Semi-supervised Anomaly Detection with Dependable Explanations

Title: Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation

Title: Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge

Title: TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition

Title: LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes

Title: CLUE-MARK: Watermarking Diffusion Models using CLWE

Title: The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather

Title: MGNiceNet: Unified Monocular Geometric Scene Understanding

Title: MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion

Title: Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models

Title: LaVin-DiT: Large Vision Diffusion Transformer

Title: Learning a Neural Association Network for Self-supervised Multi-Object Tracking

Title: Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation

Title: SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions

Title: Generative Spatio-temporal GraphNet for Transonic Wing Pressure Distribution Forecasting

Title: Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining

Title: Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare

Title: TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection

Title: Conceptwm: A Diffusion Model Watermark for Concept Protection

Title: Robust Reinforcement Learning under Diffusion Models for Data with Jumps

Title: Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

Title: BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration

Title: LLM-IE: A Python Package for Generative Information Extraction with Large Language Models

Title: Generative World Explorer