2024-03-12

Title: Audio-Synchronized Visual Animation

Title: DP-TabICL: In-Context Learning with Differentially Private Tabular Data

Title: SeeGULL Multilingual: a Dataset of Geo-Culturally Situated Stereotypes

Title: A Benchmark of Domain-Adapted Large Language Models for Generating Brief Hospital Course Summaries

Title: Inception Attacks: Immersive Hijacking in Virtual Reality Systems

Title: Augmentations vs Algorithms: What Works in Self-Supervised Learning

Title: MG-TSD: Multi-Granularity Time Series Diffusion Models with Guided Learning Process

Title: uniGradICON: A Foundation Model for Medical Image Registration

Title: Privacy-Preserving Diffusion Model Using Homomorphic Encryption

Title: ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes

Title: A self-supervised CNN for image watermark removal

Title: Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

Title: SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection

Title: TrafficGPT: Breaking the Token Barrier for Efficient Long Traffic Analysis and Generation

Title: Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

Title: LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

Title: DO3D: Self-supervised Learning of Decomposed Object-aware 3D Motion and Depth from Monocular Videos

Title: RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection

Title: SEMRes-DDPM: Residual Network Based Diffusion Modelling Applied to Imbalanced Data

Title: Learned 3D volumetric recovery of clouds and its uncertainty for climate analysis

Title: General surgery vision transformer: A video pre-trained foundation model for general surgery

Title: Can Generative Models Improve Self-Supervised Representation Learning?

Title: Few-Shot Cross-Lingual Transfer for Prompting Large Language Models in Low-Resource Languages

Title: Multi-conditioned Graph Diffusion for Neural Architecture Search

Title: Reframe Anything: LLM Agent for Open World Video Reframing

Title: FrameQuant: Flexible Low-Bit Quantization for Transformers

Title: Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models

Title: Diffusion Models Trained with Large Data Are Transferable Visual Models

Title: VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

Title: Coherent Temporal Synthesis for Incremental Action Segmentation

Title: Universal Debiased Editing for Fair Medical Image Classification

Title: In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model

Title: FedPIT: Towards Privacy-preserving and Few-shot Federated Instruction Tuning

Title: MACE: Mass Concept Erasure in Diffusion Models

Title: RESTORE: Towards Feature Shift for Vision-Language Prompt Learning

Title: Bayesian Random Semantic Data Augmentation for Medical Image Classification

Title: All-in-one platform for AI R&D in medical imaging, encompassing data collection, selection, annotation, and pre-processing

Title: GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection

Title: Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation

Title: DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Title: An Improved Analysis of Langevin Algorithms with Prior Diffusion for Non-Log-Concave Sampling

Title: Harmonious Group Choreography with Trajectory-Controllable Diffusion

Title: On depth prediction for autonomous driving using self-supervised learning

Title: Cooperative Classification and Rationalization for Graph Generalization

Title: Text-Guided Variational Image Generation for Industrial Anomaly Detection and Segmentation

Title: SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations

Title: FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing

Title: An End-to-End Deep Learning Generative Framework for Refinable Shape Matching and Generation

Title: Fake or Compromised? Making Sense of Malicious Clients in Federated Learning

Title: Transferable Reinforcement Learning via Generalized Occupancy Models

Title: Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Title: Say Anything with Any Style

Title: Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style

Title: Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

Title: A Zero Trust Framework for Realization and Defense Against Generative AI Attacks in Power Grid

Title: FSViewFusion: Few-Shots View Generation of Novel Objects

Title: DivCon: Divide and Conquer for Progressive Text-to-Image Generation

Title: 'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification

Title: PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models

Title: Comparison of No-Reference Image Quality Models via MAP Estimation in Diffusion Latents

Title: A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos

Title: Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain

Title: Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation

Title: 3D-aware Image Generation and Editing with Multi-modal Conditions

Title: Toward Generalist Anomaly Detection via In-context Residual Learning with Few-shot Sample Prompts

Title: Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning

Title: Active Generation for Image Classification

Title: OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation

Title: Detection of Object Throwing Behavior in Surveillance Videos

Title: Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology

Title: FFAD: A Novel Metric for Assessing Generated Time Series Data Utilizing Fourier Transform and Auto-encoder

Title: Distributionally Generative Augmentation for Fair Facial Attribute Classification

Title: Guiding Clinical Reasoning with Large Language Models via Knowledge Seeds

Title: Towards Zero-Shot Interpretable Human Recognition: A 2D-3D Registration Framework

Title: Car Damage Detection and Patch-to-Patch Self-supervised Image Alignment

Title: Trustworthy Partial Label Learning with Out-of-distribution Detection

Title: PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification

Title: Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback

Title: V3D: Video Diffusion Models are Effective 3D Generators

Title: Distribution-Aware Data Expansion with Diffusion Models

Title: Boosting Image Restoration via Priors from Pre-trained Models

Title: Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection

Title: Multistep Consistency Models

Title: In-context Exploration-Exploitation for Reinforcement Learning

Title: Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting

Title: Stochastic Cortical Self-Reconstruction

Title: QUASAR: QUality and Aesthetics Scoring with Advanced Representations

Title: Learning with Noisy Foundation Models

Title: COOD: Combined out-of-distribution detection using multiple measures for anomaly & novel class detection in large-scale hierarchical classification

Title: MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning

Title: DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations

Title: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Title: Bayesian Diffusion Models for 3D Shape Reconstruction

Title: BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion