secure

Title: The offline digital currency puzzle solved by a local blockchain. (arXiv:2305.02290v1 [cs.CR])

security

Title: Illicit item detection in X-ray images for security applications. (arXiv:2305.01936v1 [cs.CV])

Title: Deep Learning-Based Multiband Signal Fusion for 3-D SAR Super-Resolution. (arXiv:2305.02017v1 [cs.CV])

Title: GANonymization: A GAN-based Face Anonymization Framework for Preserving Emotional Expressions. (arXiv:2305.02143v1 [cs.CV])

Title: AutoLock: Automatic Design of Logic Locking with Evolutionary Computation. (arXiv:2305.01840v1 [cs.CR])

privacy

Title: A Systematic Study on Object Recognition Using Millimeter-wave Radar. (arXiv:2305.02085v1 [cs.CV])

Title: Data Privacy with Homomorphic Encryption in Neural Networks Training and Inference. (arXiv:2305.02225v1 [cs.CR])

Title: A Survey on Dataset Distillation: Approaches, Applications and Future Directions. (arXiv:2305.01975v1 [cs.LG])

protect

defense

attack

Title: LearnDefend: Learning to Defend against Targeted Model-Poisoning Attacks on Federated Learning. (arXiv:2305.02022v1 [cs.LG])

robust

Title: Out-of-distribution detection algorithms for robust insect classification. (arXiv:2305.01823v1 [cs.CV])

Title: Class adaptive threshold and negative class guided noisy annotation robust Facial Expression Recognition. (arXiv:2305.01884v1 [cs.CV])

Title: Real-Time Radiance Fields for Single-Image Portrait View Synthesis. (arXiv:2305.02310v1 [cs.CV])

Title: Robust Natural Language Watermarking through Invariant Features. (arXiv:2305.01904v1 [cs.CL])

Title: A Curriculum View of Robust Loss Functions. (arXiv:2305.02139v1 [cs.LG])

Title: The Benefits of Label-Description Training for Zero-Shot Text Classification. (arXiv:2305.02239v1 [cs.CL])

Title: Single-model uncertainty quantification in neural network potentials does not consistently outperform model ensembles. (arXiv:2305.01754v1 [cs.LG])

Title: MolKD: Distilling Cross-Modal Knowledge in Chemical Reactions for Molecular Property Prediction. (arXiv:2305.01912v1 [cs.LG])

Title: Rethinking Graph Lottery Tickets: Graph Sparsity Matters. (arXiv:2305.02190v1 [cs.LG])

biometric

Title: Localization using Multi-Focal Spatial Attention for Masked Face Recognition. (arXiv:2305.01905v1 [cs.CV])

steal

extraction

Title: High-Resolution Synthetic RGB-D Datasets for Monocular Depth Estimation. (arXiv:2305.01732v1 [cs.CV])

In this paper, we generate a high-resolution synthetic depth dataset (HRSD) of dimension 1920 X 1080 from Grand Theft Auto (GTA-V), which contains 100,000 color images and corresponding dense ground truth depth maps. The generated datasets are diverse and have scenes from indoors to outdoors, from homogeneous surfaces to textures. For experiments and analysis, we train the DPT algorithm, a state-of-the-art transformer-based MDE algorithm on the proposed synthetic dataset, which significantly increases the accuracy of depth maps on different scenes by 9 %. Since the synthetic datasets are of higher resolution, we propose adding a feature extraction module in the transformer encoder and incorporating an attention-based loss, further improving the accuracy by 15 %.

Title: LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation. (arXiv:2305.01837v1 [cs.CV])

Title: Evolving Dictionary Representation for Few-shot Class-incremental Learning. (arXiv:2305.01885v1 [cs.LG])

Title: Improved Static Hand Gesture Classification on Deep Convolutional Neural Networks using Novel Sterile Training Technique. (arXiv:2305.02039v1 [cs.CV])

Title: Rethinking the Encoding of Satellite Image Time Series. (arXiv:2305.02086v1 [cs.CV])

Title: Causality-aware Concept Extraction based on Knowledge-guided Prompting. (arXiv:2305.01876v1 [cs.CL])

Title: Generative Meta-Learning for Zero-Shot Relation Triplet Extraction. (arXiv:2305.01920v1 [cs.CL])

Title: Natural language processing on customer note data. (arXiv:2305.02029v1 [cs.CL])

Title: GPT-RE: In-context Learning for Relation Extraction using Large Language Models. (arXiv:2305.02105v1 [cs.CL])

In this paper, we propose GPT-RE to bridge the gap between LLMs and fully-supervised baselines. GPT-RE successfully addresses the aforementioned issues by (1) incorporating task-specific entity representations in demonstration retrieval; and (2) enriching the demonstrations with gold label-induced reasoning logic. We evaluate GPT-RE on four widely-used RE datasets, and observe that GPT-RE achieves improvements over not only existing GPT-3 baselines, but also fully-supervised baselines. Specifically, GPT-RE achieves SOTA performances on the Semeval and SciERC datasets, and competitive performances on the TACRED and ACE05 datasets.

membership infer

federate

Title: Scalable Data Point Valuation in Decentralized Learning. (arXiv:2305.01657v1 [cs.LG])

Title: LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning. (arXiv:2305.02219v1 [cs.LG])

fair

Title: Fairness in AI Systems: Mitigating gender bias from language-vision models. (arXiv:2305.01888v1 [cs.CV])

Title: Few-shot Event Detection: An Empirical Study and a Unified View. (arXiv:2305.01901v1 [cs.CL])

Title: Explaining Language Models' Predictions with High-Impact Concepts. (arXiv:2305.02160v1 [cs.CL])

Title: Fairness and representation in satellite-based poverty maps: Evidence of urban-rural disparities and their impacts on downstream policy. (arXiv:2305.01783v1 [cs.LG])

interpretability

Title: Fashionpedia-Taste: A Dataset towards Explaining Human Fashion Taste. (arXiv:2305.02307v1 [cs.CV])

Title: Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings. (arXiv:2305.02317v1 [cs.CL])

Title: Stars Are All You Need: A Distantly Supervised Pyramid Network for Document-Level End-to-End Sentiment Analysis. (arXiv:2305.01710v1 [cs.CL])

Title: Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks. (arXiv:2305.01713v1 [cs.CL])

Title: Transferablility of coVariance Neural Networks and Application to Interpretable Brain Age Prediction using Anatomical Features. (arXiv:2305.01807v1 [cs.LG])

explainability

watermark

diffusion

Title: Multimodal Data Augmentation for Image Captioning using Diffusion Models. (arXiv:2305.01855v1 [cs.CV])

Title: DiffFacto Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion. (arXiv:2305.01921v1 [cs.CV])

Title: DiffuSum: Generation Enhanced Extractive Summarization with Diffusion. (arXiv:2305.01735v1 [cs.CL])

Title: Multimodal Procedural Planning via Dual Text-Image Prompting. (arXiv:2305.01795v1 [cs.CL])

Title: Unpaired Downscaling of Fluid Flows with Diffusion Bridges. (arXiv:2305.01822v1 [cs.LG])

noise learning

data-free

transformer

Title: "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization. (arXiv:2305.01979v1 [cs.CV])

Title: Unsupervised Mutual Transformer Learning for Multi-Gigapixel Whole Slide Image Classification. (arXiv:2305.02032v1 [cs.CV])

Title: A Vision Transformer Approach for Efficient Near-Field Irregular SAR Super-Resolution. (arXiv:2305.02074v1 [cs.CV])

Title: Learngene: Inheriting Condensed Knowledge from the Ancestry Model to Descendant Models. (arXiv:2305.02279v1 [cs.LG])

Title: DynamicStereo: Consistent Dynamic Depth from Stereo Videos. (arXiv:2305.02296v1 [cs.CV])

Title: SeqAug: Sequential Feature Resampling as a modality agnostic augmentation method. (arXiv:2305.01954v1 [cs.CL])

Title: Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity. (arXiv:2305.02176v1 [cs.CL])

Title: Exploring Linguistic Properties of Monolingual BERTs with Typological Classification among Languages. (arXiv:2305.02215v1 [cs.CL])

Title: A Lightweight CNN-Transformer Model for Learning Traveling Salesman Problems. (arXiv:2305.01883v1 [cs.LG])

generative

Title: Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime. (arXiv:2305.02297v1 [cs.CV])

Title: AG3D: Learning to Generate 3D Avatars from 2D Image Collections. (arXiv:2305.02312v1 [cs.CV])

Title: Nonparametric Generative Modeling with Conditional and Locally-Connected Sliced-Wasserstein Flows. (arXiv:2305.02164v1 [cs.LG])

large language model

Title: Few-shot In-context Learning for Knowledge Base Question Answering. (arXiv:2305.01750v1 [cs.CL])

Title: SCOTT: Self-Consistent Chain-of-Thought Distillation. (arXiv:2305.01879v1 [cs.CL])

Title: Can Large Language Models Be an Alternative to Human Evaluations?. (arXiv:2305.01937v1 [cs.CL])

Title: Clinical Note Generation from Doctor-Patient Conversations using Large Language Models: Insights from MEDIQA-Chat. (arXiv:2305.02220v1 [cs.CL])

Title: Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes. (arXiv:2305.02301v1 [cs.CL])

Title: CodeGen2: Lessons for Training LLMs on Programming and Natural Languages. (arXiv:2305.02309v1 [cs.LG])

In this study, we attempt to render the training of LLMs for program synthesis more efficient by unifying four key components: (1) model architectures, (2) learning methods, (3) infill sampling, and, (4) data distributions. Specifically, for the model architecture, we attempt to unify encoder and decoder-based models into a single prefix-LM. For learning methods, (i) causal language modeling, (ii) span corruption, (iii) infilling are unified into a simple learning algorithm. For infill sampling, we explore the claim of a "free lunch" hypothesis. For data distributions, the effect of a mixture distribution of programming and natural languages on model performance is explored.

We conduct a comprehensive series of empirical experiments on 1B LLMs, for which failures and successes of this exploration are distilled into four lessons. We will provide a final recipe for training and release CodeGen2 models in size 1B, 3.7B, 7B, and, 16B parameters, along with the training framework as open-source: https://github.com/salesforce/CodeGen2.

segmentation

Title: DeepAqua: Self-Supervised Semantic Segmentation of Wetlands from SAR Images using Knowledge Distillation. (arXiv:2305.01698v1 [cs.CV])

Title: Expectation Maximization Pseudo Labelling for Segmentation with Limited Annotations. (arXiv:2305.01747v1 [cs.CV])

Title: AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation. (arXiv:2305.01836v1 [cs.CV])

Title: Morphological Classification of Galaxies Using SpinalNet. (arXiv:2305.01873v1 [cs.LG])

Title: Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN. (arXiv:2305.01910v1 [cs.CV])

Title: Zenseact Open Dataset: A large-scale and diverse multimodal dataset for autonomous driving. (arXiv:2305.02008v1 [cs.CV])

Title: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model. (arXiv:2305.02034v1 [cs.CV])

Title: CLUSTSEG: Clustering for Universal Segmentation. (arXiv:2305.02187v1 [cs.CV])