secure

Title: Deep Learning for Iris Recognition: A Review. (arXiv:2303.08514v1 [cs.CV])

Title: Efficient and Secure Federated Learning for Financial Applications. (arXiv:2303.08355v1 [cs.LG])

security

Title: Real Face Foundation Representation Learning for Generalized Deepfake Detection. (arXiv:2303.08439v1 [cs.CV])

Title: Exploring Large-scale Unlabeled Faces to Enhance Facial Expression Recognition. (arXiv:2303.08617v1 [cs.CV])

Title: Compact and Divisible E-Cash with Threshold Issuance. (arXiv:2303.08221v1 [cs.CR])

privacy

Title: Fashion-model pose recommendation and generation using Machine Learning. (arXiv:2303.08660v1 [cs.CV])

protect

defense

attack

Title: Model Extraction Attacks on Split Federated Learning. (arXiv:2303.08581v1 [cs.LG])

robust

Title: Rotation-Invariant Transformer for Point Cloud Matching. (arXiv:2303.08231v1 [cs.CV])

Title: Improving Adversarial Robustness with Hypersphere Embedding and Angular-based Regularizations. (arXiv:2303.08289v1 [cs.LG])

Title: Guided Slot Attention for Unsupervised Video Object Segmentation. (arXiv:2303.08314v1 [cs.CV])

Title: Rethinking Optical Flow from Geometric Matching Consistent Perspective. (arXiv:2303.08384v1 [cs.CV])

Title: A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval. (arXiv:2303.08398v1 [cs.CV])

Title: Learning Accurate Template Matching with Differentiable Coarse-to-Fine Correspondence Refinement. (arXiv:2303.08438v1 [cs.CV])

Title: BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection. (arXiv:2303.08498v1 [cs.CV])

Title: MGA: Medical generalist agent through text-guided knowledge transformation. (arXiv:2303.08562v1 [cs.CV])

Title: MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving. (arXiv:2303.08600v1 [cs.CV])

Title: Improving Fast Auto-Focus with Event Polarity. (arXiv:2303.08611v1 [cs.CV])

Title: Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video. (arXiv:2303.08670v1 [cs.CV])

Title: RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters. (arXiv:2303.08695v1 [cs.CV])

Title: Attention-likelihood relationship in transformers. (arXiv:2303.08288v1 [cs.CL])

Title: PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning. (arXiv:2303.08389v1 [cs.CL])

Title: Systematic design space exploration by learning the explored space using Machine Learning. (arXiv:2303.08249v1 [cs.LG])

biometric

steal

extraction

Title: DICNet: Deep Instance-Level Contrastive Network for Double Incomplete Multi-View Multi-Label Classification. (arXiv:2303.08358v1 [cs.CV])

Title: Quality evaluation of point clouds: a novel no-reference approach using transformer-based architecture. (arXiv:2303.08634v1 [cs.CV])

Title: Economical Quaternion Extraction from a Human Skeletal Pose Estimate using 2-D Cameras. (arXiv:2303.08657v1 [cs.CV])

Title: Multi-Exposure HDR Composition by Gated Swin Transformer. (arXiv:2303.08704v1 [cs.CV])

Title: Contextualized Medication Information Extraction Using Transformer-based Deep Learning Architectures. (arXiv:2303.08259v1 [cs.CL])

Materials and methods: We developed NLP systems for medication mention extraction, event classification (indicating medication changes discussed or not), and context classification to classify medication changes context into 5 orthogonal dimensions related to drug changes. We explored 6 state-of-the-art pretrained transformer models for the three subtasks, including GatorTron, a large language model pretrained using >90 billion words of text (including >80 billion words from >290 million clinical notes identified at the University of Florida Health). We evaluated our NLP systems using annotated data and evaluation scripts provided by the 2022 n2c2 organizers.

Results:Our GatorTron models achieved the best F1-scores of 0.9828 for medication extraction (ranked 3rd), 0.9379 for event classification (ranked 2nd), and the best micro-average accuracy of 0.9126 for context classification. GatorTron outperformed existing transformer models pretrained using smaller general English text and clinical text corpora, indicating the advantage of large language models.

Conclusion: This study demonstrated the advantage of using large transformer models for contextual medication information extraction from clinical narratives.

Title: Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension. (arXiv:2303.08262v1 [cs.CL])

Methods: We formulate both clinical concept extraction and relation extraction using a unified prompt-based MRC architecture and explore state-of-the-art transformer models. We compare our MRC models with existing deep learning models for concept extraction and end-to-end relation extraction using two benchmark datasets developed by the 2018 National NLP Clinical Challenges (n2c2) challenge (medications and adverse drug events) and the 2022 n2c2 challenge (relations of social determinants of health [SDoH]). We also evaluate the transfer learning ability of the proposed MRC models in a cross-institution setting. We perform error analyses and examine how different prompting strategies affect the performance of MRC models.

Results and Conclusion: The proposed MRC models achieve state-of-the-art performance for clinical concept and relation extraction on the two benchmark datasets, outperforming previous non-MRC transformer models. GatorTron-MRC achieves the best strict and lenient F1-scores for concept extraction, outperforming previous deep learning models on the two datasets by 1%~3% and 0.7%~1.3%, respectively. For end-to-end relation extraction, GatorTron-MRC and BERT-MIMIC-MRC achieve the best F1-scores, outperforming previous deep learning models by 0.9%~2.4% and 10%-11%, respectively. For cross-institution evaluation, GatorTron-MRC outperforms traditional GatorTron by 6.4% and 16% for the two datasets, respectively. The proposed method is better at handling nested/overlapped concepts, extracting relations, and has good portability for cross-institute applications.

Title: A Cross-institutional Evaluation on Breast Cancer Phenotyping NLP Algorithms on Electronic Health Records. (arXiv:2303.08448v1 [cs.CL])

Materials and Methods: Two clinical corpora of breast cancer patients were collected from the electronic health records from the University of Minnesota and the Mayo Clinic, and annotated following the same guideline. We developed three types of NLP models (i.e., conditional random field, bi-directional long short-term memory and CancerBERT) to extract cancer phenotypes from clinical texts. The models were evaluated for their generalizability on different test sets with different learning strategies (model transfer vs. locally trained). The entity coverage score was assessed with their association with the model performances.

Results: We manually annotated 200 and 161 clinical documents at UMN and MC, respectively. The corpora of the two institutes were found to have higher similarity between the target entities than the overall corpora. The CancerBERT models obtained the best performances among the independent test sets from two clinical institutes and the permutation test set. The CancerBERT model developed in one institute and further fine-tuned in another institute achieved reasonable performance compared to the model developed on local data (micro-F1: 0.925 vs 0.932).

Conclusions: The results indicate the CancerBERT model has the best learning ability and generalizability among the three types of clinical NLP models. The generalizability of the models was found to be correlated with the similarity of the target entities between the corpora.

Title: Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!. (arXiv:2303.08559v1 [cs.CL])

Title: GCRE-GPT: A Generative Model for Comparative Relation Extraction. (arXiv:2303.08601v1 [cs.CL])

membership infer

federate

Title: Visual Prompt Based Personalized Federated Learning. (arXiv:2303.08678v1 [cs.LG])

Title: Optimization Design for Federated Learning in Heterogeneous 6G Networks. (arXiv:2303.08322v1 [cs.LG])

fair

Title: FairAdaBN: Mitigating unfairness with adaptive batch normalization and its application to dermatological disease classification. (arXiv:2303.08325v1 [cs.LG])

Title: Graph Neural Network Surrogates of Fair Graph Filtering. (arXiv:2303.08157v1 [cs.LG])

Title: DualFair: Fair Representation Learning at Both Group and Individual Levels via Contrastive Self-supervision. (arXiv:2303.08403v1 [cs.LG])

Title: Fair Off-Policy Learning from Observational Data. (arXiv:2303.08516v1 [cs.LG])

interpretability

explainability

watermark

diffusion

Title: Decomposed Diffusion Models for High-Quality Video Generation. (arXiv:2303.08320v1 [cs.CV])

Title: DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception. (arXiv:2303.08333v1 [cs.CV])

Title: Uncertainty-Aware Pedestrian Trajectory Prediction via Distributional Diffusion. (arXiv:2303.08367v1 [cs.CV])

Title: The Devil's Advocate: Shattering the Illusion of Unexploitable Data using Diffusion Models. (arXiv:2303.08500v1 [cs.LG])

Title: Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer. (arXiv:2303.08622v1 [cs.CV])

Title: ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution. (arXiv:2303.08714v1 [cs.CV])

Title: DiffusionAD: Denoising Diffusion for Anomaly Detection. (arXiv:2303.08730v1 [cs.CV])

Title: Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion. (arXiv:2303.08767v1 [cs.CV])

Title: On the uncertainty analysis of the data-enabled physics-informed neural network for solving neutron diffusion eigenvalue problem. (arXiv:2303.08455v1 [cs.LG])

Title: Stochastic Interpolants: A Unifying Framework for Flows and Diffusions. (arXiv:2303.08797v1 [cs.LG])