secure

Title: EnSolver: Uncertainty-Aware CAPTCHA Solver Using Deep Ensembles. (arXiv:2307.15180v1 [cs.CV])

Title: PUF Probe: A PUF-based Hardware Authentication Equipment for IEDs. (arXiv:2307.15338v1 [cs.CR])

Title: Provably secure KEM-based protocols over unauthenticated channels. (arXiv:2307.15465v1 [cs.CR])

Title: S3C2 Summit 2202-09: Industry Secure Supply Chain Summit. (arXiv:2307.15642v1 [cs.CR])

security

privacy

Title: Deep Generative Models, Synthetic Tabular Data, and Differential Privacy: An Overview and Synthesis. (arXiv:2307.15424v1 [cs.LG])

protect

Title: Equitable Time-Varying Pricing Tariff Design: A Joint Learning and Optimization Approach. (arXiv:2307.15088v1 [cs.LG])

Title: The Initial Screening Order Problem. (arXiv:2307.15398v1 [cs.LG])

defense

Title: Backdoor Defense with Non-Adversarial Backdoor. (arXiv:2307.15539v1 [cs.LG])

attack

Title: Set-Membership Inference Attacks using Data Watermarking. (arXiv:2307.15067v1 [cs.CV])

Title: Detecting Morphing Attacks via Continual Incremental Training. (arXiv:2307.15105v1 [cs.CV])

Title: Adversarial training for tabular data with attack propagation. (arXiv:2307.15677v1 [cs.LG])

robust

Title: R-LPIPS: An Adversarially Robust Perceptual Similarity Metric. (arXiv:2307.15157v1 [cs.CV])

Title: D2S: Representing local descriptors and global scene coordinates for camera relocalization. (arXiv:2307.15250v1 [cs.CV])

Title: A Solution to Co-occurrence Bias: Attributes Disentanglement via Mutual Information Minimization for Pedestrian Attribute Recognition. (arXiv:2307.15252v1 [cs.CV])

Title: Attentive Multimodal Fusion for Optical and Scene Flow. (arXiv:2307.15301v1 [cs.CV])

Title: AffineGlue: Joint Matching and Robust Estimation. (arXiv:2307.15381v1 [cs.CV])

Title: Few-shot Image Classification based on Gradual Machine Learning. (arXiv:2307.15524v1 [cs.CV])

Title: Point Clouds Are Specialized Images: A Knowledge Transfer Approach for 3D Understanding. (arXiv:2307.15569v1 [cs.CV])

Title: PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding. (arXiv:2307.15692v1 [cs.CV])

Title: The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems. (arXiv:2307.15493v1 [cs.CL])

Title: Robust Distortion-free Watermarks for Language Models. (arXiv:2307.15593v1 [cs.LG])

Title: A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity. (arXiv:2307.15154v1 [cs.LG])

Title: Worrisome Properties of Neural Network Controllers and Their Symbolic Representations. (arXiv:2307.15456v1 [cs.LG])

Title: From continuous-time formulations to discretization schemes: tensor trains and robust regression for BSDEs and parabolic PDEs. (arXiv:2307.15496v1 [cs.LG])

biometric

steal

extraction

Title: One-shot Joint Extraction, Registration and Segmentation of Neuroimaging Data. (arXiv:2307.15198v1 [cs.CV])

membership infer

federate

Title: A Practical Recipe for Federated Learning Under Statistical Heterogeneity Experimental Design. (arXiv:2307.15245v1 [cs.LG])

Title: The Applicability of Federated Learning to Official Statistics. (arXiv:2307.15503v1 [cs.LG])

fair

Title: Is this model reliable for everyone? Testing for strong calibration. (arXiv:2307.15247v1 [cs.LG])

Title: LUCID-GAN: Conditional Generative Models to Locate Unfairness. (arXiv:2307.15466v1 [cs.LG])

interpretability

Title: Toward Transparent Sequence Models with Model-Based Tree Markov Model. (arXiv:2307.15367v1 [cs.LG])

Title: Bayesian Time-Series Classifier for Decoding Simple Visual Stimuli from Intracranial Neural Activity. (arXiv:2307.15672v1 [cs.LG])

explainability

watermark

diffusion

Title: Recovering high-quality FODs from a reduced number of diffusion-weighted images using a model-driven deep learning architecture. (arXiv:2307.15273v1 [cs.CV])

noise learning

data-free

transformer

Title: Writer adaptation for offline text recognition: An exploration of neural network-based methods. (arXiv:2307.15071v1 [cs.CV])

Title: DocDeshadower: Frequency-aware Transformer for Document Shadow Removal. (arXiv:2307.15318v1 [cs.CV])

Title: TaskExpert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-Experts. (arXiv:2307.15324v1 [cs.CV])

Title: BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering. (arXiv:2307.15335v1 [cs.CL])

Title: Prompt Guided Transformer for Multi-Task Dense Prediction. (arXiv:2307.15362v1 [cs.CV])

Title: MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking. (arXiv:2307.15700v1 [cs.CV])

Title: Cascaded Cross-Modal Transformer for Request and Complaint Detection. (arXiv:2307.15097v1 [cs.CL])

Title: VISU at WASSA 2023 Shared Task: Detecting Emotions in Reaction to News Stories Leveraging BERT and Stacked Embeddings. (arXiv:2307.15164v1 [cs.CL])

Title: WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories. (arXiv:2307.15293v1 [cs.CL])

Title: The Road to Quality is Paved with Good Revisions: A Detailed Evaluation Methodology for Revision Policies in Incremental Sequence Labelling. (arXiv:2307.15508v1 [cs.CL])

Title: Universal Recurrent Event Memories for Streaming Data. (arXiv:2307.15694v1 [cs.LG])

generative

Title: Med-Flamingo: a Multimodal Medical Few-shot Learner. (arXiv:2307.15189v1 [cs.CV])

Title: Learning with Constraint Learning: New Perspective, Solution Strategy and Various Applications. (arXiv:2307.15257v1 [cs.CV])

Title: Staging E-Commerce Products for Online Advertising using Retrieval Assisted Image Generation. (arXiv:2307.15326v1 [cs.CV])

large language model

Title: RSGPT: A Remote Sensing Vision Language Model and Benchmark. (arXiv:2307.15266v1 [cs.CV])

Title: ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation. (arXiv:2307.15290v1 [cs.CL])

Title: TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a Domain-Specific Expert in Transportation Safety. (arXiv:2307.15311v1 [cs.CL])

Title: Tutorials on Stance Detection using Pre-trained Language Models: Fine-tuning BERT and Prompting Large Language Models. (arXiv:2307.15331v1 [cs.CL])

Title: Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding. (arXiv:2307.15337v1 [cs.CL])

Title: Med-HALT: Medical Domain Hallucination Test for Large Language Models. (arXiv:2307.15343v1 [cs.CL])

Our study evaluated leading LLMs, including Text Davinci, GPT-3.5, LlaMa-2, MPT, and Falcon, revealing significant differences in their performance. The paper provides detailed insights into the dataset, promoting transparency and reproducibility. Through this work, we aim to contribute to the development of safer and more reliable language models in healthcare. Our benchmark can be found at medhalt.github.io

Title: Investigating the Learning Behaviour of In-context Learning: A Comparison with Supervised Learning. (arXiv:2307.15411v1 [cs.CL])

Title: A Critical Review of Large Language Models: Sensitivity, Bias, and the Path Toward Specialized AI. (arXiv:2307.15425v1 [cs.CL])

segmentation

Title: AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization. (arXiv:2307.15282v1 [cs.CV])

Title: Local and Global Information in Obstacle Detection on Railway Tracks. (arXiv:2307.15478v1 [cs.CV])

Title: Improving Image Quality of Sparse-view Lung Cancer CT Images with a Convolutional Neural Network. (arXiv:2307.15506v1 [cs.CV])

Methods: CT images from 41 subjects (34 with lung cancer, seven healthy) were retrospectively selected (January 2016 to December 2018) and forward projected onto 2048-view sinograms. Six corresponding sparse-view CT data subsets at varying levels of undersampling were reconstructed from the sinograms using filtered backprojection with 16, 32, 64, 128, 256, and 512 views, respectively. A dual-frame U-Net was trained and evaluated for each subsampling level on 8,658 images from 22 diseased subjects. One representative image per scan was selected from 19 subjects (12 diseased, seven healthy) for a single-blinded reader study. The selected slices, for all levels of subsampling, with and without post-processing by the U-Net model, were presented to three readers. Image quality and diagnostic confidence were ranked using pre-defined scales. Subjective nodule segmentation was evaluated using sensitivity (Se) and the Dice Similarity Coefficient (DSC) with 95% confidence intervals (CI).

Results: The 64-projection sparse-view images resulted in Se = 0.89 and DSC = 0.81 [0.75,0.86], while their counterparts, post-processed with the U-Net, had improved metrics (Se = 0.94, DSC = 0.85 [0.82,0.87]). Fewer views led to insufficient quality for diagnostic purposes. With more views, no substantial discrepancies were noted between the sparse-view and post-processed images.

Conclusion: Projection views can be reduced from 2048 to 64 while maintaining image quality and radiologists' confidence at a satisfactory level.
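
For readers unfamiliar with the two segmentation metrics reported in this abstract, the following is a minimal sketch of how sensitivity (Se) and the Dice Similarity Coefficient (DSC) are commonly computed from a pair of binary masks. It is not the authors' evaluation code: the function names, the flattened-list mask representation, and the toy masks are assumptions made for illustration, and the confidence intervals from the abstract are not reproduced.

```python
# Illustrative only: standard definitions of Se and DSC on binary masks.
# Masks are flat sequences of 0/1 values (hypothetical toy data, not study data).

def confusion_counts(pred, truth):
    """Count true positives, false positives and false negatives."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    tp = fp = fn = 0
    for p, t in zip(pred, truth):
        if p and t:
            tp += 1
        elif p and not t:
            fp += 1
        elif t and not p:
            fn += 1
    return tp, fp, fn


def sensitivity(pred, truth):
    """Se = TP / (TP + FN): fraction of true nodule pixels that were marked."""
    tp, _, fn = confusion_counts(pred, truth)
    if tp + fn == 0:
        return float("nan")  # undefined when the reference mask is empty
    return tp / (tp + fn)


def dice(pred, truth):
    """DSC = 2*TP / (2*TP + FP + FN): overlap between the two masks."""
    tp, fp, fn = confusion_counts(pred, truth)
    denom = 2 * tp + fp + fn
    if denom == 0:
        return 1.0  # convention chosen here: two empty masks agree perfectly
    return 2 * tp / denom


# Toy example: 8-pixel reference mask and a reader's segmentation of it.
truth = [0, 1, 1, 1, 0, 0, 1, 0]
pred = [0, 1, 1, 0, 0, 1, 1, 0]

print(sensitivity(pred, truth))  # 0.75 (3 of 4 reference pixels found)
print(dice(pred, truth))         # 0.75 (2*3 / (2*3 + 1 + 1))
```

Se only penalizes missed nodule pixels, whereas DSC also penalizes pixels marked outside the reference, which is why the two values can move independently when sparse-view artifacts are removed.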

Title: OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation of Road Scenes. (arXiv:2307.15588v1 [cs.CV])