secure

Title: Hardening and Speeding Up Zero-interaction Pairing and Authentication. (arXiv:2306.04458v1 [cs.CR])

Title: Differentially Private Selection from Secure Distributed Computin. (arXiv:2306.04564v1 [cs.CR])

security

Title: NFT.mine: An xDeepFM-based Recommender System for Non-fungible Token (NFT) Buyers. (arXiv:2306.03942v1 [cs.CR])

Title: High-Performance Caching of Homomorphic Encryption for Cloud Databases. (arXiv:2306.04227v1 [cs.CR])

Title: Development and Analysis of P2SCP: A Paradigm for Penetration Testing of Systems that Cannot be Subjected to the Risk of Penetration Testing. (arXiv:2306.04279v1 [cs.CR])

Title: Development of a Multi-purpose Fuzzer to Perform Assessment as Input to a Cybersecurity Risk Assessment and Analysis System. (arXiv:2306.04284v1 [cs.CR])

Title: Security Analysis of WG-7 Lightweight Stream Cipher against Cube Attack. (arXiv:2306.04352v1 [cs.CR])

Title: Sustainable Adaptive Security. (arXiv:2306.04481v1 [cs.CR])

Title: The Effect of Length on Key Fingerprint Verification Security and Usability. (arXiv:2306.04574v1 [cs.CR])

Title: Prefix Siphoning: Exploiting LSM-Tree Range Filters For Information Disclosure (Full Version). (arXiv:2306.04602v1 [cs.CR])

To date, key-value store timing attacks have aimed to disclose stored values and have exploited external mechanisms that can be disabled for protection. In this paper, we point out that key disclosure is also a security threat -- and demonstrate key disclosure timing attacks that exploit mechanisms of the key-value store itself.

We target LSM-tree based key-value stores utilizing range filters, which have been recently proposed to optimize LSM-tree range queries. We analyze the impact of the range filters SuRF and prefix Bloom filter on LSM-trees through a security lens, and show that they enable a key disclosure timing attack, which we call prefix siphoning. Prefix siphoning successfully leverages benign queries for non-present keys to identify prefixes of actual keys -- and in some cases, full keys -- in scenarios where brute force searching for keys (via exhaustive enumeration or random guesses) is infeasible.

privacy

Title: SF-FSDA: Source-Free Few-Shot Domain Adaptive Object Detection with Efficient Labeled Data Factory. (arXiv:2306.04385v1 [cs.CV])

Title: Point Cloud Video Anomaly Detection Based on Point Spatio-Temporal Auto-Encoder. (arXiv:2306.04466v1 [cs.CV])

Title: PILLAR: How to make semi-private learning more effective. (arXiv:2306.03962v1 [cs.LG])

Title: Is Homomorphic Encryption Feasible for Smart Mobility?. (arXiv:2306.04195v1 [cs.CR])

Title: A Threat Model for Soft Privacy on Smart Cars. (arXiv:2306.04222v1 [cs.CR])

Title: CaptAinGlove: Capacitive and Inertial Fusion-Based Glove for Real-Time on Edge Hand Gesture Recognition for Drone Control. (arXiv:2306.04319v1 [cs.LG])

protect

defense

attack

Title: CFDP: Common Frequency Domain Pruning. (arXiv:2306.04147v1 [cs.CV])

Title: PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts. (arXiv:2306.04535v1 [cs.CL])

Title: Extracting Cloud-based Model with Prior Knowledge. (arXiv:2306.04192v1 [cs.CR])

Title: Development of a System Vulnerability Analysis Tool for Assessment of Complex Mission Critical Systems. (arXiv:2306.04280v1 [cs.CR])

Title: Vulnerable Smart Contract Function Locating Based on Multi-Relational Nested Graph Convolutional Network. (arXiv:2306.04479v1 [cs.CR])

Title: Divide and Repair: Using Options to Improve Performance of Imitation Learning Against Adversarial Demonstrations. (arXiv:2306.04581v1 [cs.LG])

Title: Membership inference attack with relative decision boundary distance. (arXiv:2306.04109v1 [cs.LG])

Title: Adversarial Sample Detection Through Neural Network Transport Dynamics. (arXiv:2306.04252v1 [cs.LG])

robust

Title: Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!. (arXiv:2306.03932v1 [cs.CV])

Title: StructuredMesh: 3D Structured Optimization of Fa\c{c}ade Components on Photogrammetric Mesh Models using Binary Integer Programming. (arXiv:2306.04184v1 [cs.CV])

Title: Learning Probabilistic Coordinate Fields for Robust Correspondences. (arXiv:2306.04231v1 [cs.CV])

Title: ICON$^2$: Reliably Benchmarking Predictive Inequity in Object Detection. (arXiv:2306.04482v1 [cs.CV])

Title: ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections. (arXiv:2306.04619v1 [cs.CV])

Title: Cross-Genre Argument Mining: Can Language Models Automatically Fill in Missing Discourse Markers?. (arXiv:2306.04314v1 [cs.CL])

Title: GPT Self-Supervision for a Better Data Annotator. (arXiv:2306.04349v1 [cs.CL])

Title: Label Aware Speech Representation Learning For Language Identification. (arXiv:2306.04374v1 [cs.CL])

Title: PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts. (arXiv:2306.04528v1 [cs.CL])

Title: Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations. (arXiv:2306.04618v1 [cs.CL])

Title: Agent Performing Autonomous Stock Trading under Good and Bad Situations. (arXiv:2306.03985v1 [cs.LG])

Title: Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings. (arXiv:2306.04064v1 [cs.LG])

In this paper, we tackle both challenges. We present a method that allows us to train adversarially robust deep networks for tabular data and to transfer this robustness to other classifiers via universal robust embeddings tailored to categorical data. These embeddings, created using a bilevel alternating minimization framework, can be transferred to boosted trees or random forests making them robust without the need for adversarial training while preserving their high accuracy on tabular data. We show that our methods outperform existing techniques within a practical threat model suitable for tabular data.

Title: A novel deeponet model for learning moving-solution operators with applications to earthquake hypocenter localization. (arXiv:2306.04096v1 [cs.LG])

Title: Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation. (arXiv:2306.04169v1 [cs.LG])

Title: Optimal Transport Model Distributional Robustness. (arXiv:2306.04178v1 [cs.LG])

Title: Self-Adjusting Weighted Expected Improvement for Bayesian Optimization. (arXiv:2306.04262v1 [cs.LG])

Title: Timing Process Interventions with Causal Inference and Reinforcement Learning. (arXiv:2306.04299v1 [cs.LG])

Title: Balancing of competitive two-player Game Levels with Reinforcement Learning. (arXiv:2306.04429v1 [cs.LG])

Title: Faithful Knowledge Distillation. (arXiv:2306.04431v1 [cs.LG])

Title: Training-Free Neural Active Learning with Initialization-Robustness Guarantees. (arXiv:2306.04454v1 [cs.LG])

Title: Recent applications of machine learning, remote sensing, and iot approaches in yield prediction: a critical review. (arXiv:2306.04566v1 [cs.LG])

Title: Generalization Across Observation Shifts in Reinforcement Learning. (arXiv:2306.04595v1 [cs.LG])

biometric

steal

extraction

Title: ECQED: Emotion-Cause Quadruple Extraction in Dialogs. (arXiv:2306.03969v1 [cs.CL])

Title: Leveraging Knowledge Graph Embeddings to Enhance Contextual Representations for Relation Extraction. (arXiv:2306.04203v1 [cs.CL])

Title: Co-evolving Graph Reasoning Network for Emotion-Cause Pair Extraction. (arXiv:2306.04340v1 [cs.CL])

Title: Enhancing In-Context Learning with Answer Feedback for Multi-Span Question Answering. (arXiv:2306.04508v1 [cs.CL])

Title: Permutation Equivariant Graph Framelets for Heterophilous Semi-supervised Learning. (arXiv:2306.04265v1 [cs.LG])

membership infer

federate

Title: Phoenix: A Federated Generative Diffusion Model. (arXiv:2306.04098v1 [cs.LG])

Title: FedVal: Different good or different bad in federated learning. (arXiv:2306.04040v1 [cs.LG])

Title: Fast Optimal Locally Private Mean Estimation via Random Projections. (arXiv:2306.04444v1 [cs.LG])

Title: Guiding The Last Layer in Federated Learning with Pre-Trained Models. (arXiv:2306.03937v1 [cs.LG])

fair

Title: Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training. (arXiv:2306.04237v1 [cs.CV])

Title: Echoes from Alexandria: A Large Resource for Multilingual Book Summarization. (arXiv:2306.04334v1 [cs.CL])

Title: Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity. (arXiv:2306.04424v1 [cs.CL])

Title: Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions. (arXiv:2306.04597v1 [cs.CL])

Title: BeMap: Balanced Message Passing for Fair Graph Neural Network. (arXiv:2306.04107v1 [cs.LG])

Title: M$^3$Fair: Mitigating Bias in Healthcare Data through Multi-Level and Multi-Sensitive-Attribute Reweighting Method. (arXiv:2306.04118v1 [cs.LG])

Title: Migrate Demographic Group For Fair GNNs. (arXiv:2306.04212v1 [cs.LG])

Title: A Fair Classifier Embracing Triplet Collapse. (arXiv:2306.04400v1 [cs.LG])

Title: Fair Column Subset Selection. (arXiv:2306.04489v1 [cs.LG])

Title: Optimal Fair Multi-Agent Bandits. (arXiv:2306.04498v1 [cs.LG])

interpretability

Title: Effective Neural Topic Modeling with Embedding Clustering Regularization. (arXiv:2306.04217v1 [cs.CL])

Title: World Models for Math Story Problems. (arXiv:2306.04347v1 [cs.CL])

Title: Hardness of Deceptive Certificate Selection. (arXiv:2306.04505v1 [cs.LG])

We consider a malicious prover-verifier duo that aims to exploit the AFC to achieve high completeness and soundness while using uninformative certificates. We show that this task is $\mathsf{NP}$-hard and cannot be approximated better than $\mathcal{O}(m^{1/8 - \epsilon})$, where $m$ is the number of possible certificates, for $\epsilon>0$ under the Dense-vs-Random conjecture. This is some evidence that AFC should not prevent the use of interactive classification for real-world tasks, as it is computationally hard to be exploited.

Title: Generalized Teacher Forcing for Learning Chaotic Dynamics. (arXiv:2306.04406v1 [cs.LG])

explainability

Title: MarineVRS: Marine Video Retrieval System with Explainability via Semantic Understanding. (arXiv:2306.04593v1 [cs.CV])

watermark

Title: On the Reliability of Watermarks for Large Language Models. (arXiv:2306.04634v1 [cs.LG])

diffusion

Title: Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance. (arXiv:2306.04396v1 [cs.CV])

Title: Multi-modal Latent Diffusion. (arXiv:2306.04445v1 [cs.LG])

Title: On the Design Fundamentals of Diffusion Models: A Survey. (arXiv:2306.04542v1 [cs.LG])

Title: Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt. (arXiv:2306.04607v1 [cs.CV])

Title: Designing a Better Asymmetric VQGAN for StableDiffusion. (arXiv:2306.04632v1 [cs.CV])

Title: Randomized Schur Complement Views for Graph Contrastive Learning. (arXiv:2306.04004v1 [cs.LG])

Title: MESSY Estimation: Maximum-Entropy based Stochastic and Symbolic densitY Estimation. (arXiv:2306.04120v1 [cs.LG])

Title: A Survey on Generative Diffusion Models for Structured Data. (arXiv:2306.04139v1 [cs.LG])

noise learning

data-free

transformer

Title: Energy-Based Models for Cross-Modal Localization using Convolutional Transformers. (arXiv:2306.04021v1 [cs.CV])

Title: BokehOrNot: Transforming Bokeh Effect with Image Transformer and Lens Metadata Embedding. (arXiv:2306.04032v1 [cs.CV])

Title: Efficient Vision Transformer for Human Pose Estimation via Patch Selection. (arXiv:2306.04225v1 [cs.CV])

Title: Normalization Layers Are All That Sharpness-Aware Minimization Needs. (arXiv:2306.04226v1 [cs.LG])

Title: Revising deep learning methods in parking lot occupancy detection. (arXiv:2306.04288v1 [cs.LG])

Title: Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex). (arXiv:2306.03997v1 [cs.CL])

Title: Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak. (arXiv:2306.04399v1 [cs.CL])

Title: Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers. (arXiv:2306.04504v1 [cs.CL])

Title: Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection. (arXiv:2306.04637v1 [cs.LG])

Building on these ``base'' ICL algorithms, intriguingly, we show that transformers can implement more complex ICL procedures involving \emph{in-context algorithm selection}, akin to what a statistician can do in real life -- A \emph{single} transformer can adaptively select different base ICL algorithms -- or even perform qualitatively different tasks -- on different input sequences, without any explicit prompting of the right algorithm or task. We both establish this in theory by explicit constructions, and also observe this phenomenon experimentally. In theory, we construct two general mechanisms for algorithm selection with concrete examples: pre-ICL testing, and post-ICL validation. As an example, we use the post-ICL validation mechanism to construct a transformer that can perform nearly Bayes-optimal ICL on a challenging task -- noisy linear models with mixed noise levels. Experimentally, we demonstrate the strong in-context algorithm selection capabilities of standard transformer architectures.

Title: Proximity-Informed Calibration for Deep Neural Networks. (arXiv:2306.04590v1 [cs.LG])

generative

Title: GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation. (arXiv:2306.04636v1 [cs.CV])

Title: Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health. (arXiv:2306.04059v1 [cs.CL])

Title: Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation. (arXiv:2306.04101v1 [cs.CL])

Title: From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation. (arXiv:2306.04170v1 [cs.CL])

Title: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems. (arXiv:2306.04357v1 [cs.CL])

Title: Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning. (arXiv:2306.04551v1 [cs.CL])

Title: Learning Causal Mechanisms through Orthogonal Neural Networks. (arXiv:2306.03938v1 [cs.LG])

In this paper, we investigate a problem of learning, in a fully unsupervised manner, the inverse of a set of independent mechanisms from distorted data points. We postulate, and justify this claim with experimental results, that an important weakness of existing machine learning solutions lies in the insufficiency of cross-module diversification. Addressing this crucial discrepancy between human and machine intelligence is an important challenge for pattern recognition systems.

To this end, our work proposes an unsupervised method that discovers and disentangles a set of independent mechanisms from unlabeled data, and learns how to invert them. A number of experts compete against each other for individual data points in an adversarial setting: one that best inverses the (unknown) generative mechanism is the winner. We demonstrate that introducing an orthogonalization layer into the expert architectures enforces additional diversity in the outputs, leading to significantly better separability. Moreover, we propose a procedure for relocating data points between experts to further prevent any one from claiming multiple mechanisms. We experimentally illustrate that these techniques allow discovery and modularization of much less pronounced transformations, in addition to considerably faster convergence.

Title: Partial Inference in Structured Prediction. (arXiv:2306.03949v1 [cs.LG])

Title: One-Dimensional Deep Image Prior for Curve Fitting of S-Parameters from Electromagnetic Solvers. (arXiv:2306.04001v1 [cs.LG])

large language model

Title: Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks. (arXiv:2306.04362v1 [cs.CV])

Title: M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning. (arXiv:2306.04387v1 [cs.CV])

Title: Turning large language models into cognitive models. (arXiv:2306.03917v1 [cs.CL])

Title: MISGENDERED: Limits of Large Language Models in Understanding Pronouns. (arXiv:2306.03950v1 [cs.CL])

Gender bias in language technologies has been widely studied, but research has mostly been restricted to a binary paradigm of gender. It is essential also to consider non-binary gender identities, as excluding them can cause further harm to an already marginalized group. In this paper, we comprehensively evaluate popular language models for their ability to correctly use English gender-neutral pronouns (e.g., singular they, them) and neo-pronouns (e.g., ze, xe, thon) that are used by individuals whose gender identity is not represented by binary pronouns. We introduce MISGENDERED, a framework for evaluating large language models' ability to correctly use preferred pronouns, consisting of (i) instances declaring an individual's pronoun, followed by a sentence with a missing pronoun, and (ii) an experimental setup for evaluating masked and auto-regressive language models using a unified method. When prompted out-of-the-box, language models perform poorly at correctly predicting neo-pronouns (averaging 7.6% accuracy) and gender-neutral pronouns (averaging 31.0% accuracy). This inability to generalize results from a lack of representation of non-binary pronouns in training data and memorized associations. Few-shot adaptation with explicit examples in the prompt improves the performance but plateaus at only 45.4% for neo-pronouns. We release the full dataset, code, and demo at https://tamannahossainkay.github.io/misgendered/

Title: Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction. (arXiv:2306.03959v1 [cs.CL])

Title: B\"{u}y\"{u}k dil modellerinin T\"{u}rk\c{c}e verisetleri ile e\u{g}itilmesi ve ince ayarlanmas\i. (arXiv:2306.03978v1 [cs.CL])

--

B\"uy\"uk dil modelleri inan{\i}lmaz \"ol\c{c}\"ude geli\c{s}mekte, b\"uy\"uk ilgi toplayarak ve \"uzerlerinde yo\u{g}un ara\c{s}tirmalarin yapildi\u{g}i bir d\"onemdedirler. Geli\c{s}tirilen modeller ve e\u{g}itimde kullanilan verisetlerinden bazilari a\c{c}ik eri\c{s}imli olarak sunulmaktadir. B\"oylece ince ayarlama teknikleri uygulayarak \"ozelle\c{s}mi\c{s} g\"orevler i\c{c}in \c{c}ali\c{s}abilir modeller elde edilmektedir. T\"urk\c{c}e s\"oz konusu oldu\u{g}unda bu modellerinin kapsayicili\u{g}i yeterli d\"uzeyde de\u{g}ildir. Bu durum, yayimlanan verisetlerinde de g\"ozlemlenebilir. Bunu a\c{s}manin yollari T\"urk\c{c}e i\c{c}erikli b\"uy\"uk verisetlerinin olu\c{s}turulmasi, b\"uy\"uk dil modellerinin bunlarla e\u{g}itilmesi ve \"onceden e\u{g}itilmi\c{s} modellerin T\"urk\c{c}e girdilerle ince ayarlanmalari olabilir. Bu \c{c}ali\c{s}mada a\c{c}ik eri\c{s}imli dil modelleri ve verisetleri \"uzerinde durulmakta ve T\"urk\c{c}e temelli bazi deneyler, kar\c{s}ila\c{s}ilan sorunlar ve sonu\c{c}lar irdelenmektedir.

Title: XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations. (arXiv:2306.04085v1 [cs.CL])

Title: Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering. (arXiv:2306.04136v1 [cs.CL])

Title: Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions. (arXiv:2306.04140v1 [cs.CL])

Title: A New Dataset and Empirical Study for Sentence Simplification in Chinese. (arXiv:2306.04188v1 [cs.CL])

Title: Multilingual Clinical NER: Translation or Cross-lingual Transfer?. (arXiv:2306.04384v1 [cs.CL])

Title: STEPS: A Benchmark for Order Reasoning in Sequential Tasks. (arXiv:2306.04441v1 [cs.CL])

Title: Long-form analogies generated by chatGPT lack human-like psycholinguistic properties. (arXiv:2306.04537v1 [cs.CL])

Title: The Two Word Test: A Semantic Benchmark for Large Language Models. (arXiv:2306.04610v1 [cs.CL])

Title: ModuleFormer: Learning Modular Large Language Models From Uncurated Data. (arXiv:2306.04640v1 [cs.CL])

Title: StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code. (arXiv:2306.04556v1 [cs.LG])

segmentation

Title: Real-Time Online Unsupervised Domain Adaptation for Real-World Person Re-identification. (arXiv:2306.03993v1 [cs.CV])

Title: 1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation. (arXiv:2306.04091v1 [cs.CV])

Title: MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos. (arXiv:2306.04216v1 [cs.CV])

Title: CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation. (arXiv:2306.04300v1 [cs.CV])

Title: ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation. (arXiv:2306.04344v1 [cs.CV])

Title: Fine-Grained Visual Prompting. (arXiv:2306.04356v1 [cs.CV])

Title: FoSp: Focus and Separation Network for Early Smoke Segmentation. (arXiv:2306.04474v1 [cs.CV])

Title: NeMO: Neural Map Growing System for Spatiotemporal Fusion in Bird's-Eye-View and BDD-Map Benchmark. (arXiv:2306.04540v1 [cs.CV])

Title: PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain. (arXiv:2306.04557v1 [cs.CV])

Title: Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion. (arXiv:2306.04633v1 [cs.CV])