
Title: Confidential Truth Finding with Multi-Party Computation (Extended Version). (arXiv:2305.14727v1 [cs.CR])


Title: Understanding the Country-Level Security of Free Content Websites and their Hosting Infrastructure. (arXiv:2305.14531v1 [cs.CR])

Title: Adversarial Machine Learning and Cybersecurity: Risks, Challenges, and Legal Implications. (arXiv:2305.14553v1 [cs.CR])

This report is meant to accomplish two things. First, it provides a high-level discussion of AI vulnerabilities, including the ways in which they are disanalogous to other types of vulnerabilities, and the current state of affairs regarding information sharing and legal oversight of AI vulnerabilities. Second, it attempts to articulate broad recommendations as endorsed by the majority of participants at the workshop.

Title: Towards Understanding Crypto Money Laundering in Web3 Through the Lenses of Ethereum Heists. (arXiv:2305.14748v1 [cs.CR])

Title: Applications of Machine Learning in Detecting Afghan Fake Banknotes. (arXiv:2305.14745v1 [cs.LG])


Title: MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems. (arXiv:2305.14536v1 [cs.CL])




Title: Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models. (arXiv:2305.14384v1 [cs.LG])


Title: Connecting Multi-modal Contrastive Representations. (arXiv:2305.14381v1 [cs.LG])

Title: Point2SSM: Learning Morphological Variations of Anatomies from Point Cloud. (arXiv:2305.14486v1 [cs.CV])

Title: Eliminating Spurious Correlations from Pre-trained Models via Data Mixing. (arXiv:2305.14521v1 [cs.LG])

Title: Robust 3D-aware Object Classification via Discriminative Render-and-Compare. (arXiv:2305.14668v1 [cs.CV])

Title: NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-World Video Super-Resolution. (arXiv:2305.14669v1 [cs.CV])

Title: AdvFunMatch: When Consistent Teaching Meets Adversarial Robustness. (arXiv:2305.14700v1 [cs.LG])

Title: AutoDepthNet: High Frame Rate Depth Map Reconstruction using Commodity Depth and RGB Cameras. (arXiv:2305.14731v1 [cs.CV])

Title: Is Information Extraction Solved by ChatGPT? An Analysis of Performance, Evaluation Criteria, Robustness and Errors. (arXiv:2305.14450v1 [cs.CL])

Title: On Robustness of Finetuned Transformer-based NLP Models. (arXiv:2305.14453v1 [cs.CL])

In this paper, we study the robustness of three language models (BERT, GPT-2 and T5) to eight different text perturbations on the General Language Understanding Evaluation (GLUE) benchmark. We also use two metrics (CKA and STIR) to quantify changes between pretrained and finetuned language model representations across layers. GPT-2 representations are more robust than those of BERT and T5 across multiple types of input perturbation. Although the models are broadly robust, dropping nouns, dropping verbs, and changing characters have the largest impact. Overall, this study provides insight into perturbation-specific weaknesses of popular Transformer-based models, which should be kept in mind when passing inputs to them.
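
As a rough illustration of how a CKA-style similarity score between two sets of layer representations can be computed, here is a minimal sketch of the linear variant of CKA in plain Python. It assumes the linear kernel; the abstract does not say which CKA variant the paper uses. The function names (`center_columns`, `gram`, `frob_inner`, `linear_cka`) and the toy matrices are hypothetical and are not taken from the paper.

```python
import math
import random

def center_columns(m):
    """Subtract each column's mean so every feature has zero mean."""
    n = len(m)
    means = [sum(col) / n for col in zip(*m)]
    return [[v - mu for v, mu in zip(row, means)] for row in m]

def gram(m):
    """Linear-kernel Gram matrix: pairwise dot products between rows."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in m] for r1 in m]

def frob_inner(a, b):
    """Frobenius inner product of two equally shaped matrices."""
    return sum(x * y for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def linear_cka(x, y):
    """Linear CKA between representations x and y (rows = the same examples)."""
    kx = gram(center_columns(x))
    ky = gram(center_columns(y))
    return frob_inner(kx, ky) / math.sqrt(frob_inner(kx, kx) * frob_inner(ky, ky))

# Toy data (assumption): 32 examples with 4 features each.
random.seed(0)
base = [[random.gauss(0.0, 1.0) for _ in range(4)] for _ in range(32)]

# Same information, but with features permuted and rescaled by 2.
rescaled = [[2 * r[3], 2 * r[0], 2 * r[2], 2 * r[1]] for r in base]

# Independent random features for comparison.
unrelated = [[random.gauss(0.0, 1.0) for _ in range(4)] for _ in range(32)]

cka_same = linear_cka(base, rescaled)
cka_unrelated = linear_cka(base, unrelated)
print(cka_same, cka_unrelated)
```

Linear CKA is invariant to permuting features and to uniform rescaling, so the first score is 1 up to floating-point error, while the independent features score lower. Comparing a pretrained layer with its finetuned counterpart would follow the same pattern, with real activations in place of the toy matrices.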

Title: Are Large Language Models Robust Zero-shot Coreference Resolvers?. (arXiv:2305.14489v1 [cs.CL])

Title: How to Choose How to Choose Your Chatbot: A Massively Multi-System MultiReference Data Set for Dialog Metric Evaluation. (arXiv:2305.14533v1 [cs.CL])

Title: From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding. (arXiv:2305.14571v1 [cs.CL])

Title: DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade. (arXiv:2305.14751v1 [cs.CL])

Title: Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization. (arXiv:2305.14760v1 [cs.CL])

Title: Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models. (arXiv:2305.14775v1 [cs.CL])

Title: Sequence Modeling is a Robust Contender for Offline Reinforcement Learning. (arXiv:2305.14550v1 [cs.LG])

Title: Negative Feedback Training: A Novel Concept to Improve Robustness of NVCiM DNN Accelerators. (arXiv:2305.14561v1 [cs.LG])

Title: Robust Explanations for Deep Neural Networks via Pseudo Neural Tangent Kernel Surrogate Models. (arXiv:2305.14585v1 [cs.LG])

Title: Learning Survival Distribution with Implicit Survival Function. (arXiv:2305.14655v1 [cs.LG])

Title: An Evaluation on Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation. (arXiv:2305.14704v1 [cs.LG])




Title: Assessment of Anterior Cruciate Ligament Injury Risk Based on Human Key Points Detection Algorithm. (arXiv:2305.14612v1 [cs.CV])

Title: Domain-Expanded ASTE: Rethinking Generalization in Aspect Sentiment Triplet Extraction. (arXiv:2305.14434v1 [cs.CL])

Title: BAND: Biomedical Alert News Dataset. (arXiv:2305.14480v1 [cs.CL])

Title: RE$^2$: Region-Aware Relation Extraction from Visually Rich Documents. (arXiv:2305.14590v1 [cs.CL])

Title: Iteratively Improving Biomedical Entity Linking and Event Extraction via Hard Expectation-Maximization. (arXiv:2305.14645v1 [cs.CL])

Title: InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction. (arXiv:2305.14659v1 [cs.CL])

Title: Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction. (arXiv:2305.14660v1 [cs.CL])

Title: A Causal View of Entity Bias in (Large) Language Models. (arXiv:2305.14695v1 [cs.CL])

membership infer



Title: Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning. (arXiv:2305.14711v1 [cs.CL])

Title: FITNESS: A Causal De-correlation Approach for Mitigating Bias in Machine Learning Software. (arXiv:2305.14396v1 [cs.LG])

Title: Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces. (arXiv:2305.14516v1 [cs.LG])

Title: Interpretation of Time-Series Deep Models: A Survey. (arXiv:2305.14582v1 [cs.LG])


Title: Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations. (arXiv:2305.14599v1 [cs.CL])

Title: Mixture of Prompt Experts for Generalizable and Interpretable Question Answering. (arXiv:2305.14628v1 [cs.CL])

Title: SenteCon: Leveraging Lexicons to Learn Human-Interpretable Language Representations. (arXiv:2305.14728v1 [cs.CL])

Title: Human-Centered Metrics for Dialog System Evaluation. (arXiv:2305.14757v1 [cs.CL])


Title: TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering. (arXiv:2305.14682v1 [cs.CL])



Title: T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities. (arXiv:2305.14674v1 [cs.CV])

Title: Optimal Linear Subspace Search: Learning to Construct Fast and High-Quality Schedulers for Diffusion Models. (arXiv:2305.14677v1 [cs.CV])

Title: BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing. (arXiv:2305.14720v1 [cs.CV])

Title: I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors. (arXiv:2305.14724v1 [cs.CL])

Title: ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation. (arXiv:2305.14742v1 [cs.CV])

Title: Diffusion Models in NLP: A Survey. (arXiv:2305.14671v1 [cs.CL])

Title: SSD-2: Scaling and Inference-time Fusion of Diffusion Language Models. (arXiv:2305.14771v1 [cs.CL])

Title: On the Generalization of Diffusion Model. (arXiv:2305.14712v1 [cs.LG])

noise learning



Title: Reinforcement Learning finetuned Vision-Code Transformer for UI-to-Code Generation. (arXiv:2305.14637v1 [cs.CV])

We propose an end-to-end pipeline that can generate high-quality code snippets directly from screenshots, streamlining the website creation process for developers. To train and evaluate our models, we created a synthetic dataset of 30,000 unique pairs of code and corresponding screenshots.

We evaluate our approach using a combination of automated metrics (MSE, BLEU, IoU, and a novel htmlBLEU score), on which our models perform strongly. We establish a strong baseline with the DiT-GPT2 model and show that actor-critic training can improve the IoU score from the baseline of 0.64 to 0.79 and lower MSE from 12.25 to 9.02. This performance is similar to that of larger models, at much lower computational cost.
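
For readers unfamiliar with two of the metrics named above, the following sketch shows one common way to compute MSE and IoU between a target image and a rendered one. It is only an assumption about the general form of these metrics; the abstract does not describe how the paper rasterizes screenshots or which regions it compares. The helper names `mse` and `iou` and the 3x3 toy masks are hypothetical.

```python
def mse(image_a, image_b):
    """Mean squared error between two equally sized 2D pixel grids."""
    flat_a = [v for row in image_a for v in row]
    flat_b = [v for row in image_b for v in row]
    return sum((p - q) ** 2 for p, q in zip(flat_a, flat_b)) / len(flat_a)

def iou(mask_a, mask_b):
    """Intersection over union of two binary masks (truthy = occupied pixel)."""
    intersection = 0
    union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for p, q in zip(row_a, row_b):
            if p and q:
                intersection += 1
            if p or q:
                union += 1
    # Two empty masks are treated as a perfect match (a convention, not a rule).
    return intersection / union if union else 1.0

# Toy masks (assumption): 1 marks pixels covered by a rendered UI element.
target = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 0, 0],
]
rendered = [
    [1, 1, 1],
    [1, 0, 0],
    [0, 0, 0],
]

iou_score = iou(target, rendered)  # 3 shared pixels out of 5 covered pixels
mse_score = mse(target, rendered)  # 2 differing pixels out of 9
print(iou_score, mse_score)
```

Higher IoU and lower MSE both indicate a rendering closer to the target, which is the direction of the improvements reported in the abstract.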

Title: Quantifying Character Similarity with Vision Transformers. (arXiv:2305.14672v1 [cs.CL])

Title: BinaryViT: Towards Efficient and Accurate Binary Vision Transformers. (arXiv:2305.14730v1 [cs.CV])

Title: Dual Path Transformer with Partition Attention. (arXiv:2305.14768v1 [cs.CV])

Title: Finding the Pillars of Strength for Multi-Head Attention. (arXiv:2305.14380v1 [cs.LG])

Title: NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders. (arXiv:2305.14499v1 [cs.CL])

Title: All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations. (arXiv:2305.14555v1 [cs.CL])

Title: KNN-LM Does Not Improve Open-ended Text Generation. (arXiv:2305.14625v1 [cs.CL])

Title: Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation. (arXiv:2305.14734v1 [cs.CL])

Title: Adapting Language Models to Compress Contexts. (arXiv:2305.14788v1 [cs.CL])

Title: NeuralMatrix: Moving Entire Neural Networks to General Matrix Multiplication for Efficient Inference. (arXiv:2305.14405v1 [cs.LG])

Title: A Joint Time-frequency Domain Transformer for Multivariate Time Series Forecasting. (arXiv:2305.14649v1 [cs.LG])

Title: Revenge of MLP in Sequential Recommendation. (arXiv:2305.14675v1 [cs.LG])

Title: Can Transformers Learn to Solve Problems Recursively?. (arXiv:2305.14699v1 [cs.LG])


Title: Design a Delicious Lunchbox in Style. (arXiv:2305.14522v1 [cs.CV])

Title: Towards Early Prediction of Human iPSC Reprogramming Success. (arXiv:2305.14575v1 [cs.CV])

Title: Generative Modeling through the Semi-dual Formulation of Unbalanced Optimal Transport. (arXiv:2305.14777v1 [cs.CV])

Title: CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains. (arXiv:2305.14471v1 [cs.CL])

Title: Revisit and Outstrip Entity Alignment: A Perspective of Generative Models. (arXiv:2305.14651v1 [cs.CL])

Title: A Rational Model of Dimension-reduced Human Categorization. (arXiv:2305.14383v1 [cs.LG])

Title: Fourier Neural Operators for Arbitrary Resolution Climate Data Downscaling. (arXiv:2305.14452v1 [cs.LG])

Title: torchgfn: A PyTorch GFlowNet library. (arXiv:2305.14594v1 [cs.LG])

large language model

Title: Prompting Language-Informed Distribution for Compositional Zero-Shot Learning. (arXiv:2305.14428v1 [cs.CV])

Title: Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation. (arXiv:2305.14386v1 [cs.LG])

Title: AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback. (arXiv:2305.14387v1 [cs.LG])

Title: Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions. (arXiv:2305.14441v1 [cs.CL])

Title: Having Beer after Prayer? Measuring Cultural Bias in Large Language Models. (arXiv:2305.14456v1 [cs.CL])

Title: Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA. (arXiv:2305.14458v1 [cs.CL])

Title: Language Model Self-improvement by Reinforcement Learning Contemplation. (arXiv:2305.14483v1 [cs.CL])

Title: Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement. (arXiv:2305.14497v1 [cs.CL])

Title: RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning. (arXiv:2305.14502v1 [cs.CL])

Title: Deduction under Perturbed Evidence: Probing Student Simulation Capabilities of Large Language Models. (arXiv:2305.14507v1 [cs.CL])

Title: Sources of Hallucination by Large Language Models on Inference Tasks. (arXiv:2305.14552v1 [cs.CL])

Title: PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents. (arXiv:2305.14564v1 [cs.CL])

Title: ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers. (arXiv:2305.14591v1 [cs.CL])

Title: Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy. (arXiv:2305.14596v1 [cs.CL])

Title: Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models. (arXiv:2305.14623v1 [cs.CL])

Title: Enabling Large Language Models to Generate Text with Citations. (arXiv:2305.14627v1 [cs.CL])

Title: Testing Causal Models of Word Meaning in GPT-3 and -4. (arXiv:2305.14630v1 [cs.CL])

Title: Evaluate What You Can't Evaluate: Unassessable Generated Responses Quality. (arXiv:2305.14658v1 [cs.CL])

Title: ExpertPrompting: Instructing Large Language Models to be Distinguished Experts. (arXiv:2305.14688v1 [cs.CL])

Title: Have Large Language Models Developed a Personality?: Applicability of Self-Assessment Tests in Measuring Personality in LLMs. (arXiv:2305.14693v1 [cs.CL])

Title: Analyzing Influential Factors in Human Preference Judgments via GPT-4. (arXiv:2305.14702v1 [cs.CL])

Title: Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models. (arXiv:2305.14710v1 [cs.CL])

Title: In-Context Demonstration Selection with Cross Entropy Difference. (arXiv:2305.14726v1 [cs.CL])

Title: Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation. (arXiv:2305.14750v1 [cs.CL])

Title: Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. (arXiv:2305.14763v1 [cs.CL])

Title: BeamSearchQA: Large Language Models are Strong Zero-Shot QA Solver. (arXiv:2305.14766v1 [cs.CL])

Title: Using Natural Language Explanations to Rescale Human Judgments. (arXiv:2305.14770v1 [cs.CL])

Title: Large Language Models as Counterfactual Generator: Strengths and Weaknesses. (arXiv:2305.14791v1 [cs.CL])

Title: MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions. (arXiv:2305.14795v1 [cs.CL])

Title: Estimating Large Language Model Capabilities without Labeled Test Data. (arXiv:2305.14802v1 [cs.CL])


Title: Breast Cancer Segmentation using Attention-based Convolutional Network and Explainable AI. (arXiv:2305.14389v1 [cs.CV])

Title: FLAIR #2: textural and temporal information for semantic segmentation from multi-source optical imagery. (arXiv:2305.14467v1 [cs.CV])

Title: Streaming Object Detection on Fisheye Cameras for Automatic Parking. (arXiv:2305.14713v1 [cs.CV])

Title: Polarimetric Imaging for Perception. (arXiv:2305.14787v1 [cs.CV])

Title: Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark. (arXiv:2305.14790v1 [cs.CL])