

Title: Zero-shot racially balanced dataset generation using an existing biased StyleGAN2. (arXiv:2305.07710v1 [cs.CV])

Title: Improving Defensive Distillation using Teacher Assistant. (arXiv:2305.08076v1 [cs.CV])

Title: Beyond the Safeguards: Exploring the Security Risks of ChatGPT. (arXiv:2305.08005v1 [cs.CR])

Title: Interchain Timestamping for Mesh Security. (arXiv:2305.07830v1 [cs.CR])

Title: Systematic Meets Unintended: Prior Knowledge Adaptive 5G Vulnerability Detection via Multi-Fuzzing. (arXiv:2305.08039v1 [cs.CR])

Title: NLP-based Cross-Layer 5G Vulnerabilities Detection via Fuzzing Generated Run-Time Profiling. (arXiv:2305.08226v1 [cs.CR])

Title: On the Computational Cost of Stochastic Security. (arXiv:2305.07973v1 [cs.LG])


Title: MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning. (arXiv:2305.07815v1 [cs.CV])

Title: Black-box Source-free Domain Adaptation via Two-stage Knowledge Distillation. (arXiv:2305.07881v1 [cs.CV])

Title: Private and Communication-Efficient Algorithms for Entropy Estimation. (arXiv:2305.07751v1 [cs.LG])

Title: The Case for the Anonymization of Offloaded Computation. (arXiv:2305.07803v1 [cs.CR])

Title: Balancing Privacy and Utility of Spatio-Temporal Data for Taxi-Demand Prediction. (arXiv:2305.08107v1 [cs.LG])

Title: Traceable mixnets. (arXiv:2305.08138v1 [cs.CR])



Title: DNN-Defender: An in-DRAM Deep Neural Network Defense Mechanism for Adversarial Weight Attack. (arXiv:2305.08034v1 [cs.CR])


Title: Diffusion Models for Imperceptible and Transferable Adversarial Attack. (arXiv:2305.08192v1 [cs.CV])

Title: ChargeX: Exploring State Switching Attack on Electric Vehicle Charging Systems. (arXiv:2305.08037v1 [cs.CR])

Title: Mastering Percolation-like Games with Deep Learning. (arXiv:2305.07687v1 [cs.LG])


Title: Lightweight Delivery Detection on Doorbell Cameras. (arXiv:2305.07812v1 [cs.CV])

Title: On enhancing the robustness of Vision Transformers: Defensive Diffusion. (arXiv:2305.08031v1 [cs.CV])

Title: GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content. (arXiv:2305.07969v1 [cs.CL])

Title: Predicting COVID-19 pandemic by spatio-temporal graph neural networks: A New Zealand's study. (arXiv:2305.07731v1 [cs.LG])

Title: SPP-CNN: An Efficient Framework for Network Robustness Prediction. (arXiv:2305.07872v1 [cs.LG])




Title: On the Hidden Mystery of OCR in Large Multimodal Models. (arXiv:2305.07895v1 [cs.CV])

Title: Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering. (arXiv:2305.08135v1 [cs.CL])

Title: A Federated Learning-based Industrial Health Prognostics for Heterogeneous Edge Devices using Matched Feature Extraction. (arXiv:2305.07854v1 [cs.LG])

membership infer


Title: Understanding Model Averaging in Federated Learning on Heterogeneous Data. (arXiv:2305.07845v1 [cs.LG])

Title: A Survey of Federated Evaluation in Federated Learning. (arXiv:2305.08070v1 [cs.LG])

Title: Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling. (arXiv:2305.08104v1 [cs.LG])


Title: DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-Learning. (arXiv:2305.07892v1 [cs.LG])


Title: Answering Complex Questions over Text by Hybrid Question Parsing and Execution. (arXiv:2305.07789v1 [cs.CL])

Title: Zero-shot Faithful Factual Error Correction. (arXiv:2305.07982v1 [cs.CL])


Title: ProKnow: Process Knowledge for Safety Constrained and Explainable Question Generation for Mental Health Diagnostic Assistance. (arXiv:2305.08010v1 [cs.CL])



Title: Meta-DM: Applications of Diffusion Models on Few-Shot Learning. (arXiv:2305.08092v1 [cs.LG])

noise learning



Title: ROI-based Deep Image Compression with Swin Transformers. (arXiv:2305.07783v1 [cs.CV])

Title: CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers. (arXiv:2305.07840v1 [cs.CV])

Title: GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples. (arXiv:2305.07931v1 [cs.CV])

Title: A Two-Stage Real Image Deraining Method for GT-RAIN Challenge CVPR 2023 Workshop UG$^{\textbf{2}}$+ Track 3. (arXiv:2305.07979v1 [cs.CV])

Title: TSGN: Temporal Scene Graph Neural Networks with Projected Vectorized Representation for Multi-Agent Motion Prediction. (arXiv:2305.08190v1 [cs.CV])

Title: TinyStories: How Small Can Language Models Be and Still Speak Coherent English?. (arXiv:2305.07759v1 [cs.CL])

In this work, we introduce TinyStories, a synthetic dataset of short stories that contain only words that typical 3- to 4-year-olds usually understand, generated by GPT-3.5 and GPT-4. We show that TinyStories can be used to train and evaluate LMs that are much smaller than the state-of-the-art models (below 10 million total parameters), or have much simpler architectures (with only one transformer block), yet still produce fluent and consistent stories of several paragraphs that are diverse, have almost perfect grammar, and demonstrate reasoning capabilities.

We also introduce a new paradigm for the evaluation of language models: we suggest a framework that uses GPT-4 to grade the content generated by these models as if it were stories written by students and graded by a (human) teacher. This new paradigm overcomes the flaws of standard benchmarks, which often require the model's output to be very structured, and moreover provides a multidimensional score for the model, with separate scores for different capabilities such as grammar, creativity and consistency.

We hope that TinyStories can facilitate the development, analysis and research of LMs, especially for low-resource or specialized domains, and shed light on the emergence of language capabilities in LMs.
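
The grading paradigm described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the prompt wording, the 0-10 scale, the reply format, and the names `build_grading_prompt`, `parse_scores`, and `grade_stories` are all assumptions made for illustration, and the grader is passed in as a plain callable so that no particular model API is implied.

```python
import re
from statistics import mean
from typing import Callable, Dict, List

# Dimensions named in the abstract; the 0-10 scale is an assumption.
DIMENSIONS = ("grammar", "creativity", "consistency")


def build_grading_prompt(story: str) -> str:
    """Frame the story as student work to be marked by a teacher (hypothetical wording)."""
    wanted = ", ".join(f"{d}: <score>/10" for d in DIMENSIONS)
    return (
        "The following story was written by a student. "
        "Grade it as a teacher would.\n\n"
        f"{story}\n\n"
        f"Reply in exactly this form: {wanted}"
    )


def parse_scores(reply: str) -> Dict[str, int]:
    """Extract one integer score per dimension from the grader's reply."""
    scores = {}
    for dim in DIMENSIONS:
        match = re.search(rf"{dim}\s*:\s*(\d+)\s*/\s*10", reply, re.IGNORECASE)
        if match is None:
            raise ValueError(f"missing score for {dim!r}")
        scores[dim] = int(match.group(1))
    return scores


def grade_stories(stories: List[str], grader: Callable[[str], str]) -> Dict[str, float]:
    """Average each dimension over all stories, giving a multidimensional score."""
    per_story = [parse_scores(grader(build_grading_prompt(s))) for s in stories]
    return {d: mean(p[d] for p in per_story) for d in DIMENSIONS}
```

Here `grader` would wrap a call to whatever grading model is used; keeping it as an injected function lets the parsing and averaging logic be exercised with a stub before any model is involved.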

Title: PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity. (arXiv:2305.07893v1 [cs.CL])

Title: Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation. (arXiv:2305.08096v1 [cs.CL])

Title: Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews. (arXiv:2305.08173v1 [cs.CL])

Title: DRew: Dynamically Rewired Message Passing with Delay. (arXiv:2305.08018v1 [cs.LG])

Title: HiPerformer: Hierarchically Permutation-Equivariant Transformer for Time Series Forecasting. (arXiv:2305.08073v1 [cs.LG])


Title: Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation. (arXiv:2305.07804v1 [cs.CL])

Title: Learning to Generalize for Cross-domain QA. (arXiv:2305.08208v1 [cs.CL])

Title: LatentPINNs: Generative physics-informed neural networks via a latent representation learning. (arXiv:2305.07671v1 [cs.LG])

Title: Measuring Surprise in the Wild. (arXiv:2305.07733v1 [cs.LG])

Title: Latent Processes Identification From Multi-View Time Series. (arXiv:2305.08164v1 [cs.LG])

large language model

Title: NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models. (arXiv:2305.07766v1 [cs.CL])

Title: Bridging History with AI A Comparative Evaluation of GPT 3.5, GPT4, and GoogleBARD in Predictive Accuracy and Fact Checking. (arXiv:2305.07868v1 [cs.CL])

Title: CodeT5+: Open Code Large Language Models for Code Understanding and Generation. (arXiv:2305.07922v1 [cs.CL])

Title: Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives. (arXiv:2305.08088v1 [cs.CL])


Title: AURA : Automatic Mask Generator using Randomized Input Sampling for Object Removal. (arXiv:2305.07857v1 [cs.CV])

Title: Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance. (arXiv:2305.07943v1 [cs.CV])

Title: Image Segmentation via Probabilistic Graph Matching. (arXiv:2305.07954v1 [cs.CV])

Title: SCRNet: a Retinex Structure-based Low-light Enhancement Model Guided by Spatial Consistency. (arXiv:2305.08053v1 [cs.CV])

Title: A Comprehensive Survey on Segment Anything Model for Vision and Beyond. (arXiv:2305.08196v1 [cs.CV])