Natural Adversarial Examples (paper)

2021 · Citations: 757

Topics: Cybersecurity; Adversarial Robustness in Machine Learning; Physical Unclonable Functions (PUFs) and Hardware Security; Advanced Malware Detection Techniques

Abstract

We introduce two challenging datasets that reliably cause machine learning model performance to substantially degrade. The datasets are collected with a simple adversarial filtration technique to create datasets with limited spurious cues. Our datasets’ real-world, unmodified examples transfer to various unseen models reliably, demonstrating that computer vision models have shared weaknesses. The first dataset is called IMAGENET-A and is like the ImageNet test set, but it is far more challenging for existing models. We also curate an adversarial out-of-distribution detection dataset called IMAGENET-O, which is the first out-of-distribution detection dataset created for ImageNet models. On IMAGENET-A a DenseNet-121 obtains around 2% accuracy, an accuracy drop of approximately 90%, and its out-of-distribution detection performance on IMAGENET-O is near random chance levels. We find that existing data augmentation techniques hardly boost performance, and using other public training datasets provides improvements that are limited. However, we find that improvements to computer vision architectures provide a promising path towards robust models.
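The abstract does not spell out the filtration procedure, so the following is only a minimal sketch of the general idea under a stated assumption: that adversarial filtration means keeping the candidate examples a fixed model classifies incorrectly. The names `adversarially_filter`, `accuracy`, and `background_model`, and the toy data, are hypothetical and are not taken from the paper.

```python
from typing import Callable, Dict, List, Tuple

# Toy stand-in for an (image, label) pair: a feature dict plus its true label.
Example = Tuple[Dict[str, str], str]
Predictor = Callable[[Dict[str, str]], str]


def adversarially_filter(candidates: List[Example], predict: Predictor) -> List[Example]:
    """Keep only the examples that the fixed filtering model gets wrong (assumed rule)."""
    return [(x, y) for x, y in candidates if predict(x) != y]


def accuracy(examples: List[Example], predict: Predictor) -> float:
    """Fraction of examples whose predicted label matches the true label."""
    if not examples:
        raise ValueError("accuracy is undefined for an empty example list")
    return sum(predict(x) == y for x, y in examples) / len(examples)


def background_model(x: Dict[str, str]) -> str:
    """Hypothetical model that relies on a spurious cue (the background), not the object."""
    return "boat" if x["background"] == "water" else "car"


candidates: List[Example] = [
    ({"shape": "boat", "background": "water"}, "boat"),
    ({"shape": "car", "background": "road"}, "car"),
    ({"shape": "car", "background": "water"}, "car"),
    ({"shape": "boat", "background": "road"}, "boat"),
]

# Unmodified examples where the spurious cue misleads the filtering model.
hard_subset = adversarially_filter(candidates, background_model)
```

By construction the filtering model scores zero on `hard_subset`. The abstract's stronger claim is that such examples also transfer to other, unseen models, which this toy example does not demonstrate.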

Authors (4 of 5 listed)

Dawn Song

Jacob Steinhardt

Steven Basart

Kevin Zhao
