深层神经网络的强大明智的对抗性学习图像分类

论文标题

深层神经网络的强大明智的对抗性学习图像分类

Robust Sensible Adversarial Learning of Deep Neural Networks for Image Classification

论文作者

Kim, Jungeum, Wang, Xiao

论文摘要

鲁棒性的想法对于现代统计分析至关重要。但是，尽管深度神经网络（DNN）最近取得了进步，但许多研究表明，DNN容易受到对抗性攻击的影响。对图像进行不可察觉的更改可能会导致DNN模型以高度置信度进行错误的分类，例如将良性痣分类为恶性肿瘤和停止符号为速度限制符号。鲁棒性和标准精度之间的权衡对于DNN模型来说是常见的。在本文中，我们介绍了明智的对抗性学习，并证明了标准自然准确性和鲁棒性追求之间的协同作用。具体而言，我们定义了一个明智的对手，该对手对于学习强大的模型很有用，同时保持高自然精度。从理论上讲，我们确定贝叶斯分类器是在明智的对抗性学习下具有0-1损失的最强大的多级分类器。我们提出了一种新颖有效的算法，该算法使用隐式损失截断来训练健壮的模型。我们将明智的对抗性学习应用于大规模图像分类，将其用于称为MNIST的手写数字图像数据集和称为CIFAR10的对象识别彩色图像数据集。我们已经进行了广泛的比较研究，以将我们的方法与其他竞争方法进行比较。我们的实验从经验上表明，我们的方法对其超参数不敏感，即使具有较小的模型能力也不会崩溃，同时促进了针对各种攻击的鲁棒性并保持高自然的准确性。

The idea of robustness is central and critical to modern statistical analysis. However, despite the recent advances of deep neural networks (DNNs), many studies have shown that DNNs are vulnerable to adversarial attacks. Making imperceptible changes to an image can cause DNN models to make the wrong classification with high confidence, such as classifying a benign mole as a malignant tumor and a stop sign as a speed limit sign. The trade-off between robustness and standard accuracy is common for DNN models. In this paper, we introduce sensible adversarial learning and demonstrate the synergistic effect between pursuits of standard natural accuracy and robustness. Specifically, we define a sensible adversary which is useful for learning a robust model while keeping high natural accuracy. We theoretically establish that the Bayes classifier is the most robust multi-class classifier with the 0-1 loss under sensible adversarial learning. We propose a novel and efficient algorithm that trains a robust model using implicit loss truncation. We apply sensible adversarial learning for large-scale image classification to a handwritten digital image dataset called MNIST and an object recognition colored image dataset called CIFAR10. We have performed an extensive comparative study to compare our method with other competitive methods. Our experiments empirically demonstrate that our method is not sensitive to its hyperparameter and does not collapse even with a small model capacity while promoting robustness against various attacks and keeping high natural accuracy.

下载PDF全文

下载文献需遵守相关版权规定

论文标题