当地的卷积会导致对高频对手的隐性偏见

论文标题

当地的卷积会导致对高频对手的隐性偏见

Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples

论文作者

Caro, Josue Ortega, Ju, Yilong, Pyle, Ryan, Dey, Sourav, Brendel, Wieland, Anselmi, Fabio, Patel, Ankit

论文摘要

对抗性攻击仍然是神经网络的重大挑战。最近的工作表明，对抗扰动通常包含高频特征，但是这种现象的根本原因仍然未知。受到线性全宽卷积模型的理论工作的启发，我们假设在当前神经网络中常用的局部（即有界宽度的）卷积操作是隐含的偏向于学习高频特征的偏见，这是高频对抗性示例的根本原因之一。为了检验这一假设，我们分析了线性和非线性体系结构选择对空间和频域中对对抗性扰动的隐式偏差的影响。我们发现，高频逆向扰动主要取决于卷积操作，因为局部卷积的空间限制性质会引起对高频特征的隐含偏见。后者的解释涉及傅立叶不确定性原理：空间限制（在空间域中）滤波器也不能是频率限制的（频域中的局部）。此外，使用较大的卷积内核大小或避免卷积（例如，使用Vision Transformers体系结构）大大降低了这种高频偏见，但不能大大降低对攻击的总体易感性。展望未来，我们的工作强烈建议理解和控制建筑的隐性偏见对于实现对抗性鲁棒性至关重要。

Adversarial Attacks are still a significant challenge for neural networks. Recent work has shown that adversarial perturbations typically contain high-frequency features, but the root cause of this phenomenon remains unknown. Inspired by theoretical work on linear full-width convolutional models, we hypothesize that the local (i.e. bounded-width) convolutional operations commonly used in current neural networks are implicitly biased to learn high frequency features, and that this is one of the root causes of high frequency adversarial examples. To test this hypothesis, we analyzed the impact of different choices of linear and nonlinear architectures on the implicit bias of the learned features and the adversarial perturbations, in both spatial and frequency domains. We find that the high-frequency adversarial perturbations are critically dependent on the convolution operation because the spatially-limited nature of local convolutions induces an implicit bias towards high frequency features. The explanation for the latter involves the Fourier Uncertainty Principle: a spatially-limited (local in the space domain) filter cannot also be frequency-limited (local in the frequency domain). Furthermore, using larger convolution kernel sizes or avoiding convolutions (e.g. by using Vision Transformers architecture) significantly reduces this high frequency bias, but not the overall susceptibility to attacks. Looking forward, our work strongly suggests that understanding and controlling the implicit bias of architectures will be essential for achieving adversarial robustness.

下载PDF全文

下载文献需遵守相关版权规定

论文标题