论文标题

对抗训练应用于卷积神经网络,以进行光度红移预测

Adversarial training applied to Convolutional Neural Network for photometric redshift predictions

论文作者

Campagne, Jean-Eric

论文摘要

由于机器学习(ML)生态系统的快速发展,近年已开发了使用卷积神经网络(CNN)通过分析不同波长带中的图像来估计星系光度变速概率分布的。作者使用标准的培训和测试方法来确保其模型的概括能力,并研究了CNN体系结构,并研究了他们的性能和一些系统来源。到目前为止还不错,但是缺少一块:模型概括能力是否得到很好的测量?本文清楚地表明,非常小的图像扰动可以完全欺骗模型,并打开\ textit {对抗性}攻击的Pandora盒子。在不同的技术和场景中,我们选择使用快速符号梯度一步方法及其预计的梯度下降迭代扩展作为对抗发电机工具套件。但是,似乎这些对抗性样本不仅蒙蔽了一个模型,而且揭示了模型和经典培训的弱点。通过在训练阶段注入一小部分对抗样本,显示了重新审视的算法和应用。尽管我们的研究可以应用于其他模型(不仅在其他情况下 - 在其他情况下),但不仅在其他模型中应用了数值实验,但使用特定的CNN模型进行了插图,不仅可以应用于其他模型 - 不仅在其他情况下 - 红移测量值 - 因此它处理了边界决策表面的复杂性。

The use of Convolutional Neural Networks (CNN) to estimate the galaxy photometric redshift probability distribution by analysing the images in different wavelength bands has been developed in the recent years thanks to the rapid development of the Machine Learning (ML) ecosystem. Authors have set-up CNN architectures and studied their performances and some sources of systematics using standard methods of training and testing to ensure the generalisation power of their models. So far so good, but one piece was missing : does the model generalisation power is well measured? The present article shows clearly that very small image perturbations can fool the model completely and opens the Pandora's box of \textit{adversarial} attack. Among the different techniques and scenarios, we have chosen to use the Fast Sign Gradient one-step Method and its Projected Gradient Descent iterative extension as adversarial generator tool kit. However, as unlikely as it may seem these adversarial samples which fool not only a single model, reveal a weakness both of the model and the classical training. A revisited algorithm is shown and applied by injecting a fraction of adversarial samples during the training phase. Numerical experiments have been conducted using a specific CNN model for illustration although our study could be applied to other models - not only CNN ones - and in other contexts - not only redshift measurements - as it deals with the complexity of the boundary decision surface.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源