论文标题

Dropnas:分组的操作辍学用于可区分体系结构搜索

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

论文作者

Hong, Weijun, Li, Guilin, Zhang, Weinan, Tang, Ruiming, Wang, Yunhe, Li, Zhenguo, Yu, Yong

论文摘要

神经体系结构搜索(NAS)表现出令人鼓舞的结果,可以自动化体系结构设计。最近,DARTS通过可区分的公式放松搜索过程,该配方利用体重分享和SGD,同时训练所有候选人的操作。我们的经验结果表明,这种过程导致共同适应问题和MATTHEW效应:更少参数的操作将早于成熟。这引起了两个问题:首先,具有更多参数的操作可能永远不会有机会表达所需的功能,因为那些较少的人已经完成了工作。其次,该系统将通过降低其体系结构参数来惩罚那些表现不佳的操作,并且它们将获得较小的损失梯度,从而导致Matthew效应。在本文中,我们系统地研究了这些问题,并提出了一种新型的分组操作辍学算法,以解决飞镖的问题。广泛的实验表明,DropNA解决了上述问题并实现了有希望的表现。具体而言,DropNA在CIFAR-10上达到了2.26%的测试误差,CIFAR-100的16.39%和Imagenet的23.4%(具有与DARTS相同的训练超级仪以进行公平比较)。还可以观察到,在飞镖搜索空间的变体中,DropNA具有强大的功能。代码可从https://github.com/wiljohnhong/dropnas获得。

Neural architecture search (NAS) has shown encouraging results in automating the architecture design. Recently, DARTS relaxes the search process with a differentiable formulation that leverages weight-sharing and SGD where all candidate operations are trained simultaneously. Our empirical results show that such procedure results in the co-adaption problem and Matthew Effect: operations with fewer parameters would be trained maturely earlier. This causes two problems: firstly, the operations with more parameters may never have the chance to express the desired function since those with less have already done the job; secondly, the system will punish those underperforming operations by lowering their architecture parameter, and they will get smaller loss gradients, which causes the Matthew Effect. In this paper, we systematically study these problems and propose a novel grouped operation dropout algorithm named DropNAS to fix the problems with DARTS. Extensive experiments demonstrate that DropNAS solves the above issues and achieves promising performance. Specifically, DropNAS achieves 2.26% test error on CIFAR-10, 16.39% on CIFAR-100 and 23.4% on ImageNet (with the same training hyperparameters as DARTS for a fair comparison). It is also observed that DropNAS is robust across variants of the DARTS search space. Code is available at https://github.com/wiljohnhong/DropNAS.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源