论文标题
长尾视觉识别的调查
A Survey on Long-Tailed Visual Recognition
论文作者
论文摘要
对数据的严重依赖是目前限制深度学习发展的主要原因之一。数据质量直接主导着深度学习模型的效果,而长尾巴分布是影响数据质量的因素之一。长尾现象由于自然界中的权力法而普遍存在。在这种情况下,深度学习模型的表现通常由头等阶层主导,而尾巴课程的学习严重欠发达。为了对所有课程进行足够的学习,许多研究人员都研究并初步解决了长期的问题。在这项调查中,我们关注由长尾数据分布引起的问题,整理代表性的长尾视觉识别数据集并总结一些主流的长尾研究。具体而言,我们将这些研究从表示学习的角度总结为十个类别,并概述了每个类别的亮点和局限性。此外,我们研究了四个定量指标来评估失衡,并建议使用Gini系数评估数据集的长尾巴。基于GINI系数,我们定量研究了过去十年中提出的20种广泛使用和大规模的视觉数据集,并发现长尾现象广泛,尚未得到充分研究。最后,我们为开发长尾学习提供了多个未来的方向,为读者提供更多想法。
The heavy reliance on data is one of the major reasons that currently limit the development of deep learning. Data quality directly dominates the effect of deep learning models, and the long-tailed distribution is one of the factors affecting data quality. The long-tailed phenomenon is prevalent due to the prevalence of power law in nature. In this case, the performance of deep learning models is often dominated by the head classes while the learning of the tail classes is severely underdeveloped. In order to learn adequately for all classes, many researchers have studied and preliminarily addressed the long-tailed problem. In this survey, we focus on the problems caused by long-tailed data distribution, sort out the representative long-tailed visual recognition datasets and summarize some mainstream long-tailed studies. Specifically, we summarize these studies into ten categories from the perspective of representation learning, and outline the highlights and limitations of each category. Besides, we have studied four quantitative metrics for evaluating the imbalance, and suggest using the Gini coefficient to evaluate the long-tailedness of a dataset. Based on the Gini coefficient, we quantitatively study 20 widely-used and large-scale visual datasets proposed in the last decade, and find that the long-tailed phenomenon is widespread and has not been fully studied. Finally, we provide several future directions for the development of long-tailed learning to provide more ideas for readers.