论文标题
端到端深面识别的要素:对最近进步的调查
The Elements of End-to-end Deep Face Recognition: A Survey of Recent Advances
论文作者
论文摘要
面部识别是计算机视觉中最受欢迎,最长期的主题之一。随着深度学习技术和大规模数据集的最新发展,深度识别取得了显着的进步,并在许多现实世界中广泛使用。鉴于自然图像或视频框架作为输入,端到端的深面识别系统输出了面部功能以识别识别。为了实现这一目标,典型的端到端系统是由三个关键要素构建的:面部检测,面部对齐和面部表示。面部检测位于图像或框架中。然后,将面部对齐方式进行校准,以校准规范视图,并用标准化像素尺寸裁剪它们。最后,在面部表示阶段,从对齐的面上提取判别特征以识别。如今,深度卷积神经网络的技术实现了这三个要素。在这篇调查文章中,我们对每个元素的最新进展进行了全面评论。首先,我们概述了端到端的深面识别。然后,我们分别回顾了每个元素的进步,涵盖了许多方面,例如迄今为止的算法设计,评估指标,数据集,绩效比较,现有挑战以及未来研究的有希望的方向。此外,我们还提供了有关每个元素对其后续元素和整体系统的影响的详细讨论。通过这项调查,我们希望在两个方面带来贡献:首先,读者可以方便地识别出子类别中相当强大的基本风格的方法,以进行进一步探索。其次,人们还可以采用合适的方法来从头开始建立最先进的端到端面部识别系统。
Face recognition is one of the most popular and long-standing topics in computer vision. With the recent development of deep learning techniques and large-scale datasets, deep face recognition has made remarkable progress and been widely used in many real-world applications. Given a natural image or video frame as input, an end-to-end deep face recognition system outputs the face feature for recognition. To achieve this, a typical end-to-end system is built with three key elements: face detection, face alignment, and face representation. The face detection locates faces in the image or frame. Then, the face alignment is proceeded to calibrate the faces to the canonical view and crop them with a normalized pixel size. Finally, in the stage of face representation, the discriminative features are extracted from the aligned face for recognition. Nowadays, all of the three elements are fulfilled by the technique of deep convolutional neural network. In this survey article, we present a comprehensive review about the recent advance of each element. To start with, we present an overview of the end-to-end deep face recognition. Then, we review the advance of each element, respectively, covering many aspects such as the to-date algorithm designs, evaluation metrics, datasets, performance comparison, existing challenges, and promising directions for future research. Also, we provide a detailed discussion about the effect of each element on its subsequent elements and the holistic system. Through this survey, we wish to bring contributions in two aspects: first, readers can conveniently identify the methods which are quite strong-baseline style in the subcategory for further exploration; second, one can also employ suitable methods for establishing a state-of-the-art end-to-end face recognition system from scratch.