姿势示意性的对比度学习，以进行自我监督的面部表现

论文标题

姿势示意性的对比度学习，以进行自我监督的面部表现

Pose-disentangled Contrastive Learning for Self-supervised Facial Representation

论文作者

Liu, Yuanyuan, Wang, Wenbin, Zhan, Yibing, Feng, Shaoze, Liu, Kejun, Chen, Zhe

论文摘要

自我监督的面部表现最近由于其表现面部理解的能力而不依赖大规模注释的数据集而引起了越来越多的关注。但是，从分析上讲，目前基于对比的自我监督学习（SSL）仍然在学习面部表现方面表现不佳。更具体地说，现有的对比学习（CL）倾向于学习姿势不变的特征，这些特征无法描绘面部的姿势细节，从而损害了学习表现。为了征服上述CL的局限性，我们提出了一种新颖的姿势 - 偏见的对比学习（PCL）方法，用于一般自我监督的面部表现。我们的PCL首先使用精心设计的正交调节设计了一个姿势 - 符合解码器（PDD），该调节将与姿势相关的特征与面部感知功能相关。因此，可以在单个子网中执行与姿势相关和其他姿势无关的面部信息，并且不会影响彼此的培训。此外，我们引入了一种与姿势相关的对比学习方案，该方案基于同一图像的数据增强来学习与姿势相关的信息，这将为各种下游任务提供更有效的表情表示。我们对四个具有挑战性的下游面部理解任务进行线性评估，即，面部表达识别，面部识别，AU检测和头部姿势估计。实验结果表明，我们的方法显着胜过最先进的SSL方法。代码可从https://github.com/dreammr/pcl} {https://github.com/dreammr/pcl获得

Self-supervised facial representation has recently attracted increasing attention due to its ability to perform face understanding without relying on large-scale annotated datasets heavily. However, analytically, current contrastive-based self-supervised learning (SSL) still performs unsatisfactorily for learning facial representation. More specifically, existing contrastive learning (CL) tends to learn pose-invariant features that cannot depict the pose details of faces, compromising the learning performance. To conquer the above limitation of CL, we propose a novel Pose-disentangled Contrastive Learning (PCL) method for general self-supervised facial representation. Our PCL first devises a pose-disentangled decoder (PDD) with a delicately designed orthogonalizing regulation, which disentangles the pose-related features from the face-aware features; therefore, pose-related and other pose-unrelated facial information could be performed in individual subnetworks and do not affect each other's training. Furthermore, we introduce a pose-related contrastive learning scheme that learns pose-related information based on data augmentation of the same image, which would deliver more effective face-aware representation for various downstream tasks. We conducted linear evaluation on four challenging downstream facial understanding tasks, ie, facial expression recognition, face recognition, AU detection and head pose estimation. Experimental results demonstrate that our method significantly outperforms state-of-the-art SSL methods. Code is available at https://github.com/DreamMr/PCL}{https://github.com/DreamMr/PCL

下载PDF全文

下载文献需遵守相关版权规定

论文标题