脸部脱离像素化的神经对准

论文标题

脸部脱离像素化的神经对准

Neural Alignment for Face De-pixelization

论文作者

Shuvi, Maayan, Fish, Noa, Aberman, Kfir, Shamir, Ariel, Cohen-Or, Daniel

论文摘要

我们提出了一种简单的方法，可以重建来自面部视频的高分辨率视频，其中一个人的身份被像素化遮盖了。这种隐藏方法很受欢迎，因为观众仍然可以感知人脸的人物和整体头部运动。但是，我们在实验中表明，可以以损害匿名性的方式重建原始视频的相当良好的近似值。我们的系统利用了描绘人脸的近距离视频框架之间的同时相似性和微小的差异，并采用了空间转换组件，以了解像素化帧之间的对齐方式。每个帧都由其对齐的周围框架支撑，首先编码，然后解码为更高的分辨率。重建和感知损失促进了遵守地面真相，对抗性损失有助于维持领域的忠诚。无需明确的时间连贯性损失，因为相邻框架和重建的对齐方式隐含地保持了。尽管很简单，但我们的框架综合了高质量的面部重建，证明鉴于人脸的统计先验，多个对齐的像素化框架包含足够的信息，可以重建对原始信号的高质量近似。

We present a simple method to reconstruct a high-resolution video from a face-video, where the identity of a person is obscured by pixelization. This concealment method is popular because the viewer can still perceive a human face figure and the overall head motion. However, we show in our experiments that a fairly good approximation of the original video can be reconstructed in a way that compromises anonymity. Our system exploits the simultaneous similarity and small disparity between close-by video frames depicting a human face, and employs a spatial transformation component that learns the alignment between the pixelated frames. Each frame, supported by its aligned surrounding frames, is first encoded, then decoded to a higher resolution. Reconstruction and perceptual losses promote adherence to the ground-truth, and an adversarial loss assists in maintaining domain faithfulness. There is no need for explicit temporal coherency loss as it is maintained implicitly by the alignment of neighboring frames and reconstruction. Although simple, our framework synthesizes high-quality face reconstructions, demonstrating that given the statistical prior of a human face, multiple aligned pixelated frames contain sufficient information to reconstruct a high-quality approximation of the original signal.

下载PDF全文

下载文献需遵守相关版权规定

论文标题