修改双眼视频

论文标题

修改双眼视频

Mononizing Binocular Videos

论文作者

Hu, Wenbo, Xia, Menghan, Fu, Chi-Wing, Wong, Tien-Tsin

论文摘要

本文介绍了单眼视频的概念和有效实现的框架。单核意味着我们故意将附带视频转换为常规的单眼视频，其立体声信息图像以视觉但几乎侵略的形式编码。因此，Wecan公正地分发并将单人视频显示为普通的单眼视频。与普通的单眼视频不同，我们可以从原始的双眼视频中恢复，并在立体显示器上显示。首先，我们使用锥体可去模块的编码和编码框架制定了编码和编码框架，以利用theleft和正确视图之间的远程对应关系，一个量化层，以抑制恢复的工件，以及压缩噪声模拟模块以抵制由现代视频编码器引入的压缩措施。我们的框架是自我监督的，因为我们用输入中定义的损失术语表达了目标功能：创建单眼视频的单眼术语，可恢复原始视频的可逆性术语以及用于框架到framecoherence的时间术语。此外，我们进行了广泛的实验，以评估我们的共同的单人视频，并恢复了图像和3D电影的各种类型的双眼视频。标准指标和用户感知研究的定量结果均显示了我们方法的有效性。

This paper presents the idea ofmono-nizingbinocular videos and a frame-work to effectively realize it. Mono-nize means we purposely convert abinocular video into a regular monocular video with the stereo informationimplicitly encoded in a visual but nearly-imperceptible form. Hence, wecan impartially distribute and show the mononized video as an ordinarymonocular video. Unlike ordinary monocular videos, we can restore from itthe original binocular video and show it on a stereoscopic display. To start,we formulate an encoding-and-decoding framework with the pyramidal de-formable fusion module to exploit long-range correspondences between theleft and right views, a quantization layer to suppress the restoring artifacts,and the compression noise simulation module to resist the compressionnoise introduced by modern video codecs. Our framework is self-supervised,as we articulate our objective function with loss terms defined on the input:a monocular term for creating the mononized video, an invertibility termfor restoring the original video, and a temporal term for frame-to-framecoherence. Further, we conducted extensive experiments to evaluate ourgenerated mononized videos and restored binocular videos for diverse typesof images and 3D movies. Quantitative results on both standard metrics anduser perception studies show the effectiveness of our method.

下载PDF全文

下载文献需遵守相关版权规定

论文标题