进行视频修复的两阶段框架的逐步培训

论文标题

进行视频修复的两阶段框架的逐步培训

Progressive Training of A Two-Stage Framework for Video Restoration

论文作者

Zheng, Meisong, Xing, Qunliang, Qiao, Minglang, Xu, Mai, Jiang, Lai, Liu, Huaida, Chen, Ying

论文摘要

作为一项广泛研究的任务，视频修复旨在提高具有多种潜在降解的视频质量，例如噪音，模糊和压缩工件。在视频修复体中，压缩视频质量增强和视频超分辨率是两个在实际情况下具有重要价值的主要钉。最近，由于它们在序列到序列建模中令人印象深刻的能力，复发性的神经网络和变形金刚吸引了该领域的研究兴趣。但是，对这些模型的培训不仅是昂贵的，而且还相对难以融合，梯度爆炸和消失的问题。为了应对这些问题，我们提出了一个两阶段的框架，包括多帧复发网络和单帧变压器。此外，开发了多种培训策略，例如转移学习和渐进培训，以缩短培训时间并改善模型性能。从上述技术贡献中受益，我们的解决方案赢得了两名冠军，并在NTIRE 2022超级分辨率和压缩视频挑战的质量增强中获得了亚军。代码可从https://github.com/ryanxingql/winner-ntire22-vqe获得。

As a widely studied task, video restoration aims to enhance the quality of the videos with multiple potential degradations, such as noises, blurs and compression artifacts. Among video restorations, compressed video quality enhancement and video super-resolution are two of the main tacks with significant values in practical scenarios. Recently, recurrent neural networks and transformers attract increasing research interests in this field, due to their impressive capability in sequence-to-sequence modeling. However, the training of these models is not only costly but also relatively hard to converge, with gradient exploding and vanishing problems. To cope with these problems, we proposed a two-stage framework including a multi-frame recurrent network and a single-frame transformer. Besides, multiple training strategies, such as transfer learning and progressive training, are developed to shorten the training time and improve the model performance. Benefiting from the above technical contributions, our solution wins two champions and a runner-up in the NTIRE 2022 super-resolution and quality enhancement of compressed video challenges. Code is available at https://github.com/ryanxingql/winner-ntire22-vqe.

下载PDF全文

下载文献需遵守相关版权规定

论文标题