使用伪影放大的框架插值基准测试的主观注释

论文标题

使用伪影放大的框架插值基准测试的主观注释

Subjective Annotation for a Frame Interpolation Benchmark using Artefact Amplification

论文作者

Men, Hui, Hosu, Vlad, Lin, Hanhe, Bruhn, Andrés, Saupe, Dietmar

论文摘要

光流算法的当前基准测试通过将预测的流场与地面真相进行比较，或通过使用预测的流场进行框架插值，然后将插值框架与实际帧进行比较，从而评估了估计。在后一种情况下，通常采用客观质量措施（例如平方误差）。但是，众所周知，对于图像质量评估，用户所经历的实际质量不能从这种简单的措施中完全推导。因此，我们对米德尔伯里基准之一的光流基准之一提供的插值框架进行了主观质量评估人群群体。我们收集了插值图像与相应地面真理之间的强制选择配对的比较。为了提高观察者在配对比较中的微小差异时，我们引入了一种新的方法，以称为“人工制作放大”领域。从众包数据中，我们根据Thurstone的模型重建了绝对质量尺度值。结果，我们获得了155个参与算法W.R.T.的重新排列。插值框架的视觉质量。这种重新排列的不仅显示了视觉质量评估的必要性，作为光流和框架插值基准测试的另一种评估指标，结果还为设计新的图像质量评估（IQA）方法提供了基础真实，该方法专门针对插值图像的知觉质量。作为第一步，我们提出了一种新的全参考方法，称为WAE-IQA。通过权衡插值图像与其地面真相之间的局部差异，wae-iqa的表现比文献中目前最佳的fr-iqa方法要好得多。

Current benchmarks for optical flow algorithms evaluate the estimation either directly by comparing the predicted flow fields with the ground truth or indirectly by using the predicted flow fields for frame interpolation and then comparing the interpolated frames with the actual frames. In the latter case, objective quality measures such as the mean squared error are typically employed. However, it is well known that for image quality assessment, the actual quality experienced by the user cannot be fully deduced from such simple measures. Hence, we conducted a subjective quality assessment crowdscouring study for the interpolated frames provided by one of the optical flow benchmarks, the Middlebury benchmark. We collected forced-choice paired comparisons between interpolated images and corresponding ground truth. To increase the sensitivity of observers when judging minute difference in paired comparisons we introduced a new method to the field of full-reference quality assessment, called artefact amplification. From the crowdsourcing data, we reconstructed absolute quality scale values according to Thurstone's model. As a result, we obtained a re-ranking of the 155 participating algorithms w.r.t. the visual quality of the interpolated frames. This re-ranking not only shows the necessity of visual quality assessment as another evaluation metric for optical flow and frame interpolation benchmarks, the results also provide the ground truth for designing novel image quality assessment (IQA) methods dedicated to perceptual quality of interpolated images. As a first step, we proposed such a new full-reference method, called WAE-IQA. By weighing the local differences between an interpolated image and its ground truth WAE-IQA performed slightly better than the currently best FR-IQA approach from the literature.

下载PDF全文

下载文献需遵守相关版权规定

论文标题