论文标题
全向视觉的深度学习:调查和新观点
Deep Learning for Omnidirectional Vision: A Survey and New Perspectives
论文作者
论文摘要
全向图像(ODI)数据是用360x180视野捕获的,它比针孔摄像机宽得多,并且包含比常规平面图像更丰富的空间信息。因此,全向视觉引起了人们在众多应用中(例如自动驾驶和虚拟现实)中更有利的性能,引起了人们的关注。近年来,客户级360摄像机的可用性使全向视觉变得更加流行,并且深度学习的进步(DL)显着引发了其研究和应用。本文对DL方法的最新进展进行了系统的全面综述和分析。我们的工作涵盖了四个主要内容:(i)与2D平面图像数据相比,全向成像原理,ODI上的卷积方法以及数据集的介绍; (ii)全向视觉的DL方法的结构和层次分类学; (iii)最新新颖的学习策略和应用的总结; (iv)通过强调触发社区更多研究的潜在研究方向,对挑战和开放问题进行深入的讨论。
Omnidirectional image (ODI) data is captured with a 360x180 field-of-view, which is much wider than the pinhole cameras and contains richer spatial information than the conventional planar images. Accordingly, omnidirectional vision has attracted booming attention due to its more advantageous performance in numerous applications, such as autonomous driving and virtual reality. In recent years, the availability of customer-level 360 cameras has made omnidirectional vision more popular, and the advance of deep learning (DL) has significantly sparked its research and applications. This paper presents a systematic and comprehensive review and analysis of the recent progress in DL methods for omnidirectional vision. Our work covers four main contents: (i) An introduction to the principle of omnidirectional imaging, the convolution methods on the ODI, and datasets to highlight the differences and difficulties compared with the 2D planar image data; (ii) A structural and hierarchical taxonomy of the DL methods for omnidirectional vision; (iii) A summarization of the latest novel learning strategies and applications; (iv) An insightful discussion of the challenges and open problems by highlighting the potential research directions to trigger more research in the community.