有效的基于视觉上下文感测的管道

论文标题

有效的基于视觉上下文感测的管道

Efficient Pipelines for Vision-Based Context Sensing

论文作者

Liu, Xiaochen

论文摘要

上下文意识是移动和无处不在计算的重要组成部分。它的目标是揭示有关移动用户（例如位置和活动）的情境信息。感知的上下文可以实现许多服务，例如导航，AR和智能购物。可以以不同的方式感知这种上下文，包括视觉传感器。全球部署的视觉来源出现。相机可以安装在路边，内部和移动平台上。这种趋势提供了可用于上下文感测的大量视觉数据。但是，视觉数据收集和分析仍然是当今高度手动的。很难大规模部署摄像头进行数据收集。来自数据的组织和标记环境也是劳动密集型的。近年来，先进的视力算法和深层神经网络用于帮助分析视力数据。但是，这种方法受数据质量，标签工作和对硬件资源的依赖的限制。总而言之，当今基于视觉的上下文传感系统面临三个主要挑战：大规模数据收集和标记，使用有限的硬件资源有效地处理大型数据量，并从视觉数据中提取准确的上下文。论文探讨了由三个维度组成的设计空间：传感任务，传感器类型和任务位置。我们先前的工作探讨了此设计空间中的几点。我们通过（1）在基于视觉的传感任务的设计空间中为不同点开发有效且可扩展的解决方案做出贡献；（2）在这些应用程序中实现最先进的准确性；（3）并制定了设计此类传感系统的准则。

Context awareness is an essential part of mobile and ubiquitous computing. Its goal is to unveil situational information about mobile users like locations and activities. The sensed context can enable many services like navigation, AR, and smarting shopping. Such context can be sensed in different ways including visual sensors. There is an emergence of vision sources deployed worldwide. The cameras could be installed on roadside, in-house, and on mobile platforms. This trend provides huge amount of vision data that could be used for context sensing. However, the vision data collection and analytics are still highly manual today. It is hard to deploy cameras at large scale for data collection. Organizing and labeling context from the data are also labor intensive. In recent years, advanced vision algorithms and deep neural networks are used to help analyze vision data. But this approach is limited by data quality, labeling effort, and dependency on hardware resources. In summary, there are three major challenges for today's vision-based context sensing systems: data collection and labeling at large scale, process large data volumes efficiently with limited hardware resources, and extract accurate context out of vision data. The thesis explores the design space that consists of three dimensions: sensing task, sensor types, and task locations. Our prior work explores several points in this design space. We make contributions by (1) developing efficient and scalable solutions for different points in the design space of vision-based sensing tasks; (2) achieving state-of-the-art accuracy in those applications; (3) and developing guidelines for designing such sensing systems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题