设计有效的端到端机器学习管道，以实时空架检测

论文标题

设计有效的端到端机器学习管道，以实时空架检测

Designing an Efficient End-to-end Machine Learning Pipeline for Real-time Empty-shelf Detection

论文作者

Jha, Dipendra, Mahjoubfar, Ata, Joshi, Anupama

论文摘要

零售商店中产品的现成可用性（OSA）是快速移动的消费品和零售业的关键业务标准。当产品不存储（OO）并且客户在设计的架子上找不到它时，这激发了客户的商店转换或什么都不购买，这会导致未来的销售和需求。零售商正在采用多种方法来检测空货架并确保产品的高OSA；但是，这种方法通常是无效且不可行的，因为它们是手动，昂贵的或不太准确的。最近已经提出了基于机器学习的解决方案，但是由于缺乏大量注释的货架产品数据集，它们遭受了高计算成本和低准确性问题的困扰。在这里，我们提出了一种优雅的方法，用于设计端到端机器学习（ML）管道，以实时空架子检测。考虑到ML模型的质量和数据质量之间的强烈依赖性，我们将重点介绍适当的数据收集，清洁和正确的数据注释的重要性，然后再研究建模。由于用于实时预测的空架检测解决方案应具有计算效率，因此我们探索不同的运行时间优化以改善模型性能。我们的数据集包含1000张图像，通过以下定义明确的指南收集和注释。我们的低延迟模型的平均F1得分平均为68.5％，并且在Intel Xeon Gold上最多可处理67张图像/s，并且在A100 GPU上最多可处理860张图像/s。

On-Shelf Availability (OSA) of products in retail stores is a critical business criterion in the fast moving consumer goods and retails sector. When a product is out-of-stock (OOS) and a customer cannot find it on its designed shelf, this motivates the customer to store-switching or buying nothing, which causes fall in future sales and demands. Retailers are employing several approaches to detect empty shelves and ensure high OSA of products; however, such methods are generally ineffective and infeasible since they are either manual, expensive or less accurate. Recently machine learning based solutions have been proposed, but they suffer from high computational cost and low accuracy problem due to lack of large annotated datasets of on-shelf products. Here, we present an elegant approach for designing an end-to-end machine learning (ML) pipeline for real-time empty shelf detection. Considering the strong dependency between the quality of ML models and the quality of data, we focus on the importance of proper data collection, cleaning and correct data annotation before delving into modeling. Since an empty-shelf detection solution should be computationally-efficient for real-time predictions, we explore different run-time optimizations to improve the model performance. Our dataset contains 1000 images, collected and annotated by following well-defined guidelines. Our low-latency model achieves a mean average F1-score of 68.5%, and can process up to 67 images/s on Intel Xeon Gold and up to 860 images/s on an A100 GPU.

下载PDF全文

下载文献需遵守相关版权规定

论文标题