论文标题
概念漂移探测器是可靠的警报系统吗? - 比较研究
Are Concept Drift Detectors Reliable Alarming Systems? -- A Comparative Study
论文作者
论文摘要
随着机器学习模型越来越替代生产系统中的传统业务逻辑,它们的生命周期管理正在成为一个重大问题。一旦部署到生产中,就会在新的流数据中评估机器学习模型。考虑到连续的数据流,在这种情况下,转移数据(也称为概念漂移)无处不在。概念漂移通常会影响机器学习模型的性能,因此确定需要发生概念漂移的时刻。概念漂移是通过概念漂移探测器确定的。在这项工作中,我们通过探索他们报告漂移的时间有多晚以及发出了多少个错误警报来评估概念漂移检测器的可靠性,以及时确定漂移。我们比较属于两个不同概念漂移检测器组,基于错误率的检测器和基于数据分布的检测器的最流行的漂移检测器的性能。我们评估它们在合成和现实世界数据上的性能。在合成数据的情况下,我们研究了检测器的性能,以识别两种类型的概念漂移,突然和逐渐。我们的发现旨在帮助从业者了解应在不同情况下采用哪种漂移探测器,并为此,我们共享整个研究中最重要的观察结果列表,这些观察结果可以用作实际使用的指南。此外,根据我们的经验结果,我们分析了每个概念漂移检测组的适用性,以用作警报系统。
As machine learning models increasingly replace traditional business logic in the production system, their lifecycle management is becoming a significant concern. Once deployed into production, the machine learning models are constantly evaluated on new streaming data. Given the continuous data flow, shifting data, also known as concept drift, is ubiquitous in such settings. Concept drift usually impacts the performance of machine learning models, thus, identifying the moment when concept drift occurs is required. Concept drift is identified through concept drift detectors. In this work, we assess the reliability of concept drift detectors to identify drift in time by exploring how late are they reporting drifts and how many false alarms are they signaling. We compare the performance of the most popular drift detectors belonging to two different concept drift detector groups, error rate-based detectors and data distribution-based detectors. We assess their performance on both synthetic and real-world data. In the case of synthetic data, we investigate the performance of detectors to identify two types of concept drift, abrupt and gradual. Our findings aim to help practitioners understand which drift detector should be employed in different situations and, to achieve this, we share a list of the most important observations made throughout this study, which can serve as guidelines for practical usage. Furthermore, based on our empirical results, we analyze the suitability of each concept drift detection group to be used as alarming system.