论文标题
多臂强盗中的通用异常检测
Generic Outlier Detection in Multi-Armed Bandit
论文作者
论文摘要
在本文中,我们研究了多武器匪徒设置中的异常手臂检测问题,该设置在许多高影响力领域(例如金融,医疗保健和在线广告)中找到了许多应用程序。对于这个问题,学习者旨在确定其预期奖励显着偏离其他大多数武器的武器。与现有工作不同,我们针对的是通用的离群臂或离群臂群,其预期的奖励可以更大,较小,甚至在正常武器之间。为此,我们首先提供了此类通用的离群武器和离群臂组的全面定义。然后,我们提出了一种名为Gold的小说拉动算法,以识别这种通用的异常臂。它根据上限范围构建了实时邻域图,并从正常手臂中捕获异常值的行为模式。我们还从各个方面分析了其性能。在对合成和现实世界数据集进行的实验中,与最先进的技术相比,该算法达到了98%的精度,同时节省了83%的勘探成本。
In this paper, we study the problem of outlier arm detection in multi-armed bandit settings, which finds plenty of applications in many high-impact domains such as finance, healthcare, and online advertising. For this problem, a learner aims to identify the arms whose expected rewards deviate significantly from most of the other arms. Different from existing work, we target the generic outlier arms or outlier arm groups whose expected rewards can be larger, smaller, or even in between those of normal arms. To this end, we start by providing a comprehensive definition of such generic outlier arms and outlier arm groups. Then we propose a novel pulling algorithm named GOLD to identify such generic outlier arms. It builds a real-time neighborhood graph based on upper confidence bounds and catches the behavior pattern of outliers from normal arms. We also analyze its performance from various aspects. In the experiments conducted on both synthetic and real-world data sets, the proposed algorithm achieves 98 % accuracy while saving 83 % exploration cost on average compared with state-of-the-art techniques.