论文标题
人类援助下的分类
Classification Under Human Assistance
论文作者
论文摘要
大多数监督学习模型都经过完整的自动化培训。但是,在某些特定情况下,他们的预测有时比人类专家的预测还要糟糕。我们的目标是在这种经验观察中的动机,是设计经过优化以在不同自动化水平下运行的分类器。更具体地说,我们专注于基于凸的分类器,并首先表明问题是NP-HARD。然后,我们进一步表明,对于支持向量机,相应的目标函数可以表示为两个函数f = g-c的差,其中g是单调,非负和γ呈γ的subsodular,c是非阴性和模块化的。这种表示允许最近引入的确定性贪婪算法以及算法的更有效的随机变体,可以保证解决问题。来自医学诊断的几个应用程序的合成和现实数据的实验说明了我们的理论发现,并表明,在人力援助下,受过培训的监督学习模型可以在不同的自动化水平下进行操作,这可以优于那些接受全自动的培训以及单独运作的人类。
Most supervised learning models are trained for full automation. However, their predictions are sometimes worse than those by human experts on some specific instances. Motivated by this empirical observation, our goal is to design classifiers that are optimized to operate under different automation levels. More specifically, we focus on convex margin-based classifiers and first show that the problem is NP-hard. Then, we further show that, for support vector machines, the corresponding objective function can be expressed as the difference of two functions f = g - c, where g is monotone, non-negative and γ-weakly submodular, and c is non-negative and modular. This representation allows a recently introduced deterministic greedy algorithm, as well as a more efficient randomized variant of the algorithm, to enjoy approximation guarantees at solving the problem. Experiments on synthetic and real-world data from several applications in medical diagnosis illustrate our theoretical findings and demonstrate that, under human assistance, supervised learning models trained to operate under different automation levels can outperform those trained for full automation as well as humans operating alone.