论文标题

歧视和阶级失衡在线幼稚贝叶斯

Discrimination and Class Imbalance Aware Online Naive Bayes

论文作者

Badar, Maryam, Fisichella, Marco, Iosifidis, Vasileios, Nejdl, Wolfgang

论文摘要

在当代机器学习领域中,大规模数据流的公平挖掘是一个日益严重且挑战性的关注点。许多流学习算法用于在关键决策点上替换人类,例如雇用人员,评估信用风险等。这要求以最小的响应延迟处理大规模传入信息,同时确保公平,高质量的决策。根据总体准确性优化了最近的歧视感知学习方法。但是,总体准确性是有利于多数类的偏见。因此,最新的方法主要通过部分或完全忽略少数类别来减少歧视。在这种情况下,我们提出了一种新颖的适应天真的贝叶斯,以减轻嵌入在溪流中的歧视,同时保持多数和少数族裔阶层的高预测性能。我们提出的算法简单,快速,并且实现了多目标优化目标。为了处理类不平衡和概念漂移,提出了一个动态实例加权模块,这对最近的实例更为重要,而对基于少数族裔或多数族裔的成员资格过时实例的重要性较小。我们在一系列流和静态数据集上进行了实验,并推断出我们提出的方法在歧视得分和平衡的准确性方面都优于现有的最先进的公平感知方法。

Fairness-aware mining of massive data streams is a growing and challenging concern in the contemporary domain of machine learning. Many stream learning algorithms are used to replace humans at critical decision-making points e.g., hiring staff, assessing credit risk, etc. This calls for handling massive incoming information with minimum response delay while ensuring fair and high quality decisions. Recent discrimination-aware learning methods are optimized based on overall accuracy. However, the overall accuracy is biased in favor of the majority class; therefore, state-of-the-art methods mainly diminish discrimination by partially or completely ignoring the minority class. In this context, we propose a novel adaptation of Naïve Bayes to mitigate discrimination embedded in the streams while maintaining high predictive performance for both the majority and minority classes. Our proposed algorithm is simple, fast, and attains multi-objective optimization goals. To handle class imbalance and concept drifts, a dynamic instance weighting module is proposed, which gives more importance to recent instances and less importance to obsolete instances based on their membership in minority or majority class. We conducted experiments on a range of streaming and static datasets and deduced that our proposed methodology outperforms existing state-of-the-art fairness-aware methods in terms of both discrimination score and balanced accuracy.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源