公平引导的基于SMT的决策树和随机森林的纠正

论文标题

公平引导的基于SMT的决策树和随机森林的纠正

Fairness-guided SMT-based Rectification of Decision Trees and Random Forests

论文作者

Zhang, Jiang, Beschastnikh, Ivan, Mechtaev, Sergey, Roychoudhury, Abhik

论文摘要

以数据为导向的决策在各种机器学习模型的受欢迎程度中变得越来越突出。不幸的是，机器学习训练中使用的现实生活数据可能会捕获人类的偏见，因此，学习的模型可能导致不公平的决策。在本文中，我们为决策树和随机森林提供了解决方案。我们的方法将任何决策树或随机森林转换为公平的森林，相对于特定的数据集，公平标准和敏感属性。基于我们的方法构建的\ emph {FairRepair}工具的灵感来自传统程序的自动化程序维修技术。它使用SMT求解器来确定决策树中哪些路径可以使结果倒转以改善模型的公平性。我们对来自UC Irvine的著名成人数据集进行的实验表明，Fairrepair尺度到现实的决策树和随机森林。此外，费尔雷帕尔（Fairrepair）提供了有关寻找维修的合理性和完整性的正式保证。由于我们的公平指导的维修技术可以修复从给定（不公平）数据集获得的决策树和随机森林，因此它可以帮助识别和纠正组织决策中的偏见。

Data-driven decision making is gaining prominence with the popularity of various machine learning models. Unfortunately, real-life data used in machine learning training may capture human biases, and as a result the learned models may lead to unfair decision making. In this paper, we provide a solution to this problem for decision trees and random forests. Our approach converts any decision tree or random forest into a fair one with respect to a specific data set, fairness criteria, and sensitive attributes. The \emph{FairRepair} tool, built based on our approach, is inspired by automated program repair techniques for traditional programs. It uses an SMT solver to decide which paths in the decision tree could have their outcomes flipped to improve the fairness of the model. Our experiments on the well-known adult dataset from UC Irvine demonstrate that FairRepair scales to realistic decision trees and random forests. Furthermore, FairRepair provides formal guarantees about soundness and completeness of finding a repair. Since our fairness-guided repair technique repairs decision trees and random forests obtained from a given (unfair) data-set, it can help to identify and rectify biases in decision-making in an organisation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题