论文标题
有关因果推断的调查
A Survey on Causal Inference
论文作者
论文摘要
因果推论是数十年来统计,计算机科学,教育,公共政策和经济学等许多领域的关键研究主题。如今,与随机对照试验相比,由于大量可用数据和低预算要求,观察数据的因果效应已成为一个有吸引力的研究方向。随着迅速发展的机器学习领域的拥抱,观察数据的各种因果效应估计方法已经浮出水面。在这项调查中,我们对潜在的结果框架下的因果推理方法进行了全面综述,这是众所周知的因果推理框架之一。这些方法分为两类,具体取决于它们是否需要对潜在结果框架的所有三个假设。对于每个类别,都讨论并比较了传统的统计方法和最近的机器学习增强方法。还提出了这些方法的合理应用,包括广告,建议,医学等方面的应用。此外,还总结了常用的基准数据集以及开源代码,这促进了研究人员和从业人员探索,评估和应用因果推理方法。
Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics, for decades. Nowadays, estimating causal effect from observational data has become an appealing research direction owing to the large amount of available data and low budget requirement, compared with randomized controlled trials. Embraced with the rapidly developed machine learning area, various causal effect estimation methods for observational data have sprung up. In this survey, we provide a comprehensive review of causal inference methods under the potential outcome framework, one of the well known causal inference framework. The methods are divided into two categories depending on whether they require all three assumptions of the potential outcome framework or not. For each category, both the traditional statistical methods and the recent machine learning enhanced methods are discussed and compared. The plausible applications of these methods are also presented, including the applications in advertising, recommendation, medicine and so on. Moreover, the commonly used benchmark datasets as well as the open-source codes are also summarized, which facilitate researchers and practitioners to explore, evaluate and apply the causal inference methods.