论文标题
人类的福祉和机器学习
Human Wellbeing and Machine Learning
论文作者
论文摘要
关于主观福祉的决定因素有大量文献。国际组织和统计局现在正在大规模收集此类调查数据。但是,标准回归模型令人惊讶地解释了健康状况的变化,从而限制了我们预测它的能力。作为回应,我们在这里评估了机器学习的潜力(ML),以帮助我们更好地了解健康。我们分析了来自德国,英国和美国超过一百万受访者的福利数据。就预测能力而言,我们的ML方法的性能比传统模型更好。尽管在绝对方面的改进规模很小,但与健康等关键变量相比,事实证明这是很大的。此外,我们发现,大幅扩展解释变量集使OLS的预测能力和在看不见的数据上的预测能力增加了一倍。通过我们的ML算法(即$ $材料条件,健康和有意义的社会关系)所识别的变量与文献中已经确定的变量相似。从这个意义上讲,我们的数据驱动的ML结果验证了传统方法的发现。
There is a vast literature on the determinants of subjective wellbeing. International organisations and statistical offices are now collecting such survey data at scale. However, standard regression models explain surprisingly little of the variation in wellbeing, limiting our ability to predict it. In response, we here assess the potential of Machine Learning (ML) to help us better understand wellbeing. We analyse wellbeing data on over a million respondents from Germany, the UK, and the United States. In terms of predictive power, our ML approaches do perform better than traditional models. Although the size of the improvement is small in absolute terms, it turns out to be substantial when compared to that of key variables like health. We moreover find that drastically expanding the set of explanatory variables doubles the predictive power of both OLS and the ML approaches on unseen data. The variables identified as important by our ML algorithms - $i.e.$ material conditions, health, and meaningful social relations - are similar to those that have already been identified in the literature. In that sense, our data-driven ML results validate the findings from conventional approaches.