论文标题

有条件地生成医疗时间序列,以推断出代表性不足的人群

Conditional Generation of Medical Time Series for Extrapolation to Underrepresented Populations

论文作者

Bing, Simon, Dittadi, Andrea, Bauer, Stefan, Schwab, Patrick

论文摘要

电子健康记录(EHRS)的广泛采用以及随后的纵向医疗保健数据的可用性增加,导致了我们对健康和疾病的理解,直接和直接影响了新诊断和治疗治疗方案的发展。但是,由于其感知敏感性和相关的法律问题,通常会限制获得EHR,并且其中的同伙通常是在特定医院或医院网络上看到的,因此不代表更广泛的患者人群。在这里,我们提出了HealthGen,这是一种有条件地生成合成EHR的新方法,该方法可以准确表示实际患者特征,时间信息和缺失模式。我们在实验上证明了HealthGen产生的合成群体对实际患者EHR的忠诚程度明显比当前的最新面积更为忠诚,并且通过有条件生成的患者体积不足的亚群的有条件产生的同类群体增强实际数据集可以显着增强从这些数据集中到不同患者群体的模型的普遍性。合成的有条件生成的EHR可以帮助增加纵向医疗保健数据集的可访问性,并提高这些数据集对代表性不足人群的推论的普遍性。

The widespread adoption of electronic health records (EHRs) and subsequent increased availability of longitudinal healthcare data has led to significant advances in our understanding of health and disease with direct and immediate impact on the development of new diagnostics and therapeutic treatment options. However, access to EHRs is often restricted due to their perceived sensitive nature and associated legal concerns, and the cohorts therein typically are those seen at a specific hospital or network of hospitals and therefore not representative of the wider population of patients. Here, we present HealthGen, a new approach for the conditional generation of synthetic EHRs that maintains an accurate representation of real patient characteristics, temporal information and missingness patterns. We demonstrate experimentally that HealthGen generates synthetic cohorts that are significantly more faithful to real patient EHRs than the current state-of-the-art, and that augmenting real data sets with conditionally generated cohorts of underrepresented subpopulations of patients can significantly enhance the generalisability of models derived from these data sets to different patient populations. Synthetic conditionally generated EHRs could help increase the accessibility of longitudinal healthcare data sets and improve the generalisability of inferences made from these data sets to underrepresented populations.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源