论文标题
一个县级数据集,用于告知美国对Covid-19的回应
A County-level Dataset for Informing the United States' Response to COVID-19
论文作者
论文摘要
随着2019年冠状病毒病(Covid-19)继续是全球大流行,政策制定者已经制定并逆转了具有各种限制的非药物干预措施,以限制其传播。数据驱动的方法分析大流行的时间特征及其对区域条件的依赖可能会提供信息以支持缓解和抑制策略的实施。为了促进美国的研究,我们提出了一个可读的机器可读数据集,该数据集汇总了美国县一级政府,新闻和学术来源的相关数据。除了来自JHU CSSE COVID-19仪表板的县级时序数据外,我们的数据集还包含300多个变量,这些变量总结了人口估计,人口统计,种族,住房,教育,就业,就业和收入,气候,过境评分,过境分数和医疗保健系统与与医疗系统相关的Metrics。此外,我们向每个县(包括杂货店和医院)提供了各种兴趣点的室外活动信息,总结了Safegraph和Google Mobility报告中的数据。我们收集了IHME,州和县级政府的信息,以及有关非药物干预措施的制定和逆转日期的报纸。通过收集这些数据,并提供阅读它们的工具,我们希望加速研究疾病如何传播以及为什么传播在各个地区之间有所不同。我们的数据集和关联的代码可在github.com/jieyingwu/covid-19_us_county-level_summaries上找到。
As the coronavirus disease 2019 (COVID-19) continues to be a global pandemic, policy makers have enacted and reversed non-pharmaceutical interventions with various levels of restrictions to limit its spread. Data driven approaches that analyze temporal characteristics of the pandemic and its dependence on regional conditions might supply information to support the implementation of mitigation and suppression strategies. To facilitate research in this direction on the example of the United States, we present a machine-readable dataset that aggregates relevant data from governmental, journalistic, and academic sources on the U.S. county level. In addition to county-level time-series data from the JHU CSSE COVID-19 Dashboard, our dataset contains more than 300 variables that summarize population estimates, demographics, ethnicity, housing, education, employment and income, climate, transit scores, and healthcare system-related metrics. Furthermore, we present aggregated out-of-home activity information for various points of interest for each county, including grocery stores and hospitals, summarizing data from SafeGraph and Google mobility reports. We compile information from IHME, state and county-level government, and newspapers for dates of the enactment and reversal of non-pharmaceutical interventions. By collecting these data, as well as providing tools to read them, we hope to accelerate research that investigates how the disease spreads and why spread may be different across regions. Our dataset and associated code are available at github.com/JieYingWu/COVID-19_US_County-level_Summaries.