论文标题
一种新型的流行病学方法,用于通过Google趋势在美国地理绘制人群干眼症
A Novel Epidemiological Approach to Geographically Mapping Population Dry Eye Disease in the United States through Google Trends
论文作者
论文摘要
干眼症(DED)影响了大约一半的美国人口。由于多种原因,DED的特征是在Corena表面上干燥。这项研究通过使用Google趋势作为与环境风险因素有关的地理映射的新型流行病学工具,填补了DED流行病学中的时空差距。我们利用Google趋势来提取与2004 - 2019年在美国的用户意图估算用户意图的特定查询。我们合并了国家气候数据,以生成比较DED的地理,时间和环境关系的热图。构建了多变量回归模型,以生成预测DED和控制搜索的二次预测。我们的结果说明了美国地理环境中DED搜索量的上升趋势,季节性模式,环境影响和空间关系。在海岸线上可视化了DED兴趣的本地化补丁。美国人口普查区域的DED查询没有显着差异。回归模型1预测DED搜索会随着时间的流逝(R^2 = 0.97),其重要预测因子为控制查询(P = 0.0024),时间(P = 0.001)和季节性(冬季P = 0.0028; Spring P <0.001; Spring P <0.001; Summer P = 0.018)。回归模型2预测了每个状态(r^2 = 0.49)的DED查询,其温度为温度(p = 0.0003)和沿海区(p = 0.025)。重要的是,临床文献可能表明的是,温度,沿海地位和季节性是DED搜索的风险因素比湿度,阳光,污染或区域更强。我们的工作为未来探索地理信息系统的探索铺平了道路,用于通过在线搜索查询指标来定位DED和其他疾病。
Dry eye disease (DED) affects approximately half of the United States population. DED is characterized by dryness on the corena surface due to a variety of causes. This study fills the spatiotemporal gaps in DED epidemiology by using Google Trends as a novel epidemiological tool for geographically mapping DED in relation to environmental risk factors. We utilized Google Trends to extract DED-related queries estimating user intent from 2004-2019 in the United States. We incorporated national climate data to generate heat maps comparing geographic, temporal, and environmental relationships of DED. Multi-variable regression models were constructed to generate quadratic forecasts predicting DED and control searches. Our results illustrated the upward trend, seasonal pattern, environmental influence, and spatial relationship of DED search volume across US geography. Localized patches of DED interest were visualized along the coastline. There was no significant difference in DED queries across US census regions. Regression model 1 predicted DED searches over time (R^2=0.97) with significant predictors being control queries (p=0.0024), time (p=0.001), and seasonality (Winter p=0.0028; Spring p<0.001; Summer p=0.018). Regression model 2 predicted DED queries per state (R^2=0.49) with significant predictors being temperature (p=0.0003) and coastal zone (p=0.025). Importantly, temperature, coastal status, and seasonality were stronger risk factors of DED searches than humidity, sunshine, pollution, or region as clinical literature may suggest. Our work paves the way for future exploration of geographic information systems for locating DED and other diseases via online search query metrics.