论文标题

COVID-19的大型阿拉伯Twitter数据集

Large Arabic Twitter Dataset on COVID-19

论文作者

Alqurashi, Sarah, Alhindi, Ahmad, Alanazi, Eisa

论文摘要

2019年冠状病毒病(Covid-19)于2019年12月下旬出现在中国,现在正在全球迅速蔓延。在撰写本文时,全球确认的案件的数量已经超过18万名死亡。许多国家已经执行严格的社会距离政策,以遏制病毒的传播。这改变了数千万人的日常生活,并敦促人们通过Twitter等在线社交媒体网站在线上进行讨论。在这项工作中,我们描述了自2020年1月1日以来一直在收集的Covid-19上的第一个阿拉伯语推文数据集。该数据集将帮助研究人员和政策制定者研究与大流行有关的不同社会问题。也可以分析许多与行为改变,信息共享,错误信息和谣言传播有关的任务。

The 2019 coronavirus disease (COVID-19), emerged late December 2019 in China, is now rapidly spreading across the globe. At the time of writing this paper, the number of global confirmed cases has passed two millions and half with over 180,000 fatalities. Many countries have enforced strict social distancing policies to contain the spread of the virus. This have changed the daily life of tens of millions of people, and urged people to turn their discussions online, e.g., via online social media sites like Twitter. In this work, we describe the first Arabic tweets dataset on COVID-19 that we have been collecting since January 1st, 2020. The dataset would help researchers and policy makers in studying different societal issues related to the pandemic. Many other tasks related to behavioral change, information sharing, misinformation and rumors spreading can also be analyzed.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源