论文标题
意义与敏感性:描述社交媒体用户有关使用有争议的术语的COVID-19
Sense and Sensibility: Characterizing Social Media Users Regarding the Use of Controversial Terms for COVID-19
论文作者
论文摘要
随着2019年新颖的冠状病毒的全球发展,尽管他正式宣布该病为Covid-19,但许多人仍在使用一个有争议的术语 - “中国病毒”仍在使用。同时,全球在线媒体报道与共同19的种族攻击有关,其中大多数是反中国人或反亚洲的。随着这个大流行变得越来越严重,越来越多的人开始在Twitter等社交媒体平台上谈论它。当他们指的是Covid-19时,主要有两种方法:使用有争议的术语,例如“中国病毒”或“ Wuhan病毒”,或使用“冠状病毒”等非争辩术语。在这项研究中,我们试图表征使用有争议的术语以及使用非争议术语的Twitter用户。我们使用Tweepy API检索1700万条相关的推文及其作者的信息。我们发现,这两组Twitter用户之间的人口统计数据之间存在显着差异,用户级的功能,例如关注者的数量,政治以下状况以及其地理位置。此外,我们应用分类模型来预测更可能使用有争议的术语的Twitter用户。据我们所知,这是第一个基于社交媒体的大规模研究,旨在在重大危机期间用户使用有争议的术语来表征用户的特征。
With the world-wide development of 2019 novel coronavirus, although WHO has officially announced the disease as COVID-19, one controversial term - "Chinese Virus" is still being used by a great number of people. In the meantime, global online media coverage about COVID-19-related racial attacks increases steadily, most of which are anti-Chinese or anti-Asian. As this pandemic becomes increasingly severe, more people start to talk about it on social media platforms such as Twitter. When they refer to COVID-19, there are mainly two ways: using controversial terms like "Chinese Virus" or "Wuhan Virus", or using non-controversial terms like "Coronavirus". In this study, we attempt to characterize the Twitter users who use controversial terms and those who use non-controversial terms. We use the Tweepy API to retrieve 17 million related tweets and the information of their authors. We find significant differences between these two groups of Twitter users across their demographics, user-level features like the number of followers, political following status, as well as their geo-locations. Moreover, we apply classification models to predict Twitter users who are more likely to use controversial terms. To our best knowledge, this is the first large-scale social media-based study to characterize users with respect to their usage of controversial terms during a major crisis.