论文标题

分析网络安全中信息传递的趋势主题和基于文本的信息渠道

Analysis of Trending Topics and Text-based Channels of Information Delivery in Cybersecurity

论文作者

Wu, Tingmin, Ma, Wanlun, Wen, Sheng, Xia, Xin, Paris, Cecile, Nepal, Surya, Xiang, Yang

论文摘要

计算机用户通常面临着正确的安全决策的困难。尽管越来越多的人试图或愿意接受正规的安全培训,但包括新闻,安全博客和网站在内的在线资源不断使安全知识更容易获得。对网络安全文本的分析可以提供有关趋势主题的见解,并确定当前的安全问题以及网络攻击如何随着时间的流逝而发展。这些反过来可以支持研究人员和从业人员预测和准备这些攻击。比较不同的来源可以通过持续从不同的网络安全环境中获得的安全知识来促进普通用户的学习过程。先前的研究既没有系统地分析数字来源的广泛范围,也没有提供任何标准化,以分析最近的安全文本的趋势主题。尽管LDA在主题生成中已被广泛采用,但其生成的主题不能完全涵盖网络安全概念。为了解决这个问题,我们提出了一种半自动分类方法,以生成全面的安全类别,而不是LDA生成的主题。我们进一步根据不同来源的受欢迎程度和影响来比较确定的16个安全类别。我们揭示了一些令人惊讶的发现。 (1)来自网络安全文本所反映的影响与网络犯罪造成的货币损失密切相关。 (2)对于大多数类别,安全博客随着时间的推移具有最大的受欢迎程度和最大的绝对/相对影响。 (3)网站提供安全信息而不关心及时性,其中三分之一的文章未指定日期,其余的则在发布新出现的安全问题时滞后。

Computer users are generally faced with difficulties in making correct security decisions. While an increasingly fewer number of people are trying or willing to take formal security training, online sources including news, security blogs, and websites are continuously making security knowledge more accessible. Analysis of cybersecurity texts can provide insights into the trending topics and identify current security issues as well as how cyber attacks evolve over time. These in turn can support researchers and practitioners in predicting and preparing for these attacks. Comparing different sources may facilitate the learning process for normal users by persisting the security knowledge gained from different cybersecurity context. Prior studies neither systematically analysed the wide-range of digital sources nor provided any standardisation in analysing the trending topics from recent security texts. Although LDA has been widely adopted in topic generation, its generated topics cannot cover the cybersecurity concepts completely and considerably overlap. To address this issue, we propose a semi-automated classification method to generate comprehensive security categories instead of LDA-generated topics. We further compare the identified 16 security categories across different sources based on their popularity and impact. We have revealed several surprising findings. (1) The impact reflected from cyber-security texts strongly correlates with the monetary loss caused by cybercrimes. (2) For most categories, security blogs share the largest popularity and largest absolute/relative impact over time. (3) Websites deliver security information without caring about timeliness much, where one third of the articles do not specify the date and the rest have a time lag in posting emerging security issues.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源