论文标题
TENFOR:一种基于张量的工具,可从安全论坛中提取有趣的事件
TenFor: A Tensor-Based Tool to Extract Interesting Events from Security Forums
论文作者
论文摘要
我们如何获得一个安全论坛来“告诉我们”其感兴趣的活动和事件?我们采用一个独特的角度:我们希望在没有任何先验知识的情况下识别这些活动,这与以前的大多数问题配方相比是一个关键区别。尽管最近进行了一些努力,但采矿安全论坛提取有用信息的关注却相对较少,而大多数人通常正在寻找特定的信息。我们提出了一种基于张量的方法Tenfor,以系统地确定三维空间中的重要事件:(a)用户,(b)线程和(c)时间。我们的方法包括三个高级步骤:(a)在三个维度上进行基于张量的聚类,(b)使用内容和行为特征的广泛集群分析,以及(c)更深入的研究,我们在其中识别了关键用户和线程。此外,我们将方法作为实践者的功能强大且易于使用的平台。在我们的评估中,我们发现83%的集群捕获了有意义的事件,并且与以前的方法相比,我们发现了更有意义的群集。我们的方法和我们的平台构成了以无监督的学习方式从论坛中检测有趣的活动的重要一步。
How can we get a security forum to "tell" us its activities and events of interest? We take a unique angle: we want to identify these activities without any a priori knowledge, which is a key difference compared to most of the previous problem formulations. Despite some recent efforts, mining security forums to extract useful information has received relatively little attention, while most of them are usually searching for specific information. We propose TenFor, an unsupervised tensor-based approach, to systematically identify important events in a three-dimensional space: (a) user, (b) thread, and (c) time. Our method consists of three high-level steps: (a) a tensor-based clustering across the three dimensions, (b) an extensive cluster profiling that uses both content and behavioral features, and (c) a deeper investigation, where we identify key users and threads within the events of interest. In addition, we implement our approach as a powerful and easy-to-use platform for practitioners. In our evaluation, we find that 83% of our clusters capture meaningful events and we find more meaningful clusters compared to previous approaches. Our approach and our platform constitute an important step towards detecting activities of interest from a forum in an unsupervised learning fashion in practice.