论文标题
归因图的半监督异常检测
Semi-supervised Anomaly Detection on Attributed Graphs
论文作者
论文摘要
我们提出了一种简单而有效的方法,用于在属性图上检测具有少数实例的标签信息的异常实例。尽管使用标准的异常检测方法,通常假定实例是独立的且分布相同的,在许多现实世界中,实例通常会显式相互连接,从而产生所谓的属性图。所提出的方法通过考虑其属性以及基于图形卷积网络(GCN)的图形结构,将节点(实例)嵌入到潜在空间中的属性图上。为了学习专门用于异常检测的节点嵌入,在这种嵌入中,由于罕见的异常性,因此存在类不平衡的嵌入,GCN的参数经过培训,可以最大程度地减少嵌入正常实例嵌入异常异态异态异常嵌入的超孔的体积。这使我们能够通过简单地计算节点嵌入和超孔中心之间的距离来检测异常。提出的方法可以通过考虑节点的属性,图形结构和类不平衡来有效地传播有关少量节点的标签信息,以使其无标记。在使用五个现实世界归因的图形数据集的实验中,我们证明所提出的方法比各种现有的异常检测方法实现了更好的性能。
We propose a simple yet effective method for detecting anomalous instances on an attribute graph with label information of a small number of instances. Although with standard anomaly detection methods it is usually assumed that instances are independent and identically distributed, in many real-world applications, instances are often explicitly connected with each other, resulting in so-called attributed graphs. The proposed method embeds nodes (instances) on the attributed graph in the latent space by taking into account their attributes as well as the graph structure based on graph convolutional networks (GCNs). To learn node embeddings specialized for anomaly detection, in which there is a class imbalance due to the rarity of anomalies, the parameters of a GCN are trained to minimize the volume of a hypersphere that encloses the node embeddings of normal instances while embedding anomalous ones outside the hypersphere. This enables us to detect anomalies by simply calculating the distances between the node embeddings and hypersphere center. The proposed method can effectively propagate label information on a small amount of nodes to unlabeled ones by taking into account the node's attributes, graph structure, and class imbalance. In experiments with five real-world attributed graph datasets, we demonstrate that the proposed method achieves better performance than various existing anomaly detection methods.