论文标题
使用消息传递和多源相似性特征来预测circrna-disease关联的图形卷积网络
Graph Convolution Networks Using Message Passing and Multi-Source Similarity Features for Predicting circRNA-Disease Association
论文作者
论文摘要
图可用于有效地表示复杂的数据结构。在图中学习这些不规则数据是具有挑战性的,仍然受到浅层学习的困扰。在图形上应用深度学习最近在社会分析,生物信息学等的许多应用中都表现出良好的性能。消息传递图卷积网络是一种强大的方法,具有能够学习图形结构的表现力。同时,circrna是一种非编码RNA,在人类疾病中起着至关重要的作用。确定circrNA和疾病之间的关联对于诊断和治疗复杂疾病很重要。但是,它们之间的已知关联数量有限,并且进行生物学实验以确定新的关联是耗时且昂贵的。结果,需要建立有效且可行的计算方法来预测潜在的circrna-disese酶关联。在本文中,我们提出了一个新型的图形卷积网络框架,以从具有多源相似性信息的图中学习特征,以预测circrna-disese sease关联。首先,我们使用Circrna相似性,疾病和Circrna Gaussian相互作用曲线(GIP)内核相似性的多源信息,以使用第一图卷积提取特征。然后,我们预测每个circrna的疾病关联与第二幅卷积。对各种实验进行了五倍交叉验证的拟议框架显示了预测Circrna-疾病关联的有希望的结果,并且表现优于其他现有方法。
Graphs can be used to effectively represent complex data structures. Learning these irregular data in graphs is challenging and still suffers from shallow learning. Applying deep learning on graphs has recently showed good performance in many applications in social analysis, bioinformatics etc. A message passing graph convolution network is such a powerful method which has expressive power to learn graph structures. Meanwhile, circRNA is a type of non-coding RNA which plays a critical role in human diseases. Identifying the associations between circRNAs and diseases is important to diagnosis and treatment of complex diseases. However, there are limited number of known associations between them and conducting biological experiments to identify new associations is time consuming and expensive. As a result, there is a need of building efficient and feasible computation methods to predict potential circRNA-disease associations. In this paper, we propose a novel graph convolution network framework to learn features from a graph built with multi-source similarity information to predict circRNA-disease associations. First we use multi-source information of circRNA similarity, disease and circRNA Gaussian Interaction Profile (GIP) kernel similarity to extract the features using first graph convolution. Then we predict disease associations for each circRNA with second graph convolution. Proposed framework with five-fold cross validation on various experiments shows promising results in predicting circRNA-disease association and outperforms other existing methods.