图形卷积网络的可区分图形模块（DGM）

论文标题

图形卷积网络的可区分图形模块（DGM）

Differentiable Graph Module (DGM) for Graph Convolutional Networks

论文作者

Kazi, Anees, Cosmo, Luca, Ahmadi, Seyed-Ahmad, Navab, Nassir, Bronstein, Michael

论文摘要

Graph Deep Leadme最近已成为一种强大的ML概念，允许将成功的深神经体系结构推广到非欧几里得结构化数据。这种方法已显示出从社会科学，生物医学和粒子物理学到计算机视觉，图形和化学的广泛应用中的有希望的结果。大多数当前图神经网络体系结构的局限性之一是它们通常仅限于转导设置，并依赖于基础图为{\ em已知}和{\ em固定}的假设。通常，此假设不是正确的，因为该图可能是嘈杂的，或者部分甚至完全未知的。在这种情况下，直接从数据中推断出图将很有帮助，尤其是在训练时图中不存在一些节点的归纳环境中。此外，学习图可能本身就是目的，因为推断的结构可以提供下游任务旁边的互补见解。在本文中，我们引入了可靠的图形模块（DGM），这是一个可学习的功能，可预测图形中的边缘概率，最适合下游任务。 DGM可以与卷积图神经网络层结合使用，并以端到端的方式进行训练。我们提供了来自医疗保健领域（疾病预测），大脑成像（年龄预测），计算机图形（3D点云分段）和计算机视觉（零照片学习）的应用程序的广泛评估。我们表明，我们的模型在转导和归纳环境中都对基线的基准有了显着改善，并取得了最先进的结果。

Graph deep learning has recently emerged as a powerful ML concept allowing to generalize successful deep neural architectures to non-Euclidean structured data. Such methods have shown promising results on a broad spectrum of applications ranging from social science, biomedicine, and particle physics to computer vision, graphics, and chemistry. One of the limitations of the majority of current graph neural network architectures is that they are often restricted to the transductive setting and rely on the assumption that the underlying graph is {\em known} and {\em fixed}. Often, this assumption is not true since the graph may be noisy, or partially and even completely unknown. In such cases, it would be helpful to infer the graph directly from the data, especially in inductive settings where some nodes were not present in the graph at training time. Furthermore, learning a graph may become an end in itself, as the inferred structure may provide complementary insights next to the downstream task. In this paper, we introduce Differentiable Graph Module (DGM), a learnable function that predicts edge probabilities in the graph which are optimal for the downstream task. DGM can be combined with convolutional graph neural network layers and trained in an end-to-end fashion. We provide an extensive evaluation of applications from the domains of healthcare (disease prediction), brain imaging (age prediction), computer graphics (3D point cloud segmentation), and computer vision (zero-shot learning). We show that our model provides a significant improvement over baselines both in transductive and inductive settings and achieves state-of-the-art results.

下载PDF全文

下载文献需遵守相关版权规定

论文标题