论文标题
在超图中,我们何时需要高阶信息?关于超越预测的案例研究
How Much and When Do We Need Higher-order Information in Hypergraphs? A Case Study on Hyperedge Prediction
论文作者
论文摘要
HyperGraphs提供了一种自然的表示群体关系的方式,其复杂性激发了一系列先前的工作,以采用某种形式的抽象和简化高阶相互作用。但是,以下问题尚未解决:群体互动的抽象足以解决超盖任务,以及在数据集中的这种结果如何变化?这个问题如果正确回答,则提供了一个有用的工程指南,以解决如何在解决下游任务的复杂性和准确性之间进行贸易。为此,我们提出了一种使用n-projected图的概念逐渐代表组相互作用的方法,该图的累积包含有关最多n向相互作用的信息,并量化求解任务的准确性,因为n为各种数据集增长。作为下游任务,我们考虑了HyperEdge预测,即链接预测的扩展,这是评估图形模型的规范任务。通过在15个现实世界数据集上的实验,我们绘制以下消息:(a)回报减少:小N足以达到与近乎完美近似值相当的精确度,(b)故障排除:随着任务变得更具挑战性,更大的n带来了更大的好处,并且(c)不可减少的性能:与互动相互作用时,相互作用却不多地效果差异,将其效果差不多。
Hypergraphs provide a natural way of representing group relations, whose complexity motivates an extensive array of prior work to adopt some form of abstraction and simplification of higher-order interactions. However, the following question has yet to be addressed: How much abstraction of group interactions is sufficient in solving a hypergraph task, and how different such results become across datasets? This question, if properly answered, provides a useful engineering guideline on how to trade off between complexity and accuracy of solving a downstream task. To this end, we propose a method of incrementally representing group interactions using a notion of n-projected graph whose accumulation contains information on up to n-way interactions, and quantify the accuracy of solving a task as n grows for various datasets. As a downstream task, we consider hyperedge prediction, an extension of link prediction, which is a canonical task for evaluating graph models. Through experiments on 15 real-world datasets, we draw the following messages: (a) Diminishing returns: small n is enough to achieve accuracy comparable with near-perfect approximations, (b) Troubleshooter: as the task becomes more challenging, larger n brings more benefit, and (c) Irreducibility: datasets whose pairwise interactions do not tell much about higher-order interactions lose much accuracy when reduced to pairwise abstractions.