论文标题

高质量的分层过程映射

High-Quality Hierarchical Process Mapping

论文作者

Faraj, Marcelo Fonseca, van der Grinten, Alexander, Meyerhenke, Henning, Träff, Jesper Larsson, Schulz, Christian

论文摘要

将图形分配到大小相等的块中,以至于在平行计算机上处​​理图时,在块之间几乎不需要边缘。当已知分布式系统的拓扑结构是一项重要任务时,将分区的块映射到处理器上,以便降低整体通信成本。我们提出了整合图形分区和过程映射的新型多级算法。我们算法的重要成分包括快速标签传播,更本地化的本地搜索,初始分区以及压缩数据结构以计算处理器距离而无需存储距离矩阵。实验表明,我们的算法加快了整体映射过程,并且由于集成的多级方法,在实践中也找到了更好的解决方案。例如,我们的算法的一种配置在映射质量方面,在映射质量方面,同时更快地绘制了62个。与当前最快的迭代多级映射算法苏格兰人相比,我们获得了16%的更好的解决方案,同时投资略高的运行时间。

Partitioning graphs into blocks of roughly equal size such that few edges run between blocks is a frequently needed operation when processing graphs on a parallel computer. When a topology of a distributed system is known an important task is then to map the blocks of the partition onto the processors such that the overall communication cost is reduced. We present novel multilevel algorithms that integrate graph partitioning and process mapping. Important ingredients of our algorithm include fast label propagation, more localized local search, initial partitioning, as well as a compressed data structure to compute processor distances without storing a distance matrix. Experiments indicate that our algorithms speed up the overall mapping process and, due to the integrated multilevel approach, also find much better solutions in practice. For example, one configuration of our algorithm yields better solutions than the previous state-of-the-art in terms of mapping quality while being a factor 62 faster. Compared to the currently fastest iterated multilevel mapping algorithm Scotch, we obtain 16% better solutions while investing slightly more running time.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源