论文标题
HALO 1.0:一个硬件敏捷的加速器编排框架,用于启用硬件 - 敏捷编程,具有真实的性能可移植性,用于异质HPC
HALO 1.0: A Hardware-agnostic Accelerator Orchestration Framework for Enabling Hardware-agnostic Programming with True Performance Portability for Heterogeneous HPC
论文作者
论文摘要
本文介绍了Halo 1.0,这是一种开放式的可扩展多代理软件框架,该框架实现了一组建议的硬件 - 敏捷加速器编排(HALO)原理。 Halo实现了一个新颖的以计算为中心的消息传递接口(C^2MPI)规范,以启用跨异构加速器的硬件 - 敏捷主机应用程序的性能便携式执行。 The experiment results of evaluating eight widely used HPC subroutines based on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 allows for a unified control flow for host programs to run across all the computing devices with a consistently top performance portability score, which is up to five orders of magnitude higher than the OpenCL-based 解决方案。
This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implements a set of proposed hardware-agnostic accelerator orchestration (HALO) principles. HALO implements a novel compute-centric message passing interface (C^2MPI) specification for enabling the performance portable execution of a hardware-agnostic host application across heterogeneous accelerators. The experiment results of evaluating eight widely used HPC subroutines based on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 allows for a unified control flow for host programs to run across all the computing devices with a consistently top performance portability score, which is up to five orders of magnitude higher than the OpenCL-based solution.