论文标题

Scichain:值得信赖的科学数据出处

SciChain: Trustworthy Scientific Data Provenance

论文作者

Al-Mamun, Abdullah, Zhao, Dongfang

论文摘要

审核和复制高性能计算(HPC)系统的最新技术是通过数据出处子系统。尽管数据出处的最新进展在于降低性能开销并提高用户的查询灵活性,但通常会忽略数据出处的保真度:没有一种方法可以确保尚未制造或伪造出来源数据本身。本文主张利用区块链提供不可能的自主数据出处服务,使科学数据值得信赖。对HPC采用区块链的挑战包括设计与HPC平台兼容的新区块链体系结构,更重要的是,在区块链上进行科学应用程序的一组新的共识协议。为此,我们设计了可观的可追溯性(POST)协议,并在区块链原型(即Scichain)中实现了它,即HPC的第一个区块链系统。我们通过将Scichain与多个最新系统进行比较来评估Scichain;实验结果表明,Scichain保证了值得信赖的数据,同时降低了较低的开销顺序。

The state-of-the-art for auditing and reproducing scientific applications on high-performance computing (HPC) systems is through a data provenance subsystem. While recent advances in data provenance lie in reducing the performance overhead and improving the user's query flexibility, the fidelity of data provenance is often overlooked: there is no such a way to ensure that the provenance data itself has not been fabricated or falsified. This paper advocates to leverage blockchains to deliver immutable and autonomous data provenance services such that scientific data are trustworthy. The challenges for adopting blockchains to HPC include designing a new blockchain architecture compatible with the HPC platforms and, more importantly, a set of new consensus protocols for scientific applications atop blockchains. To this end, we have designed the proof-of-scalable-traceability (POST) protocol and implemented it in a blockchain prototype, namely SciChain, the very first blockchain system for HPC. We evaluated SciChain by comparing it with multiple state-of-the-art systems; Experimental results showed that SciChain guaranteed trustworthy data while incurring orders of magnitude lower overhead.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源