论文标题
SZX:科学数据集的超快速遇到错误的有损压缩机
SZx: an Ultra-fast Error-bounded Lossy Compressor for Scientific Datasets
论文作者
论文摘要
当今的科学高性能计算(HPC)应用程序或高级工具正在跨广泛的域中生成大量数据,这给数据传输和存储带来了严重的负担。遇到错误的有损耗压缩已被开发并广泛用于科学界,因为它不仅可以显着减少数据量,而且还可以严格根据使用指定的误差绑定来严格控制数据失真。但是,现有的损耗压缩机无法提供超快速的压缩速度,这是很多应用程序或用例(例如内存压缩和在线仪器数据压缩)高度要求的。在本文中,我们提出了一种新型的超快速误差的有损压缩机,该压缩机可以在CPU和GPU上获得相当高的压缩性能,并且具有相当高的压缩比。主要贡献是三倍:(1)我们提出了一种新型的,通用的超快速误差损失的压缩框架(称为UFZ),即仅将我们的设计仅由超轻量级操作组成,例如位和加法/减法操作,仍然保持一定的高压缩比。 (2)我们在CPU和GPU上实施UFZ,并根据其架构仔细优化性能。 (3)我们对CPU和GPU上的6个实际生产级科学数据集进行了全面的评估。实验表明,就压缩和减压而言,UFZ的速度与CPU和GPU上的第二(第二(SZ)和GPU上的第二(SZ SZ或ZFP)相比,UFZ的速度为2〜16倍。
Today's scientific high performance computing (HPC) applications or advanced instruments are producing vast volumes of data across a wide range of domains, which introduces a serious burden on data transfer and storage. Error-bounded lossy compression has been developed and widely used in scientific community, because not only can it significantly reduce the data volumes but it can also strictly control the data distortion based on the use-specified error bound. Existing lossy compressors, however, cannot offer ultra-fast compression speed, which is highly demanded by quite a few applications or use-cases (such as in-memory compression and online instrument data compression). In this paper, we propose a novel ultra-fast error-bounded lossy compressor, which can obtain fairly high compression performance on both CPU and GPU, also with reasonably high compression ratios. The key contributions are three-fold: (1) We propose a novel, generic ultra-fast error-bounded lossy compression framework -- called UFZ, by confining our design to be composed of only super-lightweight operations such as bitwise and addition/subtraction operation, still keeping a certain high compression ratio. (2) We implement UFZ on both CPU and GPU and optimize the performance according to their architectures carefully. (3) We perform a comprehensive evaluation with 6 real-world production-level scientific datasets on both CPU and GPU. Experiments show that UFZ is 2~16X as fast as the second-fastest existing error-bounded lossy compressor (either SZ or ZFP) on CPU and GPU, with respect to both compression and decompression.