IDF ++：分析和改善无损压缩的整数离散流

论文标题

IDF ++：分析和改善无损压缩的整数离散流

IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression

论文作者

Berg, Rianne van den, Gritsenko, Alexey A., Dehghani, Mostafa, Sønderby, Casper Kaae, Salimans, Tim

论文摘要

在本文中，我们分析并改善了无损压缩的整数离散流。整数离散流是最近提出的一类模型类，可以学习可逆变换的整数值随机变量。它们的离散性使它们特别适合通过熵编码方案无损压缩。我们首先研究了最近的理论主张，即离散随机变量的可逆流比其连续对应物的灵活性不那么灵活。我们证明，由于将数据嵌入有限的支持无限整数晶格中，因此该主张不适合整数离散流。此外，由于整数离散流中的直通估计量，我们缩小了梯度偏置的影响，并证明其影响高度依赖于架构选择，而不是以前认为的。最后，我们展示了不同的体系结构修改如何改善该模型类别的无损压缩的性能，并且它们还启用了更有效的压缩：流动层数量一半的模型与原始整数离散流模型相比或更好。

In this paper we analyse and improve integer discrete flows for lossless compression. Integer discrete flows are a recently proposed class of models that learn invertible transformations for integer-valued random variables. Their discrete nature makes them particularly suitable for lossless compression with entropy coding schemes. We start by investigating a recent theoretical claim that states that invertible flows for discrete random variables are less flexible than their continuous counterparts. We demonstrate with a proof that this claim does not hold for integer discrete flows due to the embedding of data with finite support into the countably infinite integer lattice. Furthermore, we zoom in on the effect of gradient bias due to the straight-through estimator in integer discrete flows, and demonstrate that its influence is highly dependent on architecture choices and less prominent than previously thought. Finally, we show how different architecture modifications improve the performance of this model class for lossless compression, and that they also enable more efficient compression: a model with half the number of flow layers performs on par with or better than the original integer discrete flow model.

下载PDF全文

下载文献需遵守相关版权规定

论文标题