论文标题
隐私保护高斯流程回归 - 一种模块化方法的同构加密方法
Privacy-Preserving Gaussian Process Regression -- A Modular Approach to the Application of Homomorphic Encryption
论文作者
论文摘要
大部分机器学习依赖于使用大量数据来训练模型来做出预测。当这些数据来自多个来源时,例如,当对机器学习模型的数据评估作为服务时,可能会出现隐私问题和有关数据共享的法律问题。完全同态加密(FHE)允许在加密时在加密时计算数据,这可以为数据隐私问题提供解决方案。但是,FHE既缓慢又限制,因此必须操纵现有的算法以使其在FHE范式下有效地工作。一些常用的机器学习算法,例如高斯过程回归,非常适合FHE,并且不能被操纵以有效,准确地工作。在本文中,我们表明一种模块化方法仅适用于需要保护的工作流程的敏感步骤,它允许一个方使用从另一方的数据构建的高斯流程回归模型对其数据进行预测,而无需任何一方访问对方的数据,以准确且高效的方式访问对方的数据。据我们所知,这种结构是有效加密高斯过程的第一个例子。
Much of machine learning relies on the use of large amounts of data to train models to make predictions. When this data comes from multiple sources, for example when evaluation of data against a machine learning model is offered as a service, there can be privacy issues and legal concerns over the sharing of data. Fully homomorphic encryption (FHE) allows data to be computed on whilst encrypted, which can provide a solution to the problem of data privacy. However, FHE is both slow and restrictive, so existing algorithms must be manipulated to make them work efficiently under the FHE paradigm. Some commonly used machine learning algorithms, such as Gaussian process regression, are poorly suited to FHE and cannot be manipulated to work both efficiently and accurately. In this paper, we show that a modular approach, which applies FHE to only the sensitive steps of a workflow that need protection, allows one party to make predictions on their data using a Gaussian process regression model built from another party's data, without either party gaining access to the other's data, in a way which is both accurate and efficient. This construction is, to our knowledge, the first example of an effectively encrypted Gaussian process.