论文标题
现代Hopfield网络和免疫曲目分类的关注
Modern Hopfield Networks and Attention for Immune Repertoire Classification
论文作者
论文摘要
机器学习的中心机制是识别,存储和识别模式。如何学习,访问和检索此类模式在Hopfield网络和最新的变压器体系结构中至关重要。我们表明,变压器体系结构的注意机制实际上是现代Hopfield网络的更新规则,可以呈指数级的模式。我们利用现代Hopfield网络的这种高存储能力来解决计算生物学中具有挑战性的多重实例学习(MIL)问题:免疫曲目分类。解决此问题的准确且可解释的机器学习方法可以为新的疫苗和疗法铺平道路,这是目前由COVID-19危机加强的非常相关的研究主题。基于个体的大量免疫试剂的免疫曲目分类是一个MIL问题,存在前所未有的大量实例,两个数量级比当前认为的问题大,并且证人率极低。在这项工作中,我们介绍了我们的新方法DEEPRC,该方法将类似变形金刚的关注或等效现代的Hopfield网络整合到大量MIL(例如免疫曲目分类)的深度学习体系中。我们证明,DEEPRC在大规模实验(包括模拟和现实世界的病毒感染数据)上的预测性能方面优于所有其他方法,并可以提取与给定疾病类别相关的序列基序。源代码和数据集:https://github.com/ml-jku/deeprc
A central mechanism in machine learning is to identify, store, and recognize patterns. How to learn, access, and retrieve such patterns is crucial in Hopfield networks and the more recent transformer architectures. We show that the attention mechanism of transformer architectures is actually the update rule of modern Hopfield networks that can store exponentially many patterns. We exploit this high storage capacity of modern Hopfield networks to solve a challenging multiple instance learning (MIL) problem in computational biology: immune repertoire classification. Accurate and interpretable machine learning methods solving this problem could pave the way towards new vaccines and therapies, which is currently a very relevant research topic intensified by the COVID-19 crisis. Immune repertoire classification based on the vast number of immunosequences of an individual is a MIL problem with an unprecedentedly massive number of instances, two orders of magnitude larger than currently considered problems, and with an extremely low witness rate. In this work, we present our novel method DeepRC that integrates transformer-like attention, or equivalently modern Hopfield networks, into deep learning architectures for massive MIL such as immune repertoire classification. We demonstrate that DeepRC outperforms all other methods with respect to predictive performance on large-scale experiments, including simulated and real-world virus infection data, and enables the extraction of sequence motifs that are connected to a given disease class. Source code and datasets: https://github.com/ml-jku/DeepRC