基于多重检测的多重实例学习网络，用于整个幻灯片图像分类

论文标题

基于多重检测的多重实例学习网络，用于整个幻灯片图像分类

Multiplex-detection Based Multiple Instance Learning Network for Whole Slide Image Classification

论文作者

Wang, Zhikang, Bi, Yue, Pan, Tong, Wang, Xiaoyu, Bain, Chris, Bassed, Richard, Imoto, Seiya, Yao, Jianhua, Song, Jiangning

论文摘要

多个实例学习（MIL）是对诊断病理学的整个幻灯片图像（WSI）进行分类的强大方法。 MIL对WSI分类的基本挑战是发现触发袋子标签的\ textit {critical Instances}。但是，先前的方法主要是在独立且相同的分布假设（\ textit {i.i.d}）下设计的，忽略了肿瘤实例或异质性之间的相关性。在本文中，我们提出了一种基于多重多重检测的新型多重实例学习（MDMIL）来解决上述问题。具体而言，MDMIL由内部查询产生模块（IQGM）和多重检测模块（MDM）构建，并在训练过程中基于内存的对比损失的辅助。首先，IQGM给出了实例的概率，并通过在分布分析后汇总高度可靠的功能来为随后的MDM生成内部查询（IQ）。其次，MDM中的多路复用检测交叉注意（MDCA）和多头自我注意力（MHSA）合作以生成WSI的最终表示形式。在此过程中，智商和可训练的变异查询（VQ）成功建立了实例之间的联系，并显着提高了模型对异质肿瘤的鲁棒性。最后，为了进一步在特征空间中执行限制并稳定训练过程，我们采用了基于内存的对比损失，即使在每次迭代中有一个样本作为输入，对于WSI分类也是可行的。我们对三个计算病理数据集进行实验，例如Camelyon16，TCGA-NSCLC和TCGA-RCC数据集。优越的准确性和AUC证明了我们提出的MDMIL优于其他最先进的方法。

Multiple instance learning (MIL) is a powerful approach to classify whole slide images (WSIs) for diagnostic pathology. A fundamental challenge of MIL on WSI classification is to discover the \textit{critical instances} that trigger the bag label. However, previous methods are primarily designed under the independent and identical distribution hypothesis (\textit{i.i.d}), ignoring either the correlations between instances or heterogeneity of tumours. In this paper, we propose a novel multiplex-detection-based multiple instance learning (MDMIL) to tackle the issues above. Specifically, MDMIL is constructed by the internal query generation module (IQGM) and the multiplex detection module (MDM) and assisted by the memory-based contrastive loss during training. Firstly, IQGM gives the probability of instances and generates the internal query (IQ) for the subsequent MDM by aggregating highly reliable features after the distribution analysis. Secondly, the multiplex-detection cross-attention (MDCA) and multi-head self-attention (MHSA) in MDM cooperate to generate the final representations for the WSI. In this process, the IQ and trainable variational query (VQ) successfully build up the connections between instances and significantly improve the model's robustness toward heterogeneous tumours. At last, to further enforce constraints in the feature space and stabilize the training process, we adopt a memory-based contrastive loss, which is practicable for WSI classification even with a single sample as input in each iteration. We conduct experiments on three computational pathology datasets, e.g., CAMELYON16, TCGA-NSCLC, and TCGA-RCC datasets. The superior accuracy and AUC demonstrate the superiority of our proposed MDMIL over other state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题