论文标题
COVID-19中的胸部X射线图像在平面和分层分类方案上的识别
COVID-19 identification in chest X-ray images on flat and hierarchical classification scenarios
论文作者
论文摘要
COVID-19可能会引起严重的肺炎,据估计对医疗保健系统产生很大影响。肺炎的标准图像诊断测试是胸部X射线(CXR)和计算机断层扫描(CT)扫描。 CXR之所以有用,是因为它比CT便宜,更快,更广泛。这项研究旨在鉴定其他类型的Covid-19引起的肺炎,仅使用CXR图像健康肺。为了实现目标,我们已经提出了一个分类模式,考虑了多类和分层的观点,因为肺炎可以作为层次结构结构。鉴于该域中的自然数据不平衡,我们还提出了重新采样算法的使用,以重新平衡类分布。我们的分类模式提取功能使用一些众所周知的纹理描述符,并使用预先训练的CNN模型。我们还探索了早期和晚期的融合技术,以便同时利用多个纹理描述符和基本分类器的强度。为了评估该方法,我们组成了一个名为RYDLS-20的数据库,其中包含由不同病原体引起的肺炎的CXR图像以及健康肺的CXR图像。类分布遵循现实情况,其中某些病原体比其他病原体更常见。所提出的方法在层次分类方案中,使用多级方法的宏观AVG F1得分为0.65,F1得分为0.89。据我们所知,我们在不平衡的环境中获得了三个以上类别的环境中获得Covid-19识别获得的最佳名义率。我们还必须强调针对此任务的新颖提出的分层分类方法,该方法考虑了由不同的病原体引起的肺炎类型,并使我们达到了此处获得的最佳Covid-19识别率。
The COVID-19 can cause severe pneumonia and is estimated to have a high impact on the healthcare system. The standard image diagnosis tests for pneumonia are chest X-ray (CXR) and computed tomography (CT) scan. CXR are useful in because it is cheaper, faster and more widespread than CT. This study aims to identify pneumonia caused by COVID-19 from other types and also healthy lungs using only CXR images. In order to achieve the objectives, we have proposed a classification schema considering the multi-class and hierarchical perspectives, since pneumonia can be structured as a hierarchy. Given the natural data imbalance in this domain, we also proposed the use of resampling algorithms in order to re-balance the classes distribution. Our classification schema extract features using some well-known texture descriptors and also using a pre-trained CNN model. We also explored early and late fusion techniques in order to leverage the strength of multiple texture descriptors and base classifiers at once. To evaluate the approach, we composed a database, named RYDLS-20, containing CXR images of pneumonia caused by different pathogens as well as CXR images of healthy lungs. The classes distribution follows a real-world scenario in which some pathogens are more common than others. The proposed approach achieved a macro-avg F1-Score of 0.65 using a multi-class approach and a F1-Score of 0.89 for the COVID-19 identification in the hierarchical classification scenario. As far as we know, we achieved the best nominal rate obtained for COVID-19 identification in an unbalanced environment with more than three classes. We must also highlight the novel proposed hierarchical classification approach for this task, which considers the types of pneumonia caused by the different pathogens and lead us to the best COVID-19 recognition rate obtained here.