论文标题
混乱挑战 - 合并(CT-MR)健康腹部器官分割
CHAOS Challenge -- Combined (CT-MR) Healthy Abdominal Organ Segmentation
论文作者
论文摘要
多年来,腹部器官的细分一直是一个全面但尚未解决的研究领域。在过去的十年中,深度学习的密集发展(DL)引入了新的最新细分系统。为了扩展这些主题的知识,与IEEE国际生物医学成像研讨会(ISBI)结合组织了混乱(CT -MR)健康的腹部器官分割挑战(ISBI),2019年,在意大利的威尼斯。混乱提供了来自健康受试者的腹部CT和MR数据,用于单腹和多个腹部器官分割。已经设计了五个不同但互补的任务,以从多个角度分析当前方法的功能。与手动注释和互动方法相比,对结果进行了彻底研究。 The analysis shows that the performance of DL models for single modality (CT / MR) can show reliable volumetric analysis performance (DICE: 0.98 $\pm$ 0.00 / 0.95 $\pm$ 0.01) but the best MSSD performance remain limited (21.89 $\pm$ 13.94 / 20.85 $\pm$ 10.63 mm).参与模型的表演对于肝脏的跨模式任务显着降低(骰子:0.88 $ \ pm $ 0.15 MSSD:36.33 $ \ pm $ \ pm $ 21.97毫米)和所有器官(骰子:0.85 $ \ $ 0.21 MSSD:0.21 MSSD:33.17 $ \ pm PM $ \ $ 38.93毫米)。尽管在不同的应用程序上进行了相反的示例,但与特定于器官特定的器官相比,旨在分割所有器官的多任务DL模型似乎更差(性能下降约为5 \%)。此外,这种进一步研究的跨模式分割的方向将显着支持现实世界中的临床应用。此外,本文拥有1500多名参与者,另一个重要的贡献是对挑战组织的缺点的分析,例如多种提交和窥视现象的影响。
Segmentation of abdominal organs has been a comprehensive, yet unresolved, research field for many years. In the last decade, intensive developments in deep learning (DL) have introduced new state-of-the-art segmentation systems. In order to expand the knowledge on these topics, the CHAOS - Combined (CT-MR) Healthy Abdominal Organ Segmentation challenge has been organized in conjunction with IEEE International Symposium on Biomedical Imaging (ISBI), 2019, in Venice, Italy. CHAOS provides both abdominal CT and MR data from healthy subjects for single and multiple abdominal organ segmentation. Five different but complementary tasks have been designed to analyze the capabilities of current approaches from multiple perspectives. The results are investigated thoroughly, compared with manual annotations and interactive methods. The analysis shows that the performance of DL models for single modality (CT / MR) can show reliable volumetric analysis performance (DICE: 0.98 $\pm$ 0.00 / 0.95 $\pm$ 0.01) but the best MSSD performance remain limited (21.89 $\pm$ 13.94 / 20.85 $\pm$ 10.63 mm). The performances of participating models decrease significantly for cross-modality tasks for the liver (DICE: 0.88 $\pm$ 0.15 MSSD: 36.33 $\pm$ 21.97 mm) and all organs (DICE: 0.85 $\pm$ 0.21 MSSD: 33.17 $\pm$ 38.93 mm). Despite contrary examples on different applications, multi-tasking DL models designed to segment all organs seem to perform worse compared to organ-specific ones (performance drop around 5\%). Besides, such directions of further research for cross-modality segmentation would significantly support real-world clinical applications. Moreover, having more than 1500 participants, another important contribution of the paper is the analysis on shortcomings of challenge organizations such as the effects of multiple submissions and peeking phenomena.