论文标题
自动疾病诊断的分层增强学习
Hierarchical Reinforcement Learning for Automatic Disease Diagnosis
论文作者
论文摘要
动机:以疾病诊断为导向的对话系统模型的交互式咨询程序作为马尔可夫决策过程和强化学习算法用于解决该问题。现有方法通常采用平坦的政策结构,该结构平均治疗所有症状和疾病进行行动。在动作空间很小的简单情况下,该策略效果很好,但是,在实际环境中,其效率将受到挑战。受到离线咨询过程的启发,我们建议将两个级别的分层政策结构整合到对话系统中的政策学习中。高级政策由负责触发低级模型的AmasterModel组成,低级政策由几个症状检查器和疾病分类器组成。提出的政策结构能够处理包括大量疾病和症状在内的诊断问题。 结果:三个现实世界数据集和合成数据集的实验结果表明,与现有系统相比,我们的分层框架在疾病诊断中获得了更高的准确性和症状回忆。我们构建一个基准,包括数据集和实施现有算法以鼓励后续研究。 可用性:代码和数据可从https://github.com/fudandisc/discopen-medbox-dialodiagnosis获得 联系人:[email protected] 补充信息:可以在线生物信息学上获得补充数据。
Motivation: Disease diagnosis oriented dialogue system models the interactive consultation procedure as Markov Decision Process and reinforcement learning algorithms are used to solve the problem. Existing approaches usually employ a flat policy structure that treat all symptoms and diseases equally for action making. This strategy works well in the simple scenario when the action space is small, however, its efficiency will be challenged in the real environment. Inspired by the offline consultation process, we propose to integrate a hierarchical policy structure of two levels into the dialogue systemfor policy learning. The high-level policy consists of amastermodel that is responsible for triggering a low-levelmodel, the lowlevel policy consists of several symptom checkers and a disease classifier. The proposed policy structure is capable to deal with diagnosis problem including large number of diseases and symptoms. Results: Experimental results on three real-world datasets and a synthetic dataset demonstrate that our hierarchical framework achieves higher accuracy and symptom recall in disease diagnosis compared with existing systems. We construct a benchmark including datasets and implementation of existing algorithms to encourage follow-up researches. Availability: The code and data is available from https://github.com/FudanDISC/DISCOpen-MedBox-DialoDiagnosis Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.