高度可扩展的任务分组，用于预测表观遗传事件的深度多任务学习

论文标题

高度可扩展的任务分组，用于预测表观遗传事件的深度多任务学习

Highly Scalable Task Grouping for Deep Multi-Task Learning in Prediction of Epigenetic Events

论文作者

Shiri, Mohammad, Sun, Jiangwen

论文摘要

经过培训的用于预测DNA序列细胞事件的深层神经网络已成为新兴工具，以帮助阐明全基因组关联研究中鉴定的关联的生物学机制。为了增强培训，在以前的工作中通常利用了多任务学习（MTL），在这些工作中，需要训练的网络来使事件模态或单元格类型不同。所有现有作品均采用一个简单的MTL框架，在该框架中所有任务都共享一个单个特征提取网络。这种策略即使在一定程度上有效地导致了大量的负转移，这意味着存在大部分任务的存在，而通过MTL获得的模型的表现比单个任务学习的模型要差。已经开发了解决其他域中这种负转移的方法，例如计算机视觉。但是，这些方法通常很难扩展以处理大量任务。在本文中，我们提出了一个高度可扩展的任务分组框架，以通过可能彼此有益的共同培训任务来解决负面转移。所提出的方法利用了与任务特定分类头相关的网络权重，可以通过一次性的所有任务进行联合培训来廉价获得。我们使用由367个表观遗传概况组成的数据集的结果证明了该方法的有效性及其优于基线方法。

Deep neural networks trained for predicting cellular events from DNA sequence have become emerging tools to help elucidate the biological mechanism underlying the associations identified in genome-wide association studies. To enhance the training, multi-task learning (MTL) has been commonly exploited in previous works where trained networks were needed for multiple profiles differing in either event modality or cell type. All existing works adopted a simple MTL framework where all tasks share a single feature extraction network. Such a strategy even though effective to certain extent leads to substantial negative transfer, meaning the existence of large portion of tasks for which models obtained through MTL perform worse than those by single task learning. There have been methods developed to address such negative transfer in other domains, such as computer vision. However, these methods are generally difficult to scale up to handle large amount of tasks. In this paper, we propose a highly scalable task grouping framework to address negative transfer by only jointly training tasks that are potentially beneficial to each other. The proposed method exploits the network weights associated with task specific classification heads that can be cheaply obtained by one-time joint training of all tasks. Our results using a dataset consisting of 367 epigenetic profiles demonstrate the effectiveness of the proposed approach and its superiority over baseline methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题