论文标题

贝叶斯特征分配模型,用于使用细胞术数据识别细胞亚群

A Bayesian Feature Allocation Model for Identification of Cell Subpopulations Using Cytometry Data

论文作者

Lui, Arthur, Lee, Juhee, Thall, Peter F., Daher, May, Rezvani, Katy, Barar, Rafet

论文摘要

提出了一种贝叶斯特征分配模型(FAM),用于鉴定基于细胞表面或细胞内标记表达水平数据的多个细胞亚群,该数据通过飞行时间(CYTOF)获得的细胞仪获得。细胞亚群的特征是制造商的表达模式差异,并且基于其观察到的表达水平的模式将单个细胞聚集到亚群中。有限的印度自助餐过程用于将亚群建模为潜在特征,并且基于这些潜在特征亚群的基于模型的方法用于在每个样品中构造细胞簇。通过定义静态缺失的数据机制来解释由于质量细胞仪工具中技术伪像而导致的不可显着的丢失数据。与基于观察到的标记表达水平分别应用于不同样品的常规细胞聚类方法相反,基于FAM的方法可以同时应用于多个样品,并且可以鉴定出常规聚类可能遗漏的重要细胞亚群。提出的基于FAM的方法用于共同分析Cytof生成的三个数据集,以研究天然杀手(NK)细胞。由于FAM确定的亚群可以定义新的NK细胞子集,因此该统计分析可能会提供有关NK细胞生物学及其在癌症免疫疗法中的潜在作用的有用信息,这可能导致改善细胞疗法的发展。还提供了对拟议方法的行为进行的模拟研究,还提出了两种已知亚群的情况,然后分析了NK细胞表面标记数据的分析。

A Bayesian feature allocation model (FAM) is presented for identifying cell subpopulations based on multiple samples of cell surface or intracellular marker expression level data obtained by cytometry by time of flight (CyTOF). Cell subpopulations are characterized by differences in expression patterns of makers, and individual cells are clustered into the subpopulations based on the patterns of their observed expression levels. A finite Indian buffet process is used to model subpopulations as latent features, and a model-based method based on these latent feature subpopulations is used to construct cell clusters within each sample. Non-ignorable missing data due to technical artifacts in mass cytometry instruments are accounted for by defining a static missing data mechanism. In contrast to conventional cell clustering methods based on observed marker expression levels that are applied separately to different samples, the FAM based method can be applied simultaneously to multiple samples, and can identify important cell subpopulations likely to be missed by conventional clustering. The proposed FAM based method is applied to jointly analyze three datasets, generated by CyTOF, to study natural killer (NK) cells. Because the subpopulations identified by the FAM may define novel NK cell subsets, this statistical analysis may provide useful information about the biology of NK cells and their potential role in cancer immunotherapy which may lead, in turn, to development of improved cellular therapies. Simulation studies of the proposed method's behavior under two cases of known subpopulations also are presented, followed by analysis of the CyTOF NK cell surface marker data.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源