论文标题

通过有条件的Martingale差异差异进行高维变量筛选

High-dimensional Variable Screening via Conditional Martingale Difference Divergence

论文作者

Fang, Lei, Yuan, Qingcong, Yin, Xiangrong, Ye, Chenglong

论文摘要

可变筛选一直是一个有用的研究领域,涉及超高维数据。当对响应的边缘和共同依赖的预测都存在时,现有方法(例如条件筛选或迭代筛查)通常会分别遭受条件集或计算负担的选择。在本文中,我们提出了一项新的独立措施,称为条件性martingale差异差异(CMDH),可以将其视为条件或边际独立性措施。在规律性条件下,我们表明CMDH的确定筛选属性均具有边缘和共同活性变量。基于此措施,我们提出了一种基于内核的无模型可变筛选方法,该方法具有高效,灵活且稳定,以与响应的预测变量和异质性之间的高相关性。此外,我们还提供了一个数据驱动的方法来选择条件集。在模拟和实际数据应用中,我们证明了该方法的出色性能。

Variable screening has been a useful research area that deals with ultrahigh-dimensional data. When there exist both marginally and jointly dependent predictors to the response, existing methods such as conditional screening or iterative screening often suffer from instability against the selection of the conditional set or the computational burden, respectively. In this article, we propose a new independence measure, named conditional martingale difference divergence (CMDH), that can be treated as either a conditional or a marginal independence measure. Under regularity conditions, we show that the sure screening property of CMDH holds for both marginally and jointly active variables. Based on this measure, we propose a kernel-based model-free variable screening method, which is efficient, flexible, and stable against high correlation among predictors and heterogeneity of the response. In addition, we provide a data-driven method to select the conditional set. In simulations and real data applications, we demonstrate the superior performance of the proposed method.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源