论文标题

在多元深度统计概念中选择

Choosing among notions of multivariate depth statistics

论文作者

Mosler, Karl, Mozharovskyi, Pavlo

论文摘要

经典的多元统计量通过其与均值的距离衡量了一个点的遥远性,这是基于数据的平均值和协方差矩阵。多元深度函数是一个函数,鉴于D空间中的一个点和分布,在0到1之间的核心度中,同时满足了有关不变性,单调性,凸度和连续性的某些假设。因此,文献中已经提出了许多多元深度的概念,其中一些概念也与极其遥不可及的数据相当强大。与古典的玛哈拉诺省距离偏离的距离并非没有成本。不变性,鲁棒性和计算可行性之间存在权衡。在过去的几年中,已经构建了有效的精确算法以及近似算法,并以R包装提供。因此,在实际应用中,由于计算限制,深度统计量的选择不再仅限于一个或两个概念。相反,更多的观念是可行的,其中研究人员必须决定。文章辩论了此选择的理论和实践方面,包括不变性和唯一性,鲁棒性和计算可行性。比较精确算法的复杂性和速度。讨论了诸如随机Tukey深度之类的近似方法的准确性以及对大型和高维数据的应用。不久将解决与局部和功能深度的扩展以及与回归深度的连接。

Classical multivariate statistics measures the outlyingness of a point by its Mahalanobis distance from the mean, which is based on the mean and the covariance matrix of the data. A multivariate depth function is a function which, given a point and a distribution in d-space, measures centrality by a number between 0 and 1, while satisfying certain postulates regarding invariance, monotonicity, convexity and continuity. Accordingly, numerous notions of multivariate depth have been proposed in the literature, some of which are also robust against extremely outlying data. The departure from classical Mahalanobis distance does not come without cost. There is a trade-off between invariance, robustness and computational feasibility. In the last few years, efficient exact algorithms as well as approximate ones have been constructed and made available in R-packages. Consequently, in practical applications the choice of a depth statistic is no more restricted to one or two notions due to computational limits; rather often more notions are feasible, among which the researcher has to decide. The article debates theoretical and practical aspects of this choice, including invariance and uniqueness, robustness and computational feasibility. Complexity and speed of exact algorithms are compared. The accuracy of approximate approaches like the random Tukey depth is discussed as well as the application to large and high-dimensional data. Extensions to local and functional depths and connections to regression depth are shortly addressed.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源