论文标题
参数比例危害回归模型的强大假设检验和模型选择
Robust Hypothesis Testing and Model Selection for Parametric Proportional Hazard Regression Models
论文作者
论文摘要
半参数COX比例危害回归模型已在多个应用科学中广泛使用。但是,如果适当地假设,完全参数比例危害模型通常会导致更有效的推断。为了解决在此类完全参数比例危险模型下在数据中存在异常值的传统最大似然估计器的极端非稳定性,最近已经提出了一个可靠的估计程序,该程序在此设置下扩展了最小密度差异估计器(MDPDE)的概念。在本文中,我们考虑了参数比例危害模型下的统计推断问题,并使用MDPDES开发了强大的WALD型假设测试和模型选择程序。我们还得出了必要的渐近结果,该结果用于构建一般复合假说的测试程序并研究其渐近力。理论上通过适当的影响函数分析对声称的鲁棒性特性进行了研究。我们通过广泛的模拟研究了提出的基于MDPDE的WALD类型测试的有限样本水平和功率,在这些模拟中还与现有的半参数方法进行了比较。还讨论了选择适当的鲁棒性调整参数的重要问题。最终通过三个有趣的真实数据示例来说明了提出的可靠测试和模型选择程序的实际实用性。
The semi-parametric Cox proportional hazards regression model has been widely used for many years in several applied sciences. However, a fully parametric proportional hazards model, if appropriately assumed, can often lead to more efficient inference. To tackle the extreme non-robustness of the traditional maximum likelihood estimator in the presence of outliers in the data under such fully parametric proportional hazard models, a robust estimation procedure has recently been proposed extending the concept of the minimum density power divergence estimator (MDPDE) under this set-up. In this paper, we consider the problem of statistical inference under the parametric proportional hazards model and develop robust Wald-type hypothesis testing and model selection procedures using the MDPDEs. We have also derived the necessary asymptotic results which are used to construct the testing procedure for general composite hypothesis and study its asymptotic powers. The claimed robustness properties are studied theoretically via appropriate influence function analyses. We have studied the finite sample level and power of the proposed MDPDE based Wald type test through extensive simulations where comparisons are also made with the existing semi-parametric methods. The important issue of the selection of appropriate robustness tuning parameter is also discussed. The practical usefulness of the proposed robust testing and model selection procedures is finally illustrated through three interesting real data examples.