利用相关生存数据的信息提高Cox模型参数估计效率

Improving Estimation Efficiency of Cox Model via Using Other Related Survival Data

  • 摘要: Cox模型是生存分析中使用非常广泛的半参数回归模型, 其回归参数的极大部分似然估计具有相合性、渐近正态性及有效性等优良性质. 本文首次给出一种利用相关生存数据的信息提高Cox模型参数估计效率的方法, 利用著名的WLW (1989, JASA)边际比例风险模型及构造独特的回归参数估计方程对参数估计进行提高效率的研究. WLW模型在建模时对生存时间之间的相依结构不进行模型假定, 所收集到的数据可以方便地用WLW模型进行刻画, 然而直接由WLW方法进行参数估计无法达到提高估计效率的目的, 本文在Yang (2000)和Cui (2004)的基础上, 利用基于分割的方法, 在一定的最优准则下对生存时间进行``分割重组''构造出优良的估计方程, 求得的参数估计充分利用了相关信息, 由所提取的辅助相依信息提高了参数估计的效率. 模拟研究表明, 在生存时间之间具有一定相依性的情形下, 方法在提高估计效率方面有良好表现.

     

    Abstract: Cox model is the most popular model in modelling the relationship between a survival outcome and predictive covariates, and has been gotten great success about regression modelling survival data. It is well known that the maximum partial likelihood estimates of regression parameters of Cox model are consistent, asymptotically normal and semiparametrically efficient. In this paper, based on marginal proportional hazards model and a partitioning-based method, we develop an approach to improve estimation efficiency of regression parameters in Cox model through introducing some other handy or easily collected survival data. Practically, for each subject, there frequently exist some other possibly-multivariate survival data available in addition to the main endpoint of survival times, which are easily collected or handy, and belong to the same subject or group as the survival data of interest. All the data construct multivariate survival data, and the famous WLW model, one important marginal proportional hazards model of multivariate survival data proposed in 1989 by Wei, Lin and Weissfels (1989), is the model of natural choice to regression modelling the aforementioned multivariate survival data. But, as pointed out in this paper, by making direct use of the WLW method , the estimation efficiency of regression parameters of interest cannot be improved. Based on the partitioning-based method for WLW model, an approach to improve the estimation of regression parameter of Cox model is proposed and discussed. Simulation studies are conducted to investigate behavior of the proposed approach under practical sample size. Our results show that it performs well, only if the constructed multivariate survival data are correlated between the survival data of interest and the introduced survival data, even for small to moderate sample size.

     

/

返回文章
返回