Cox Survival Analysis of Microarray Gene Expression Data Using Correlation Principal Component Regression

Qiang Zhao, Texas State University
Jianguo Sun, University of Missouri-Columbia

Abstract

Statistical analysis of microarray gene expression data has recently attracted a great deal of attention. One problem of interest is to relate genes to survival outcomes of patients with the purpose of building regression models for the prediction of future patients' survival based on their gene expression data. For this, several authors have discussed the use of the proportional hazards or Cox model after reducing the dimension of the gene expression data. This paper presents a new approach to conduct the Cox survival analysis of microarray gene expression data with the focus on models' predictive ability. The method modifies the correlation principal component regression (Sun, 1995) to handle the censoring problem of survival data. The results based on simulated data and a set of publicly available data on diffuse large B-cell lymphoma show that the proposed method works well in terms of models' robustness and predictive ability in comparison with some existing partial least squares approaches. Also, the new approach is simpler and easy to implement.

Submitted: May 5, 2005 · Accepted: January 24, 2007 · Published: May 29, 2007

Recommended Citation

Zhao, Qiang and Sun, Jianguo (2007) "Cox Survival Analysis of Microarray Gene Expression Data Using Correlation Principal Component Regression," Statistical Applications in Genetics and Molecular Biology: Vol. 6 : Iss. 1, Article 16.
Available at: http://www.bepress.com/sagmb/vol6/iss1/art16

 
 
 
 

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb