Dimension Reduction for Classification with Gene Expression Microarray Data

Jian J. Dai, University of California, Davis
Linh Lieu, University of California, Los Angeles
David Rocke, University of California, Davis

Abstract

An important application of gene expression microarray data is classification of biological samples or prediction of clinical and other outcomes. One necessary part of multivariate statistical analysis in such applications is dimension reduction. This paper provides a comparison study of three dimension reduction techniques, namely partial least squares (PLS), sliced inverse regression (SIR) and principal component analysis (PCA), and evaluates the relative performance of classification procedures incorporating those methods. A five-step assessment procedure is designed for the purpose. Predictive accuracy and computational efficiency of the methods are examined. Two gene expression data sets for tumor classification are used in the study.

Submitted: April 13, 2005 · Accepted: December 31, 2005 · Published: February 24, 2006

Recommended Citation

Dai, Jian J.; Lieu, Linh; and Rocke, David (2006) "Dimension Reduction for Classification with Gene Expression Microarray Data," Statistical Applications in Genetics and Molecular Biology: Vol. 5 : Iss. 1, Article 6.
Available at: http://www.bepress.com/sagmb/vol5/iss1/art6

Readers' Reactions

Anne-Laure Boulesteix, Reader's Reaction to "Dimension Reduction for Classification with Gene Expression Microarray Data" by Dai et al (2006) (June 2006)

 
 
 
 

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb