PLS Dimension Reduction for Classification with Microarray Data

Anne-Laure Boulesteix, Department of Statistics, University of Munich

Abstract

Partial Least Squares (PLS) dimension reduction is known to give good prediction accuracy in the context of classification with high-dimensional microarray data. In this paper, the classification procedure consisting of PLS dimension reduction and linear discriminant analysis on the new components is compared with some of the best state-of-the-art classification methods. Moreover, a boosting algorithm is applied to this classification method. In addition, a simple procedure to choose the number of PLS components is suggested. The connection between PLS dimension reduction and gene selection is examined and a property of the first PLS component for binary classification is proved. In addition, we show how PLS can be used for data visualization using real data. The whole study is based on 9 real microarray cancer data sets.

Submitted: July 9, 2004 · Accepted: November 1, 2004 · Published: November 23, 2004

Recommended Citation

Boulesteix, Anne-Laure (2004) "PLS Dimension Reduction for Classification with Microarray Data," Statistical Applications in Genetics and Molecular Biology: Vol. 3 : Iss. 1, Article 33.
Available at: http://www.bepress.com/sagmb/vol3/iss1/art33

 
 
 
 

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb