Quantifying the Association between Gene Expressions and DNA-Markers by Penalized Canonical Correlation Analysis

Sandra Waaijenborg, Academic Medical Center / University of Amsterdam
Philip C. Verselewel de Witt Hamer, Academic Medical Center / University of Amsterdam
Aeilko H. Zwinderman, Academic Medical Center / University of Amsterdam

Abstract

Multiple changes at the DNA level are at the basis of complex diseases. Identifying the genetic networks that are influenced by these changes might help in understanding the development of these diseases. Canonical correlation analysis is used to associate gene expressions with DNA-markers and thus reveals sets of co-expressed and co-regulated genes and their associating DNA-markers. However, when the number of variables gets high, e.g. in the case of microarray studies, interpretation of these results can be difficult. By adapting the elastic net to canonical correlation analysis the number of variables reduces, and interpretation becomes easier, moreover, due to the grouping effect of the elastic net co-regulated and co-expressed genes cluster. Additionally, our adaptation works well in situations where the number of variables exceeds by far the number of subjects.

Submitted: October 12, 2007 · Accepted: December 5, 2007 · Published: January 23, 2008

Recommended Citation

Waaijenborg, Sandra; Verselewel de Witt Hamer, Philip C.; and Zwinderman, Aeilko H. (2008) "Quantifying the Association between Gene Expressions and DNA-Markers by Penalized Canonical Correlation Analysis," Statistical Applications in Genetics and Molecular Biology: Vol. 7 : Iss. 1, Article 3.
Available at: http://www.bepress.com/sagmb/vol7/iss1/art3

 
 
 
 

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb