Assessing the Validity Domains of Graphical Gaussian Models in Order to Infer Relationships among Components of Complex Biological Systems

Fanny Villers, INRA Jouy-En-Josas
Brigitte Schaeffer, INRA Jouy-En-Josas
Caroline Bertin, INRA Jouy-En-Josas
Sylvie Huet, INRA Jouy-En-Josas

Abstract

The study of the interactions of cellular components is an essential base step to understand the structure and dynamics of biological networks. Various methods were recently developed for this purpose. While most of them combine different types of data and a priori knowledge, methods based on graphical Gaussian models are capable of learning the network directly from raw data. They consider the full-order partial correlations which are partial correlations between two variables given the remaining ones, for modeling direct links between variables. Statistical methods were developed for estimating these links when the number of observations is larger than the number of variables. However, the rapid advance of new technologies that allow the simultaneous measure of genome expression, led to large-scale datasets where the number of variables is far larger than the number of observations. To get around this dimensionality problem, different strategies and new statistical methods were proposed. In this study we focused on statistical methods recently published. All are based on the fact that the number of direct relationships between two variables is very small in regards to the number of possible relationships, p(p-1)/2. In the biological context, this assumption is not always satisfied over the whole graph. It is essential to precisely know the behavior of the methods in regards to the characteristics of the studied object before applying them. For this purpose, we evaluated the validity domain of each method from wide-ranging simulated datasets. We then illustrated our results using recently published biological data.

Erratum

This article was originally published on September 11, 2008, with the title: ``Assessing the Validity Domains of Graphical Gaussian Models in Order."

The title was corrected to: ``Assessing the Validity Domains of Graphical Gaussian Models in Order to Infer Relationships among Components of Complex Biological Systems" on November 3, 2008.

Submitted: March 14, 2008 · Accepted: July 29, 2008 · Published: September 11, 2008

Recommended Citation

Villers, Fanny; Schaeffer, Brigitte; Bertin, Caroline; and Huet, Sylvie (2008) "Assessing the Validity Domains of Graphical Gaussian Models in Order to Infer Relationships among Components of Complex Biological Systems," Statistical Applications in Genetics and Molecular Biology: Vol. 7 : Iss. 2, Article 14.
DOI: 10.2202/1544-6115.1371
Available at: http://www.bepress.com/sagmb/vol7/iss2/art14

 
 
 
 

ISSN: 1544-6115 ©1999-2009 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb