Bayesian Statistical Studies of the Ramachandran Distribution

Alexander Pertsemlidis, UT Southwestern Medical Center
Jan Zelinka, UT Southwestern Medical Center
John W. Fondon III, UT Southwestern Medical Center
R. Keith Henderson, UT Southwestern Medical Center
Zbyszek Otwinowski, UT Southwestern Medical Center

Abstract

We describe a method for generating knowledge-based potentials and apply it to the observed torsional angles of known protein structures. The potential is derived using Bayesian reasoning and can serve as a prior for further such reasoning in the presence of additional data. It takes the form of a probability density function described by a small number of coefficients, with the number of necessary coefficients determined by tests based on statistical significance and entropy. We demonstrate the method by deriving one such potential in two dimensions, corresponding to the Ramachandran plot. In contrast to traditional histogram-based methods, the function is continuous and differentiable. These properties allow us to use the function as a force term in the energy minimization of appropriately described structures. The method can easily be extended to other observable angles and higher dimensions, or to include sequence dependence, and should find applications in structure determination and validation.
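To illustrate why a continuous, differentiable density can act as a force term, the following is a minimal sketch and not the authors' implementation. It assumes, purely for illustration, that the log-density over the two backbone angles is a short trigonometric series; the abstract does not specify the functional form. The names COEFFS, energy, energy_gradient, and descend are hypothetical, and the coefficient values are invented rather than fitted to any structural data.

```python
import math

# Hypothetical coefficients (m, n, a, b): each term contributes
# a*cos(m*phi + n*psi) + b*sin(m*phi + n*psi) to the log-density.
# Values are illustrative only, not fitted to protein structures.
COEFFS = [
    (1, 0, 0.8, -0.3),
    (0, 1, -0.5, 0.4),
    (1, 1, 0.2, 0.1),
    (1, -1, -0.3, 0.25),
]


def energy(phi, psi, coeffs=COEFFS):
    """Energy E = -log p(phi, psi), up to an additive constant (angles in radians)."""
    total = 0.0
    for m, n, a, b in coeffs:
        arg = m * phi + n * psi
        total += a * math.cos(arg) + b * math.sin(arg)
    return -total


def energy_gradient(phi, psi, coeffs=COEFFS):
    """Analytic gradient (dE/dphi, dE/dpsi); the force is its negative."""
    d_phi = 0.0
    d_psi = 0.0
    for m, n, a, b in coeffs:
        arg = m * phi + n * psi
        # Derivative of -(a*cos(arg) + b*sin(arg)) with respect to arg.
        common = a * math.sin(arg) - b * math.cos(arg)
        d_phi += m * common
        d_psi += n * common
    return d_phi, d_psi


def descend(phi, psi, steps=50, rate=0.1, coeffs=COEFFS):
    """Plain steepest descent on E, standing in for an energy minimizer."""
    for _ in range(steps):
        g_phi, g_psi = energy_gradient(phi, psi, coeffs)
        phi -= rate * g_phi
        psi -= rate * g_psi
    return phi, psi


if __name__ == "__main__":
    start = (0.5, -1.0)
    end = descend(*start)
    print("start energy:", energy(*start))
    print("end energy:  ", energy(*end))
```

A histogram-based potential has no well-defined gradient at bin boundaries, whereas the series above is smooth and 2π-periodic in both angles, so the same function supplies both the energy and the force at any point of the plot.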

Submitted: June 17, 2005 · Accepted: September 29, 2005 · Published: November 22, 2005

Recommended Citation

Pertsemlidis, Alexander; Zelinka, Jan; Fondon, John W. III; Henderson, R. Keith; and Otwinowski, Zbyszek (2005) "Bayesian Statistical Studies of the Ramachandran Distribution," Statistical Applications in Genetics and Molecular Biology: Vol. 4 : Iss. 1, Article 35.
Available at: http://www.bepress.com/sagmb/vol4/iss1/art35

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.
