Statistical Methods for Identifying Conserved Residues in Multiple Sequence Alignment

Virpi Ahola, University of Turku
Tero Aittokallio, University of Turku
Esa Uusipaikka, University of Turku
Mauno Vihinen, University of Tampere

Abstract

The assessment of residue conservation in a multiple sequence alignment is a central issue in bioinformatics. Conserved residues and regions are used to determine structural and functional motifs or evolutionary relationships between the sequences of a multiple sequence alignment. For this reason, residue conservation is a valuable measure for database and motif search or for estimating the quality of alignments. In this paper, we present statistical methods for identifying conserved residues in multiple sequence alignments. While most earlier studies examine the positional conservation of the alignment, we focus on the detection of individual conserved residues at a position. The major advantages of multiple comparison methods originate from their ability to select conserved residues simultaneously and to consider the variability of the residue estimates. Large-scale simulations were used for the comparative analysis of the methods. Practical performance was studied by comparing the structurally and functionally important residues of Src homology 2 (SH2) domains to the assignments of the conservation indices. The applicability of the indices was also compared in three additional protein families comprising different degrees of entropy and variability in alignment positions. The results indicate that statistical multiple comparison methods are sensitive and reliable in identifying conserved residues.

Submitted: July 7, 2004 · Accepted: September 21, 2004 · Published: October 30, 2004

Recommended Citation

Ahola, Virpi; Aittokallio, Tero; Uusipaikka, Esa; and Vihinen, Mauno (2004) "Statistical Methods for Identifying Conserved Residues in Multiple Sequence Alignment," Statistical Applications in Genetics and Molecular Biology: Vol. 3 : Iss. 1, Article 28.
Available at: http://www.bepress.com/sagmb/vol3/iss1/art28

 
 
 
 

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb