Examining Protein Structure and Similarities by Spectral Analysis Technique

Krista Collins, Dalhousie University
Hong Gu, Dalhousie University
Chris Field, Dalhousie University

Abstract

The spectral envelope, a frequency based technique for analyzing categorical time series, is applied to amino acid sequences to examine their periodicity. The periodic signatures of such sequences is related to the secondary structure of the folding patterns in the gene. For a pair of sequences, we define a spectral envelope covariance which emphasizes the common periodicities in the two sequences. This is used to give a similarity measure for the two sequences which can then be used in a neighbour joining algorithm to construct a phylogeny. We apply the spectral methods to myoglobin sequences from primates and cetaceans. The spectral envelope reflects the structure of this protein and the tree constructed using spectral methods shows strong agreement with published trees.

The spectral envelope can be used to explore similarities between and within different protein families. Since we do not require aligned sequences, the spectral methods can be used to create phylogenies across different protein families. We apply the method to 11 protein families from PANDIT obtaining a tree where the families are separated and the relationship among the families is given.

Submitted: May 12, 2006 · Accepted: August 17, 2006 · Published: September 17, 2006

Recommended Citation

Collins, Krista; Gu, Hong; and Field, Chris (2006) "Examining Protein Structure and Similarities by Spectral Analysis Technique," Statistical Applications in Genetics and Molecular Biology: Vol. 5 : Iss. 1, Article 23.
Available at: http://www.bepress.com/sagmb/vol5/iss1/art23

 
 
 
 

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb