Maximum Likelihood for Genome Phylogeny on Gene Content

Hongmei Zhang, University of West Florida
Xun Gu, Iowa State University

Abstract

With the rapid growth of whole-genome data, reconstructing the phylogenetic relationships among genomes has become a hot topic in comparative genomics. The maximum likelihood approach is one of several available approaches and has been very successful. However, no study has reported its application to genome tree-making, mainly because of the lack of an analytical form of a probability model and/or the heavy computational burden. In this paper we study the mathematical structure of the stochastic model of genome evolution and develop a simplified likelihood function for observing a specific phylogenetic pattern in the four-genome case using gene content information. We then use the maximum likelihood approach to identify phylogenetic trees. Simulation results indicate that the proposed method works well and identifies the correct tree at a high rate. An application to real data gives satisfactory results. The approach developed in this paper can serve as the basis for reconstructing phylogenies of more than four genomes.

Submitted: April 27, 2004 · Accepted: September 12, 2004 · Published: November 14, 2004

Recommended Citation

Zhang, Hongmei and Gu, Xun (2004) "Maximum Likelihood for Genome Phylogeny on Gene Content," Statistical Applications in Genetics and Molecular Biology: Vol. 3 : Iss. 1, Article 31.
Available at: http://www.bepress.com/sagmb/vol3/iss1/art31

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.
