Maximum Likelihood for Genome Phylogeny on Gene Content
Abstract
With the rapid growth of entire genome data, reconstructing the phylogenetic relationship among different genomes has become a hot topic in comparative genomics. Maximum likelihood approach is one of the various approaches, and has been very successful. However, there is no reported study for any applications in the genome tree-making mainly due to the lack of an analytical form of a probability model and/or the complicated calculation burden. In this paper we studied the mathematical structure of the stochastic model of genome evolution, and then developed a simplified likelihood function for observing a specific phylogenetic pattern under four genome situation using gene content information. We use the maximum likelihood approach to identify phylogenetic trees. Simulation results indicate that the proposed method works well and can identify trees with a high correction rate. Real data application provides satisfied results. The approach developed in this paper can serve as the basis for reconstructing phylogenies of more than four genomes.Submitted: April 27, 2004 · Accepted: September 12, 2004 · Published: November 14, 2004
Recommended Citation
Zhang, Hongmei and Gu, Xun
(2004)
"Maximum Likelihood for Genome Phylogeny on Gene Content,"
Statistical Applications in Genetics and Molecular Biology:
Vol. 3
:
Iss.
1, Article 31.
Available at: http://www.bepress.com/sagmb/vol3/iss1/art31
