Accounting for Dependence in Similarity Data from DNA Fingerprinting

Graham Hepworth, The University of Melbourne
Ian R. Gordon, The University of Melbourne
Michael J. McCullough, The University of Melbourne

Abstract

Differentiating strains of a pathogen is often central to investigating its epidemiological aspects. The genetic similarity of a group of strains can be assessed by calculating a matrix of dissimilarities from their DNA fingerprinting profiles. The mean dissimilarity for each strain across other strains within the group is then used as an observation in a statistical analysis. These observations are not independent of each other, and so standard analysis techniques such as the t-test are inappropriate, because they underestimate the variance of the group means, and hence overstate the statistical significance of any differences. By examining the correlation between elements of the dissimilarity matrix, it is shown that the variance is underestimated by a factor of between about 2 and 4. Permutation tests are proposed as a way of addressing the problem of dependence, and are applied to a study of fluconazole resistance in Candida albicans.
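As an illustration only, the kind of permutation test described above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not necessarily the procedure used in the article: it assumes two groups coded 0 and 1, a symmetric dissimilarity matrix given as a list of lists, and a test statistic equal to the difference between the group means of the per-strain mean dissimilarities. The function names (within_group_means, group_difference, permutation_test) and the parameters n_perm and seed are hypothetical names introduced for this example.

```python
import random


def within_group_means(d, labels):
    """Mean dissimilarity of each strain to the other strains in its own group."""
    means = []
    for i, g in enumerate(labels):
        # Indices of the other strains sharing this strain's group label.
        others = [j for j, h in enumerate(labels) if h == g and j != i]
        means.append(sum(d[i][j] for j in others) / len(others))
    return means


def group_difference(d, labels):
    """Difference (group 1 minus group 0) in the mean of the per-strain means."""
    m = within_group_means(d, labels)
    g1 = [x for x, g in zip(m, labels) if g == 1]
    g0 = [x for x, g in zip(m, labels) if g == 0]
    return sum(g1) / len(g1) - sum(g0) / len(g0)


def permutation_test(d, labels, n_perm=2000, seed=0):
    """Two-sided Monte Carlo permutation p-value for the group difference.

    Assumption: group labels are exchangeable under the null hypothesis, so
    the labels are shuffled across strains and the statistic is recomputed
    from the same dissimilarity matrix each time.
    """
    rng = random.Random(seed)
    observed = group_difference(d, labels)
    shuffled = list(labels)
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        # Small tolerance so ties are not lost to floating-point rounding.
        if abs(group_difference(d, shuffled)) >= abs(observed) - 1e-12:
            extreme += 1
    # Count the observed labelling itself so the p-value is never zero.
    return observed, (extreme + 1) / (n_perm + 1)
```

Because the per-strain means are recomputed from the shuffled labels on every iteration, the reference distribution is built from the same dependent matrix entries as the observed statistic, which is the reason for preferring a permutation approach to a t-test here. Each group needs at least two strains for the within-group means to be defined.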

Submitted: February 28, 2006 · Accepted: November 23, 2006 · Published: January 15, 2007

Recommended Citation

Hepworth, Graham; Gordon, Ian R.; and McCullough, Michael J. (2007) "Accounting for Dependence in Similarity Data from DNA Fingerprinting," Statistical Applications in Genetics and Molecular Biology: Vol. 6 : Iss. 1, Article 1.
Available at: http://www.bepress.com/sagmb/vol6/iss1/art1

ISSN: 1544-6115 ©1999-2008 The Berkeley Electronic Press™ All rights reserved.
