Numerical Solutions for Patterns Statistics on Markov Chains

Gregory Nuel, Laboratoire Statistique et Genome, CNRS (8071), INRA (1152), UEVE, Evry, France

Abstract

We propose here a review of the methods available to compute pattern statistics on text generated by a Markov source. Theoretical, but also numerical aspects are detailed for a wide range of techniques (exact, Gaussian, large deviations, binomial and compound Poisson). The SPatt package (Statistics for Pattern, free software available at http://stat.genopole.cnrs.fr/spatt) implementing all these methods is then used to compare all these approaches in terms of computational time and reliability in the most complete pattern statistics benchmark available at the present time.

Submitted: April 7, 2006 · Accepted: July 12, 2006 · Published: October 17, 2006

Recommended Citation

Nuel, Gregory (2006) "Numerical Solutions for Patterns Statistics on Markov Chains," Statistical Applications in Genetics and Molecular Biology: Vol. 5 : Iss. 1, Article 26.
DOI: 10.2202/1544-6115.1219
Available at: http://www.bepress.com/sagmb/vol5/iss1/art26

 
 
 
 

ISSN: 1544-6115 ©1999-2009 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb