Prediction of Motifs Based on a Repeated-Measures Model for Integrating Cross-Species Sequence and Expression Data

Elizabeth A. Siewert, University of Colorado, Denver
Katerina J. Kechris, University of Colorado, Denver

Abstract

De novo identification of transcription factor binding sites (TFBS) is a challenging computational problem because TFBSs are relatively short sequences buried in long genomic regions. Earlier methods incorporated genome-wide expression data and promoter sequences into a linear-model framework, regressing expression on counts of putative TFBSs in promoters for a single species. More recently, it has been shown that examining sequence data across multiple species improves the prediction of TFBSs. In this work, we describe an extension of the single-species, linear-model framework for the analysis of paired cross-species sequence and expression data. A repeated measures model for gene-expression measurements across species is used, accounting for phylogenetic relationships among species through the error covariance structure. This multiple-species algorithm is applied to a data set of four yeast species grown under heat-shock conditions and comparisons are made to the single species algorithm. Using evaluations based on transcription factor binding strength and an independent source of expression data, we find the multiple species results show an improvement in the prediction of TFBS.

Submitted: March 25, 2009 · Accepted: May 21, 2009 · Published: September 9, 2009

Recommended Citation

Siewert, Elizabeth A. and Kechris, Katerina J. (2009) "Prediction of Motifs Based on a Repeated-Measures Model for Integrating Cross-Species Sequence and Expression Data," Statistical Applications in Genetics and Molecular Biology: Vol. 8 : Iss. 1, Article 36.
DOI: 10.2202/1544-6115.1464
Available at: http://www.bepress.com/sagmb/vol8/iss1/art36

 
 
 
 

ISSN: 1544-6115 ©1999-2009 The Berkeley Electronic Press™ All rights reserved.

To submit, subscribe, recommend this journal to your library, or sign up for email alerts, please visit: http://www.bepress.com/sagmb