Search
- Browse Authors in the U.C. Berkeley Division of Biostatistics Working Paper Series
Notification
Most popular papers
COBRA Notification
Most Popular Papers
Institutions: Join COBRA
About COBRA
- Identification of Regulatory Elements Using A Feature Selection Method
-
- Sunduz Keles, Department of Statistics, University of Wisconsin, Madison
- Mark J. van der Laan, Division of Biostatistics, School of Public Health, University of California, Berkeley
- Michael B. Eisen, Dept. of Molecular and Cell Biology, University of California, Berkeley
-
Download the Paper
Forward to a colleague
- Published 2002 in
Bioinformatics,
18, pp. 1167-1175.
- Abstract:
- Many methods have been described to identify regulatory motifs in
the transcription control regions of genes that exhibit similar
patterns of gene expression across a variety of experimental
conditions. Here we focus on a single experimental condition, and
utilize gene expression data to identify sequence motifs
associated with genes that are activated under this experimental
condition. We use a linear model with two way interactions to
model gene expression as a function of sequence features (words)
present in presumptive transcription control regions. The most
relevant features are selected by a feature selection method
called stepwise selection with monte carlo cross validation. We
apply this method to a publicly available dataset of the yeast
Saccharomyces cerevisiae, focussing on the 800 basepairs
immediately upstream of each gene's translation start site (the
upstream control region (UCR)). We successfully identify
regulatory motifs that are known to be active under the
experimental conditions analyzed, and find additional significant
sequences that may represent novel regulatory motifs. We also
discuss a complementary method that utilizes gene expression data
from a single microarray experiment and allows averaging over
variety of experimental conditions as an alternative to motif
finding methods that act on clusters of co-expressed genes.
- Subject Area:
- Human Genetics, Microarrays, Statistical Models
- Suggested Citation:
- Sunduz Keles, Mark J. van der Laan, and Michael B. Eisen,
"Identification of Regulatory Elements Using A Feature Selection Method"
(September 2001).
U.C. Berkeley Division of Biostatistics Working Paper Series.
Working Paper 98.
http://www.bepress.com/ucbbiostat/paper98