Search
- Browse Authors in the U.C. Berkeley Division of Biostatistics Working Paper Series
Notification
Most popular papers
COBRA Notification
Most Popular Papers
Institutions: Join COBRA
About COBRA
- Multiple Testing Methods For ChIP-Chip High Density Oligonucleotide Array Data
-
- Sunduz Keles, Dept. of Statistics & Biostatistics & Medical Informatics, University of Wisconsin, Madison
- Mark J. van der Laan, Division of Biostatistics, School of Public Health, University of California, Berkeley
- Sandrine Dudoit, Division of Biostatistics, School of Public Health, University of California, Berkeley
- Simon E. Cawley, Affymetrix, 3380 Central Expressway, Santa Clara, CA 95051
-
Download the Paper
Forward to a colleague
- Article comments:
- Published 2006 in
Journal of Computational Biology,
13(3), 579-613.
- Abstract:
- Cawley et al. (2004) have recently mapped the locations of binding
sites for three transcription factors along human chromosomes 21
and 22 using ChIP-Chip experiments. ChIP-Chip experiments are a
new approach to the genome-wide identification of transcription
factor binding sites and consist of chromatin (Ch)
immunoprecipitation (IP) of transcription factor-bound genomic
DNA followed by high density oligonucleotide hybridization (Chip)
of the IP-enriched DNA. We investigate the ChIP-Chip data
structure and propose methods for inferring the location of
transcription factor binding sites from these data. The proposed
methods involve testing for each probe whether it is part of a
bound sequence or not using a scan statistic that takes into
account the spatial structure of the data. Different multiple
testing procedures are considered for controlling the family-wise
error rate and false discovery rate. A nested-Bonferroni
adjustment, that is more powerful than the traditional Bonferroni
adjustment when the test statistics are dependent, is discussed.
Simulation studies show that taking into account the spatial
structure of the data substantially improves the sensitivity of
the multiple testing procedures. Application of the proposed
methods to ChIP-Chip data for transcription factor p53 identified
many potential target binding regions along human chromosomes 21
and 22. Among these identified regions, 18% fall within a 3kb
vicinity of the 5'UTR of a known gene or CpG island, 31% fall
between the codon start site and the codon end site of a known
gene but not inside an exon. More than half of these potential
target sequences contain the p53 consensus binding site or very
close matches to it. Moreover, these target segments include the
13 experimentally verified p53 binding regions of
Cawley et al. (2004), as well as 49 additional regions that show
higher hybridization signal than these 13 experimentally verified
regions.
- Subject Area:
- Epidemiology, Medical Specialties, Statistical Theory and Methods
- Suggested Citation:
- Sunduz Keles, Mark J. van der Laan, Sandrine Dudoit, and Simon E. Cawley,
"Multiple Testing Methods For ChIP-Chip High Density Oligonucleotide Array Data"
(June 2004).
U.C. Berkeley Division of Biostatistics Working Paper Series.
Working Paper 147.
http://www.bepress.com/ucbbiostat/paper147