Asymptotic Optimality of Likelihood-Based Cross-Validation

Mark J. van der Laan, Division of Biostatistics, School of Public Health, University of California, Berkeley
Sandrine Dudoit, Division of Biostatistics, School of Public Health, University of California, Berkeley
Sunduz Keles, Division of Biostatistics, School of Public Health, University of California, Berkeley

Abstract

Likelihood-based cross-validation is a statistical tool for selecting a density estimate, based on n i.i.d. observations from the true density, among a collection of candidate density estimators. General examples are the selection of a model indexing a maximum likelihood estimator, and the selection of a bandwidth indexing a nonparametric (e.g. kernel) density estimator. In this article, we establish a finite sample result for a general class of likelihood-based cross-validation procedures (as indexed by the type of sample splitting used, e.g. V-fold cross-validation). This result implies that the cross-validation selector performs asymptotically as well (with respect to the Kullback-Leibler distance to the true density) as a benchmark model selector which is optimal for each given dataset and depends on the true density. Crucial conditions of our theorem are that the size of the validation sample converges to infinity, which excludes leave-one-out cross-validation, and that the candidate density estimates are bounded away from zero and infinity. We illustrate these asymptotic results and the practical performance of likelihood-based cross-validation for the purpose of bandwidth selection with a simulation study. Moreover, we use likelihood-based cross-validation in the context of regulatory motif detection in DNA sequences.
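
As a rough illustration of the bandwidth-selection use case mentioned in the abstract, the following is a minimal sketch of V-fold likelihood-based cross-validation for a Gaussian kernel density estimator. It is not the authors' implementation: the function names (gaussian_kde_pdf, cv_log_likelihood, select_bandwidth), the choice of a Gaussian kernel, the default of 5 folds, and the numerical floor on the density are all assumptions made for this example. The floor is only a numerical guard against log(0); it loosely echoes, but does not enforce, the theorem's condition that candidate densities are bounded away from zero.

```python
import numpy as np


def gaussian_kde_pdf(train, x, h):
    """Gaussian kernel density estimate fitted on `train`, evaluated at points `x`."""
    z = (x[:, None] - train[None, :]) / h
    kernel_sum = np.exp(-0.5 * z ** 2).sum(axis=1)
    return kernel_sum / (len(train) * h * np.sqrt(2.0 * np.pi))


def cv_log_likelihood(data, h, n_folds=5, seed=0, floor=1e-12):
    """Average validation log-likelihood of bandwidth `h` under V-fold splitting.

    `floor` (an assumption of this sketch) keeps the logarithm finite when a
    validation point receives essentially zero estimated density.
    """
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(data)), n_folds)
    total = 0.0
    for val_idx in folds:
        train_mask = np.ones(len(data), dtype=bool)
        train_mask[val_idx] = False  # hold out the validation fold
        dens = gaussian_kde_pdf(data[train_mask], data[val_idx], h)
        total += np.log(np.maximum(dens, floor)).sum()
    return total / len(data)


def select_bandwidth(data, candidates, n_folds=5, seed=0):
    """Return the candidate bandwidth with the largest cross-validated log-likelihood."""
    scores = [cv_log_likelihood(data, h, n_folds, seed) for h in candidates]
    return candidates[int(np.argmax(scores))], scores
```

For example, given a one-dimensional NumPy array data, calling select_bandwidth(data, [0.05, 0.1, 0.2, 0.4, 0.8]) returns the selected bandwidth together with the cross-validated score of each candidate. Maximizing the validation log-likelihood corresponds to minimizing an empirical version of the Kullback-Leibler distance to the true density, up to a term that does not depend on the candidate.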

Submitted: December 11, 2003 · Accepted: March 3, 2004 · Published: March 22, 2004

Recommended Citation

van der Laan, Mark J.; Dudoit, Sandrine; and Keles, Sunduz (2004) "Asymptotic Optimality of Likelihood-Based Cross-Validation," Statistical Applications in Genetics and Molecular Biology: Vol. 3 : Iss. 1, Article 4.
DOI: 10.2202/1544-6115.1036
Available at: http://www.bepress.com/sagmb/vol3/iss1/art4

ISSN: 1544-6115 ©1999-2010 The Berkeley Electronic Press™ All rights reserved.
