Borrowing Information across Populations in Estimating Positive and Negative Predictive Values
The full text of this version of the working paper is not currently available online.
Abstract:

A marker's capacity to predict risk of a disease depends on disease prevalence in the target population and its classification accuracy, i.e. its ability to discriminate diseased subjects from non-diseased subjects. The latter is often considered an intrinsic property of the marker; it is independent of disease prevalence and hence more likely to be similar across populations than risk prediction measures. In this paper, we are interested in evaluating population-specific performance of a risk prediction marker in terms of positive predictive value (PPV) and negative predictive value (NPV) at given thresholds, when samples are available from the target population as well as from another population. A default strategy is to estimate PPV and NPV using samples from the target population only. However, when the marker's classification accuracy as characterized by a specific point on the ROC curve is invariant across populations, borrowing information across populations allows increased efficiency in estimating PPV and NPV. We develop estimators that optimally combine information across populations. We illustrate this methodology in a retrospective setting by evaluating PCA3 as a risk prediction marker for prostate cancer among subjects with or without initial biopsy.

Subject Area:
General Biostatistics
Suggested Citation:
Ying Huang, Ziding Feng, and Youyi Fong, "Borrowing Information across Populations in Estimating Positive and Negative Predictive Values" (October 16, 2008). UW Biostatistics Working Paper Series. Working Paper 339.
http://www.bepress.com/uwbiostat/paper339