18 - Pooled prevalence estimates are biased!

For all of the frequentist methods for estimating prevalence from pooled samples, the estimated prevalence (p) is an upwardly biased estimator of the true prevalence. The bias is smaller with lower prevalence, a larger number of pools (m), smaller pool size (k) and larger total sample size (n). Bias also increases as the probability that all pools will test positive increases.
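For the simplest case, the bias can be calculated exactly rather than guessed at. The sketch below assumes a fixed pool size, a perfect test and the standard maximum-likelihood estimator 1 - (1 - x/m)^(1/k), where x is the number of positive pools. The function names are illustrative and are not taken from the calculator itself.

```python
from math import comb


def pooled_mle(x, m, k):
    """Estimated prevalence from x positive pools out of m pools of size k.

    Assumes a perfect test. When every pool is positive (x == m) the
    estimate is 1, which is the main source of upward bias.
    """
    return 1.0 - (1.0 - x / m) ** (1.0 / k)


def exact_bias(p, m, k):
    """Expected estimate minus the true prevalence p.

    The number of positive pools is binomial(m, pi), so the expectation
    is a finite sum over x = 0..m.
    """
    pi = 1.0 - (1.0 - p) ** k  # probability that a single pool tests positive
    expected = sum(
        comb(m, x) * pi**x * (1.0 - pi) ** (m - x) * pooled_mle(x, m, k)
        for x in range(m + 1)
    )
    return expected - p


if __name__ == "__main__":
    # Compare a few designs at two prevalence levels.
    for p in (0.01, 0.1):
        for m, k in ((10, 10), (40, 10), (40, 5)):
            print(f"p={p} m={m} k={k} bias={exact_bias(p, m, k):.5f}")
```

Under these assumptions the bias is always positive, because the estimator is a convex function of the proportion of positive pools.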

In general, bias is negligible for m > 30. Even for quite small values of m and k, however, the bias may still be small, particularly if p is low.

The estimated prevalence (p) is also sensitive to (and may be biased by) errors in the assumptions of perfect test sensitivity or specificity. p is particularly sensitive to errors in the assumed sensitivity when prevalence is high or when k is too large. Clustering or overdispersion of positive individuals in the sampled population can also result in substantial bias in prevalence estimates.
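The effect of a wrong sensitivity assumption can be illustrated without any sampling. The sketch below assumes that sensitivity and specificity apply at the pool level and do not depend on how many positive individuals a pool contains (no dilution effect). It computes the value that the perfect-test estimator converges to in large samples when the test is in fact imperfect. The function names are hypothetical.

```python
def pool_positive_prob(p, k, se=1.0, sp=1.0):
    """Probability that a pool of k individuals tests positive.

    Assumes pool-level sensitivity (se) and specificity (sp) that do not
    depend on the number of positive individuals in the pool.
    """
    all_negative = (1.0 - p) ** k
    return se * (1.0 - all_negative) + (1.0 - sp) * all_negative


def naive_estimate_limit(p, k, se, sp):
    """Large-sample value of the perfect-test estimator for an imperfect test."""
    pi_obs = pool_positive_prob(p, k, se, sp)
    return 1.0 - (1.0 - pi_obs) ** (1.0 / k)


if __name__ == "__main__":
    # True sensitivity 0.9, wrongly assumed to be 1.0 by the estimator.
    for k in (5, 20):
        for p in (0.01, 0.2):
            est = naive_estimate_limit(p, k, se=0.9, sp=1.0)
            print(f"k={k:2d} p={p:.2f} estimate={est:.4f} error={est - p:+.4f}")
```

In this sketch, overstating the sensitivity leads to underestimation of prevalence, and the error grows with both prevalence and pool size.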

The actual bias in any estimate depends on the true prevalence, pool size and the number of pools and can be estimated for any particular pooling strategy using simulation methods. Simulation utilities are provided for both fixed and variable pool-size strategies to assist in evaluating the potential bias in proposed pooling strategies.
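A simulation of this kind can be sketched in a few lines. The example below is not the simulation utility referred to above. It is a minimal illustration for a fixed pool size, assuming a perfect test and independent individuals, so that each pool is positive with probability 1 - (1 - p)^k. The function name, default number of replicates and seed are arbitrary choices.

```python
import random


def simulate_bias(p, m, k, reps=10000, seed=0):
    """Monte Carlo estimate of bias for m pools of size k at true prevalence p.

    Each replicate draws the status of m pools, applies the fixed
    pool-size estimator 1 - (1 - x/m)**(1/k) and the mean of the
    estimates is compared with p.
    """
    rng = random.Random(seed)
    pi = 1.0 - (1.0 - p) ** k  # probability that a single pool is positive
    total = 0.0
    for _ in range(reps):
        x = sum(rng.random() < pi for _ in range(m))  # number of positive pools
        total += 1.0 - (1.0 - x / m) ** (1.0 / k)
    return total / reps - p


if __name__ == "__main__":
    # Same total sample size (n = 100), two different pooling strategies.
    print("10 pools of 10:", round(simulate_bias(0.1, 10, 10), 4))
    print("50 pools of 2: ", round(simulate_bias(0.1, 50, 2), 4))
```

Because the result is itself a random quantity, the number of replicates should be large enough that the simulation error is small relative to the bias being measured.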

Bias can be minimised by ensuring an adequate total sample size, by testing a larger number of smaller pools rather than a smaller number of larger pools, or by testing several individual samples in addition to the pooled samples (using the variable pool size method).
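When individual samples are tested alongside pools, the pool size is no longer fixed and the estimate has no closed form. The sketch below assumes perfect tests and the usual binomial likelihood for each pool size, and finds the maximum-likelihood estimate by bisection on the score function. The function name and the (k, m, x) tuple layout are illustrative only and are not the calculator's interface.

```python
def variable_pool_mle(groups, tol=1e-10):
    """Maximum-likelihood prevalence for pools of differing sizes.

    groups is a list of (k, m, x) tuples: pool size, number of pools of
    that size, and number of those pools testing positive. Individual
    samples are simply pools with k = 1. Assumes a perfect test.
    """
    if all(x == 0 for _, _, x in groups):
        return 0.0  # no positive pools at all
    if all(x == m for _, m, x in groups):
        return 1.0  # every pool positive

    def score(p):
        # Derivative of the log-likelihood with respect to p.
        q = 1.0 - p
        return sum(
            x * k * q ** (k - 1) / (1.0 - q**k) - (m - x) * k / q
            for k, m, x in groups
        )

    # The score is positive below the estimate and negative above it.
    lo, hi = 1e-12, 1.0 - 1e-12
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if score(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    # Hypothetical data: 10 pools of 10 (6 positive) plus 20 individuals (2 positive).
    print(variable_pool_mle([(10, 10, 6), (1, 20, 2)]))
```

With a single pool size the result agrees with the fixed pool-size formula, which gives a simple check on the implementation.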
