Statistical Sample Quotas Using Clustering Model
In development
The Statistical Sample Quotas Using Clustering Model estimates the sample size needed in each sub-administrative area without finding a positive case to have high probability that prevalence in the population is at or below a threshold. The model explicitly accounts for increased disease transmission through the natural clustering behavior of hosts in the population and assumes a simple random sampling design.
Geographical Scale
- Administrative area, subdivided into a sub-administrative areas
Required Data
- Population size or population density of hosts in each sub-administrative area
User Inputs
- Average cluster size of hosts
- Correlation in disease status among hosts sharing a cluster
- Sensitivity of the diagnostic test used to declare a CWD-positive case
Outputs
- The number of randomly selected hosts that need to be tested in each sub-administrative area to have high probability (95%) that the prevalence of CWD in the overall population is at or below 0.5%, 1%, 1.5%, 2%, 3%, 4%, or 5%.

More Information
For more information, go to the CWD Data Warehouse User Manual: Statistical Sample Quotas Using Clustering Model.
Code and Docker Image
To view the code once deployed, go to the GitHub Repository: Statistical Sample Quotas Using Clustering Model. To view the docker image, go to the Docker Hub: cwhl/statistical-sample-size-with-clustering-model.
Citations
- Booth JG, Hanley BJ, Hodel FH, Jennelle CS, Guinness J, Them CE, Mitchell CI, Ahmed MS, Schuler KL. 2024. Sample size for estimating disease prevalence in free-ranging wildlife populations: A Bayesian modeling approach. Journal of Agricultural, Biological, and Environmental Sciences. 29, 438–454.
- Booth JG, Hanley BJ, Thompson NE, Gonzalez-Crespo C, Christensen SA, Jennelle CS, Caudell JN, Delisle Z, Guinness J, Hollingshead NA, Them CT, Schuler KL. Management agencies can leverage animal social structure for wildlife disease surveillance. Journal of Wildlife Diseases.