README

The /stan folder in this folder contains Bayesian model specifications written in the Stan probabalistic programming language. Each file corresponds to a variation of a model (originally developed in Keller et al., 2022) that uses environmental DNA (eDNA) data and “traditional” survey data to jointly estimate parameters. These model variations are accessed based on the type of input data and/or user-defined input parameters, including distributional assumptions.

#' @srrstats {PD2.0} This software represents probability distributions using 
#'   the 'Stan' programming language found in the model specifications in the 
#'   /stan folder.

#' @srrstats {PD1.0} Here the choices of distributions used in the model 
#'   specifications are justified.

Probability distributions were chosen for the model specifications using the model developed in Keller et al. 2022 (corresponding to stan/joint_binary_negbin.stan and stan/joint_binary_pois.stan). These original models use:

a binomial distribution to represent the probability of a qPCR detection (1) or non-detection(0) (Lahoz-Monfort et al., 2016)
a poisson or negative binomial distribution to represent how traditional survey count data are generated from an expected catch rate (Lindén and Mäntyniemi, 2011).
a normal distribution as the prior on the probability of a false positive eDNA detection. This prior is informative and specified by the user as an input argument when running jointModel() (Lahoz-Monfort et al., 2016).
a gamma distribution as the prior for the overdispersion parameter for traditional survey count data, if a negative binomial distribution is used. This prior can be informative and can be specified by the user as an input argument when running jointModel().
an ‘uninformative’ normal distribution is used as the prior for beta, which scales the sensitivity of eDNA surveys relative to traditional surveys.

Other variations on this original model specification include:

a gamma distribution to represent how continuous traditional survey data are generated from an expected catch rate.
a catchability coefficient if multiple traditional survey gear types are used and have different catchabilities
a regression to include site-level covariates that scale the sensitivity of eDNA sampling relative to traditional surveys. A “shrinkage” prior is used for these covariates as a form of Bayesian penalization (van Erp et al., 2019).

This folder also contains ‘traditional models’, which can be used to model the traditional survey data in isolation. These models can be used as a comparison with the joint model that adds eDNA survey data to determine if and how the addition of eDNA data affects inference.

References

Keller, A.G., Grason, E.W., McDonald, P.S., Ramon-Laca, A., Kelly, R.P. (2022). Tracking an invasion front with environmental DNA. Ecological Applications. 32(4): e2561. https://doi.org/10.1002/eap.2561

Lahoz-Monfort, J., Guillera-Arroita, G., Tingley, R. (2016). Statistical approaches to account for false-positive errors in environmental DNA samples. Molecular Ecology Resources. 16(3): 673-685. https://doi.org/10.1111/1755-0998.12486

Lindén, A., Mäntyniemi, S. (2011). Using the negative binomial distribution to model overdispersion in ecological count data. Ecology. 92(7): 1414-1421. https://doi.org/10.1890/10-1831.1

van Erp, S., Oberski, D.L., Mulder, J. (2019). Shrinkage priors for Bayesian penalized regression. Journal of Mathematical Psychology. 89: 31-50. https://doi.org/10.1016/j.jmp.2018.12.004