Title: | Reproducible Integrated Species Distribution Models Across Norway using 'INLA' |
---|---|
Description: | Integration of disparate datasets is needed in order to make efficient use of all available data and thereby address the issues currently threatening biodiversity. Data integration is a powerful modeling framework which allows us to combine these datasets together into a single model, yet retain the strengths of each individual dataset. We therefore introduce the package, 'intSDM': an R package designed to help ecologists develop a reproducible workflow of integrated species distribution models, using data both provided from the user as well as data obtained freely online. An introduction to data integration methods is discussed in Issac, Jarzyna, Keil, Dambly, Boersch-Supan, Browning, Freeman, Golding, Guillera-Arroita, Henrys, Jarvis, Lahoz-Monfort, Pagel, Pescott, Schmucki, Simmonds and O’Hara (2020) <doi:10.1016/j.tree.2019.08.006>. |
Authors: | Philip Mostert [aut, cre], Angeline Bruls [aut], Ragnhild {Bjørkås} [aut], Wouter Koch [aut], Ellen Martin [aut] |
Maintainer: | Philip Mostert <[email protected]> |
License: | GPL (>= 3) |
Version: | 2.1.0.9000 |
Built: | 2024-11-22 13:42:35 UTC |
Source: | https://github.com/philipmostert/intsdm |
Function used to format structure data into a coherent framework.
formatStructured(dataOCC, type, varsOld, varsNew, projection, boundary)
formatStructured(dataOCC, type, varsOld, varsNew, projection, boundary)
dataOCC |
The species occurrence data. May be either a |
type |
The type of observation model for the data. May be either: |
varsOld |
The names of the old variables in the model which need to be converted to something new. |
varsNew |
The name of the new variables in the model which will be used in the full model. |
projection |
The CRS object to add to the species occurrence data. |
boundary |
An sf object of the boundary of the study area, used to check if the data points are over the region. |
An sf
object containing the locations of the species.
Function used to generate absences for data coming from lists. This function takes all the sampling locations from all the species obtained from a given dataset, and generates an absence if a species does not occur at a given location.
generateAbsences( dataList, datasetName, speciesName, responseName, Projection, Richness )
generateAbsences( dataList, datasetName, speciesName, responseName, Projection, Richness )
dataList |
A List of data objects for the dataset. |
datasetName |
The name of the dataset. |
speciesName |
The name of the species variable name. |
responseName |
The name of the response variable name. |
Projection |
Coordinate reference system used. |
Richness |
Generate absences for the richness model. |
initValues
: Function to obtain initial values from a intModel
object.This function is used to obtain initial values by running a separate linear model on each sub-likelihood of the model to find maximum likelihood estimates, and averaging the estimates across all likelihoods.
initValues(data, formulaComponents)
initValues(data, formulaComponents)
data |
A |
formulaComponents |
A vector of fixed effects for which to find initial values. |
The return of the function depends on the argument Save
from the startWorkflow
function. If this argument is FALSE
then the objects will be saved to the specified directory. If this argument is TRUE
then a list of different outcomes from the workflow will be returned.
Function to obtain a sf
boundary object around a specified country.
obtainArea(names, projection, ...)
obtainArea(names, projection, ...)
names |
A vector of names of countries used in the analysis. |
projection |
Coordinate reference system used. |
... |
Additional arguments passed to gisco_get_countries. |
An sf
object of the boundary of the specified country.
Function to obtain covariate layers from WorldClim around a specified area.
obtainCovariate(covariates, res, projection, path)
obtainCovariate(covariates, res, projection, path)
covariates |
A vector of covariate names to obtain. |
res |
Resolution of the worldclim variable. Valid options are: |
projection |
Coordinate reference system to use in analysis. |
path |
The path where the covariate will be saved. |
A spatialRaster
object of the covariates across the specified area.
Function used to obtain species observations from GBIF.
obtainGBIF(query, geometry, projection, datasettype, ...)
obtainGBIF(query, geometry, projection, datasettype, ...)
query |
The scientific name of the species for which observations need to be obtained. |
geometry |
An |
projection |
The coordinate reference system used for the observations and geometry. |
datasettype |
The type of dataset that is obtained from GBIF. Can be one of: |
... |
Additional arguments to pass to occ_download. |
An sf
object containing the locations and other relevant information of the species obtained from GBIF.
obtainRichness
: Function to obtain richness estimates from a fitISDM
object.This function is used to obtain richness estimates for a multi-species ISDM.
obtainRichness( modelObject, predictionData, predictionIntercept, sampleSize = 1, inclProb = TRUE )
obtainRichness( modelObject, predictionData, predictionIntercept, sampleSize = 1, inclProb = TRUE )
modelObject |
A |
predictionData |
An |
predictionIntercept |
The name of the prediction dataset to use in the model. |
sampleSize |
The size of the sampling area for the prediction intercept dataset. Defaults to |
inclProb |
Include the individual probabilities for each species. Defaults to |
An sf
data.frame object of richness at each sampling location.
sdmWorkflow
: Function to compile the reproducible workflow.This function is used to compile the reproducible workflow from the R6
object created with startFunction
. Depending on what was specified before, this function will estimate the integrated species distribution model, perform cross-validation, create predictions from the model and plot these predictions.
sdmWorkflow( Workflow = NULL, predictionDim = c(150, 150), predictionData = NULL, initialValues = FALSE, inlaOptions = list(), ipointsOptions = NULL )
sdmWorkflow( Workflow = NULL, predictionDim = c(150, 150), predictionData = NULL, initialValues = FALSE, inlaOptions = list(), ipointsOptions = NULL )
Workflow |
The |
predictionDim |
The pixel dimensions for the prediction maps. Defaults to |
predictionData |
Optional argument for the user to specify their own data to predict on. Must be a |
initialValues |
Find initial values using a GLM before the model is estimated. Defaults to |
inlaOptions |
Options to specify in inla from the |
ipointsOptions |
Options to specify in fm_int's |
The return of the function depends on the argument Save
from the startWorkflow
function. If this argument is FALSE
then the objects will be saved to the specidfied directory. If this argument is TRUE
then a list of different outcomes from the workflow will be returned.
## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$workflowOutput('Model') Model <- sdmWorkflow(workflow) } ## End(Not run)
## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$workflowOutput('Model') Model <- sdmWorkflow(workflow) } ## End(Not run)
species_model
object.An object containing the data, covariates and other relevant information to be used in the reproducible workflow. The function startWorkflow acts as a wrapper in creating one of these objects. This object has additional slot functions within, which allow for further specification and customization of the reproducible workflow.
help()
Obtain documentation for a species_model
object.
species_model$help(...)
...
Not used
new()
initialize the species_model object.
species_model$new( Countries, Species, nameProject, Save, Richness, Directory, Projection, Quiet = TRUE )
Countries
Name of the countries to include in the workflow.
Species
Name of the species to include in the workflow.
nameProject
Name of the project for the workflow.
Save
Logical argument indicating if the model outputs should be saved.
Richness
Logical create a species richness model or not.
Directory
Directory where the model outputs should be saved.
Projection
The coordinate reference system used in the workflow.
Quiet
Logical variable indicating if the workflow should provide messages throughout the estimation procedure.
addArea()
species_model$addArea(Object = NULL, countryName = NULL, ...)
Object
An sf
object of the study area. If NULL
then countryName
needs to be provided.
countryName
Name of the countries to obtain a boundary for. This argument will then use the gisco_get_countries function from the giscoR
package to obtain a boundary.
...
Additional arguments passed to gisco_get_countries.
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') }
print()
Prints the datasets, their data type and the number of observations, as well as the marks and their respective families.
species_model$print(...)
...
Not used.
workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$print()
plot()
Makes a plot of the features used in the integrated model.
species_model$plot( Mesh = FALSE, Boundary = TRUE, Species = FALSE, Covariates = FALSE )
Mesh
Add the mesh to the plot.
Boundary
Add the boundary to the plot.
Species
Add the species location data to the plot.
Covariates
Add the spatial covariates to the plot.
A ggplot object.
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Germany') workflow$plot(Boundary = TRUE) }
workflowOutput()
Function to specify the workflow output from the model. This argument must be at least one of: 'Model'
, 'Prediction'
, 'Maps'
'Cross-validation'
, Bias
and 'Summary'
.
species_model$workflowOutput(Output)
Output
The names of the outputs to give in the workflow. Must be at least one of: 'Model'
, 'Prediction'
, 'Maps'
, 'Bias'
, Summary
and 'Cross-validation'
.
workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$workflowOutput('Predictions')
addStructured()
The function is used to convert structured datasets into a framework which is usable by the model. The three types of structured data allowed by this function are presence-absence (PA), presence-only (PO) and counts/abundance datasets, which are controlled using the datasetType
argument. The other arguments of this function are used to specify the appropriate variable (such as response name, trial name, species name and coordinate name) names in these datasets.
species_model$addStructured( dataStructured, datasetType, responseName, trialsName, datasetName = NULL, speciesName, coordinateNames, generateAbsences = FALSE )
dataStructured
The dataset used in the model. Must be either a data.frame
, sf
or SpatialPoints*
object, or a list
containing multiples of these classes.
datasetType
A vector which gives the type of dataset. Must be either 'count'
, 'PO'
or 'PA'
.
responseName
Name of the response variable in the dataset. If dataType
is 'PO'
, then this argument may be missing.
trialsName
Name of the trial name variable in the PA
datasets.
datasetName
An optional argument to create a new name for the dataset. Must be the same length as dataStructured
if that is provided as a list
.
speciesName
Name of the species variable name in the datasets.
coordinateNames
Names of the coordinate vector in the dataset. Only required if the datasets added are data.frame
objects.
generateAbsences
Generates absences for 'PA'
data. This is done by combining all the sampling locations for all the species in a given dataset, and creating an absence where each of the species do not occur. Requires datasetType = 'PA'
.
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') #Generate random species speciesData <- data.frame(X = runif(1000, 12, 24), Y = runif(1000, 56, 68), Response = sample(c(0,1), 1000, replace = TRUE), Name = 'Fraxinus_excelsior') workflow$addStructured(dataStructured = speciesData, datasetType = 'PA', datasetName = 'xx', responseName = 'Response', speciesName = 'Name', coordinateNames = c('X', 'Y')) }
addMesh()
Function to add an fm_mesh_2d
object to the workflow. The user may either add their own mesh to the workflow, or use the arguments of this function to help create one.
species_model$addMesh(Object, ...)
Object
An fm_mesh_2d
object to add to the workflow.
...
Additional arguments to pass to fmesher
's fm_mesh_2d_inla
. Use ?fm_mesh_2d_inla
to find out more about the different arguments.
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) }
addGBIF()
Function to add species occurrence records from GBIF (using the rgbif
package) to the reproducible workflow. The arguments for this function are used to either filter the GBIF records, or to specify the characteristics of the observation model.
species_model$addGBIF( Species = "All", datasetName = NULL, datasetType = "PO", removeDuplicates = FALSE, generateAbsences = FALSE, ... )
Species
The names of the species to include in the workflow (initially specified using startWorkflow). Defaults to All
, which will find occurrence records for all specie specified in startWorkflow.
datasetName
The name to give the dataset obtained from GBIF. Cannot be NULL
.
datasetType
The data type of the dataset. Defaults to PO
, but may also be PA
or Counts
.
removeDuplicates
Argument used to remove duplicate observations for a species across datasets. May take a long time if there are many observations obtained across multiple datasets. Defaults to FALSE
.
generateAbsences
Generates absences for 'PA'
data. This is done by combining all the sampling locations for all the species, and creating an absence where a given species does not occur.
...
Additional arguments to specify the occ_data function from rgbif
. See ?occ_data
for more details.
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') }
addCovariates()
Function to add spatial covariates to the workflow. The covariates may either be specified by the user, or they may come from worldClim obtained with the geodata
package.
species_model$addCovariates( Object = NULL, worldClim = NULL, res = 2.5, Months = "All", Function = "mean", ... )
Object
A object of class: spatRaster
, SpatialPixelsDataFrame
or raster
containing covariate information across the area. Note that this function will check if the covariates span the boundary area, so it may be preferable to add your own boundary using `.$addArea`
if this argument is specified.
worldClim
Name of the worldClim to include in the model. See ?worldclim_country
from the geodata
package for more information.
res
Resolution of the worldclim variable. Valid options are: 10
, 5
, 2.5
or 0.5
(minutes of a degree).
Months
The months to include the covariate for. Defaults to All
which includes covariate layers for all months.
Function
The function to aggregate the temporal data into one layer. Defaults to mean
.
...
Not used.
\dontrun{ if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addCovariates(worldClim = 'tavg', res = '10') } }
crossValidation()
Function to add a spatial cross validation method to the workflow.
species_model$crossValidation( Method, blockOptions = list(k = 5, rows_cols = c(4, 4), plot = FALSE, seed = 123), blockCVType = "DIC" )
Method
The spatial cross-validation methods to use in the workflow. May be at least one of spatialBlock
or Loo
(leave-one-out). See the PointedSDMs
package for more details.
blockOptions
A list of options to specify the spatial block cross-validation. Must be a named list with arguments specified for: k
, rows_cols
, plot
, seed
. See blockCV::cv_spatial
for more information.
blockCVType
The cross-validation method to complete if Method = 'spatialBlock'
. May be one of 'DIC'
(default) which will iteratively return the DIC scores for each block, or 'Predict'
. This method return scores of marginal likelihood for each combination of dataset across all blocks, by fitting a model on all blocks but one, and predicting on the left out block. The prediction dataset is automatically chosen as the first PA dataset added to the model. See blockedCV for more information. Note that this may take a long time to estimate if there are many datasets included in the model.
workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$crossValidation(Method = 'Loo')
modelOptions()
Function to specify model options for the INLA
and PointedSDMs
parts of the model.
species_model$modelOptions(ISDM = list(), Richness = list())
ISDM
Arguments to specify in startISDM from the PointedSDMs
function. This argument needs to be a named list of the following options:
pointCovariates
: non-spatial covariates attached to the data points to be included in the model.
pointsIntercept
: Logical: intercept terms for the dataset. Defaults to TRUE
pointsSpatial
: Choose how the spatial effects are included in the model. If 'copy'
then the spatial effects are shared across the datasets, if 'individual'
then the spatial effects are created for each dataset individually, and if 'correlate'
then the spatial effects are correlated. If NULL
, then spatial effects are turned off for the datasets.
Offset
: The name of the offset variable.
See ?PointedSDMs::startISDM
for more details on these choices.
Richness
Options to specify the richness model. This argument needs to be a named list of the following options:
predictionIntercept
: The name of the dataset to use as the prediction intercept in the richness model. The sampling size of the protocol must be known.
samplingSize
: The sample area size for the dataset provided in predictionIntercept
. The units should be the same as specified in startWorkflow
speciesSpatial
: Specify the species spatial model. If 'replicate'
then create a spatial effect for each species with shared hyperparameters, if 'copy'
create a spatial effect for each species. If NULL
then the spatial effects for the species will be turned off.
speciesIntercept:
If TRUE
(default) incorporate a random intercept for the species, if FALSE
use a fixed intercept and if NULL
include no intercept for the species.
workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example'))
specifySpatial()
Function to specify pc priors for the shared random field in the model. See ?INLA::inla.spde2.pcmatern
for more details.
species_model$specifySpatial(...)
...
Arguments passed on to inla.spde2.pcmatern.
\dontrun{ if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$specifySpatial(prior.range = c(200000, 0.05), prior.sigma = c(5, 0.1)) } }
specifyPriors()
Function to specify priors for the fixed effects in the model. The priors of the fixed effects are assumed to be Gaussian; this function alows the user to specify the parameters of this distribution.
species_model$specifyPriors( effectNames, Mean = 0, Precision = 0.01, copyModel = list(beta = list(fixed = FALSE)), priorIntercept = list(prior = "loggamma", param = c(1, 5e-05)), priorGroup = list(model = "iid", hyper = list(prec = list(prior = "loggamma", param = c(1, 5e-05)))) )
effectNames
The name of the effects to specify the prior for. Must be the name of any of the covariates incldued in the model, or 'Intercept' to specify the priors for the intercept terms.
Mean
The mean of the prior distribution. Defaults to 0
.
Precision
The precision (inverse variance) of the prior distribution. Defaults to 0.01
.
copyModel
List of model specifications given to the hyper parameters for the "copy"
model. Defaults to list(beta = list(fixed = FALSE))
.
priorIntercept
Prior for the precision parameter for the random intercept in the species richness model. Needs Output = "Richness"
. Defaults to the default INLA prior.
priorGroup
Prior for the precision for the iid effect in the species spatial effect in the richness model. Needs Output = "Richness"
and speciesSpatial = "replicate"
in the richness options. Defualts to the default INLA prior.
\dontrun{ if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$specifyPriors(effectName = 'Intercept', mean = 0, Precision = 0.1) } }
biasFields()
Function to add bias fields to the model.
species_model$biasFields( datasetName, copyModel = FALSE, shareModel = FALSE, ... )
datasetName
Name of the dataset to add a bias field to.
copyModel
Create copies of the biasField across the different datasets. Defaults to FALSE
.
shareModel
Share a bias field across the datasets specified with datasetNames
. Defaults to FALSE
.
...
Additional arguments passed on to inla.spde2.pcmatern to customize the priors for the pc matern for the bias fields.
\dontrun{ if(requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$biasFields(datasetName = 'exampleGBIF') } }
modelFormula()
Add a formula to the model
species_model$modelFormula(covariateFormula, biasFormula)
covariateFormula
Change the covariate formula of the model.
biasFormula
Change the bias formula of the model
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addCovariate(rasterStack) workflow$addFormula(covariateFormula = ~ covariate) workflow$addFormula(biasFormula = ~ biasFormula) }
obtainMeta()
Obtain metadata from the workflow.
species_model$obtainMeta(Number = TRUE, Citations = TRUE)
Number
Print the number of observations per dataset. Defaults to TRUE
.
Citations
Print the citations for the GBIF obtained datasets. Defaults to TRUE
.
\dontrun{ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$obtainMeta() }
clone()
The objects of this class are cloneable with this method.
species_model$clone(deep = FALSE)
deep
Whether to make a deep clone.
## ------------------------------------------------ ## Method `species_model$addArea` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') ## End(Not run) ## ------------------------------------------------ ## Method `species_model$print` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$print() ## ------------------------------------------------ ## Method `species_model$plot` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Germany') workflow$plot(Boundary = TRUE) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$workflowOutput` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$workflowOutput('Predictions') ## ------------------------------------------------ ## Method `species_model$addStructured` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') #Generate random species speciesData <- data.frame(X = runif(1000, 12, 24), Y = runif(1000, 56, 68), Response = sample(c(0,1), 1000, replace = TRUE), Name = 'Fraxinus_excelsior') workflow$addStructured(dataStructured = speciesData, datasetType = 'PA', datasetName = 'xx', responseName = 'Response', speciesName = 'Name', coordinateNames = c('X', 'Y')) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$addMesh` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$addGBIF` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') ## End(Not run) ## ------------------------------------------------ ## Method `species_model$addCovariates` ## ------------------------------------------------ ## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addCovariates(worldClim = 'tavg', res = '10') } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$crossValidation` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$crossValidation(Method = 'Loo') ## ------------------------------------------------ ## Method `species_model$modelOptions` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) ## ------------------------------------------------ ## Method `species_model$specifySpatial` ## ------------------------------------------------ ## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$specifySpatial(prior.range = c(200000, 0.05), prior.sigma = c(5, 0.1)) } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$specifyPriors` ## ------------------------------------------------ ## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$specifyPriors(effectName = 'Intercept', mean = 0, Precision = 0.1) } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$biasFields` ## ------------------------------------------------ ## Not run: if(requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$biasFields(datasetName = 'exampleGBIF') } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$modelFormula` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addCovariate(rasterStack) workflow$addFormula(covariateFormula = ~ covariate) workflow$addFormula(biasFormula = ~ biasFormula) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$obtainMeta` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$obtainMeta() ## End(Not run)
## ------------------------------------------------ ## Method `species_model$addArea` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') ## End(Not run) ## ------------------------------------------------ ## Method `species_model$print` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$print() ## ------------------------------------------------ ## Method `species_model$plot` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Germany') workflow$plot(Boundary = TRUE) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$workflowOutput` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$workflowOutput('Predictions') ## ------------------------------------------------ ## Method `species_model$addStructured` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') #Generate random species speciesData <- data.frame(X = runif(1000, 12, 24), Y = runif(1000, 56, 68), Response = sample(c(0,1), 1000, replace = TRUE), Name = 'Fraxinus_excelsior') workflow$addStructured(dataStructured = speciesData, datasetType = 'PA', datasetName = 'xx', responseName = 'Response', speciesName = 'Name', coordinateNames = c('X', 'Y')) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$addMesh` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$addGBIF` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') ## End(Not run) ## ------------------------------------------------ ## Method `species_model$addCovariates` ## ------------------------------------------------ ## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addCovariates(worldClim = 'tavg', res = '10') } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$crossValidation` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$crossValidation(Method = 'Loo') ## ------------------------------------------------ ## Method `species_model$modelOptions` ## ------------------------------------------------ workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) ## ------------------------------------------------ ## Method `species_model$specifySpatial` ## ------------------------------------------------ ## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$specifySpatial(prior.range = c(200000, 0.05), prior.sigma = c(5, 0.1)) } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$specifyPriors` ## ------------------------------------------------ ## Not run: if (requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) #Add boundary workflow$addArea(countryName = 'Sweden') workflow$addMesh(cutoff = 20000, max.edge=c(60000, 80000), offset= 100000) workflow$specifyPriors(effectName = 'Intercept', mean = 0, Precision = 0.1) } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$biasFields` ## ------------------------------------------------ ## Not run: if(requireNamespace('INLA')) { workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$biasFields(datasetName = 'exampleGBIF') } ## End(Not run) ## ------------------------------------------------ ## Method `species_model$modelFormula` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addCovariate(rasterStack) workflow$addFormula(covariateFormula = ~ covariate) workflow$addFormula(biasFormula = ~ biasFormula) ## End(Not run) ## ------------------------------------------------ ## Method `species_model$obtainMeta` ## ------------------------------------------------ ## Not run: workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = '+proj=longlat +ellps=WGS84', Save = FALSE, saveOptions = list(projectName = 'example')) workflow$addArea(countryName = 'Sweden') workflow$addGBIF(datasetName = 'exampleGBIF', datasetType = 'PA', limit = 10000, coordinateUncertaintyInMeters = '0,50') workflow$obtainMeta() ## End(Not run)
startWorkflow
: function to commence the integrated species distribution model workflow.Function to initialize the reproducible workflow using integrated species distribution models. The arguments for this function are used to specify which species and countries are to be studied, as well as how the results of the model should be saved (either as an R object or saved to some directory). This function outputs an R6
object with additional slot functions to help further customize the model specification. See ?species_model
for more details on these functions.
startWorkflow( Countries, Species, Projection, Save = TRUE, Richness = FALSE, saveOptions = list(projectDirectory = NULL, projectName = NULL), Quiet = FALSE )
startWorkflow( Countries, Species, Projection, Save = TRUE, Richness = FALSE, saveOptions = list(projectDirectory = NULL, projectName = NULL), Quiet = FALSE )
Countries |
A vector of country names to complete the analysis over. If missing, a boundary object (of class |
Species |
A vector of Species names (scientific) to include in the analysis. Names should be given carefully since the names provided will be used to obtain GBIF observations. |
Projection |
The coordinate reference system used in the workflow. |
Save |
Logical argument indicating if the model objects and outputs should be saved as .rds files. Defaults to |
Richness |
Logical option to create maps for each species individually, or create a species richness model. Defaults to |
saveOptions |
A list containing two items: |
Quiet |
Logical argument indicating if the workflow should provide the user messages during the setup and estimation process. Defaults to |
An R6 object of class species_model
. This object contains a collection of slot functions to assist the user in customizing their workflow.
##Start a workflow without saving objects workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = "+proj=longlat +ellps=WGS84", Save = FALSE, saveOptions = list(projectName = 'example'))
##Start a workflow without saving objects workflow <- startWorkflow(Species = 'Fraxinus excelsior', Projection = "+proj=longlat +ellps=WGS84", Save = FALSE, saveOptions = list(projectName = 'example'))