Use coverage-based methods to estimate Hill diversity measures over a gridded map or as a time series. Three Hill diversity measures are covered:
Species richness - hill0_map()
and hill0_ts()
Hill-Shannon diversity - hill1_map()
and hill1_ts()
Hill-Simpson diversity - hill2_map()
and hill2_ts()
(see 'Details' for more information).
Usage
hill0_map(data, coverage = 0.95, cutoff_length = 5, ...)
hill0_ts(data, coverage = 0.95, cutoff_length = 5, ...)
hill1_map(data, cutoff_length = 5, coverage = 0.95, ...)
hill1_ts(data, cutoff_length = 5, coverage = 0.95, ...)
hill2_map(data, cutoff_length = 5, coverage = 0.95, ...)
hill2_ts(data, cutoff_length = 5, coverage = 0.95, ...)
Arguments
- data
A data cube object (class 'processed_cube').
- coverage
The sample coverage value for the estimator. Default is 0.95.
- cutoff_length
The minimum number of data points for each grid cell. Grid cells with fewer data points will be removed before calculations to avoid errors. Default is 5.
- ...
Arguments passed on to
compute_indicator_workflow
ci_type
Type of bootstrap confidence intervals to calculate. (Default: "norm". Select "none" to avoid calculating bootstrap CIs.)
cell_size
Length of grid cell sides, in km. (Default: 10 for country, 100 for continent or world)
level
Spatial level: 'cube', 'continent', 'country', 'world', 'sovereignty', or 'geounit'. (Default: 'cube')
region
The region of interest (e.g., "Europe"). (Default: "Europe")
ne_type
The type of Natural Earth data to download: 'countries', 'map_units', 'sovereignty', or 'tiny_countries'. (Default: "countries")
ne_scale
The scale of Natural Earth data to download: 'small' - 110m, 'medium' - 50m, or 'large' - 10m. (Default: "medium")
output_crs
The CRS you want for your calculated indicator. (Leave blank to let the function choose a default based on grid reference system)
first_year
Exclude data before this year. (Uses all data in the cube by default.)
last_year
Exclude data after this year. (Uses all data in the cube by default.)
spherical_geometry
If set to FALSE, will temporarily disable spherical geometry while the function runs. Should only be used to solve specific issues. (Default is TRUE)
make_valid
Calls st_make_valid() from the sf package. Increases processing time but may help if you are getting polygon errors. (Default is FALSE).
num_bootstrap
Set the number of bootstraps to calculate for generating confidence intervals. (Default: 1000)
crs_unit_convert
Force a particular output CRS even when it has different units than the input CRS. (Default: FALSE)
shapefile_path
Path of an external shapefile to merge into the workflow. For example, if you want to calculate your indicator particular features such as protected areas or wetlands.
invert
Calculate an indicator over the inverse of the shapefile (e.g. if you have a protected areas shapefile this would calculate an indicator over all non protected areas)
Value
An S3 object with the classes 'indicator_map' or 'indicator_ts' and 'hill0' or 'hill1' or 'hill2' containing the calculated indicator values and metadata.
Details
Hill diversity
Hill (1973) introduced the concept of Hill diversity, which assumes that the number and relative abundance of species are inseparable components of diversity. Hill diversity uses a single equation to calculate multiple measures of diversity by varying a single parameter ℓ, which changes the emphasis on rare vs common species (Roswell et al., 2019). It represents the mean rarity of sampled species, and is calculated as: $$ D = \left( \sum_{i=1}^{S} p_i^\ell \right)^{1/(1-\ell)} $$where D is diversity, S is the number of species, pi is the proportion of individuals belonging to species i, ri is the rarity of species i, and ℓ determines the rarity scale for the mean. While ℓ can theoretically take almost any value, three common measures of diversity are special cases: species richness, and modified versions of the Shannon and Simpson diversity indices (Roswell et al., 2019). These three measures occur when ℓ takes the value of 1, 0 (or near-zero, as ℓ cannot actually take the value of 0), or -1, respectively.
Species Richness (ℓ = 1): $$ D = S $$
Hill-Shannon Diversity (ℓ ≈ 0): $$ D = e^{-\sum_{i=1}^{S} p_i \ln(p_i)} $$
Hill-Simpson Diversity (ℓ = -1): $$ D = \frac{1}{\sum_{i=1}^{S} p_i^2} $$
Richness uses an arithmetic scale (the arithmetic mean), thus giving rare species a lot of leverage. By contrast, Hill-Shannon diversity uses a logarithmic scale (the geometric mean), treating common and rare species equally, and Hill-Simpson diversity uses a reciprocal scale (the harmonic mean), giving common species higher leverage.
Coverage-based estimation
Hill diversity values can be estimated through different standardisation procedures as a way to mitigate the effects of sample size and sampling biases. One way to do this is by equalising sample size by calculating a species accumulation curve (a plot of cumulative species richness as a function of sample size) for each year or grid cell. The smallest sample size from among all the grid cells or years in the dataset is used as a reference to select richness values from each curve. This is called rarefaction. It is also possible to use a larger sample size as a reference, but this requires extrapolation of smaller samples, which is more prone to error than rarefaction.
However, results from sample-size based estimation can be problematic as they depend on both richness and evenness. A sample from a community with a more even distribution of individuals across species is likely to show higher richness than a sample of the same size from a community where many species are rare, as the rare species are less likely to appear in the sample. Similarly, a community containing a lot of species will appear less rich than it actually is if the sample size used for comparison is too small. Detectability also plays an important part; hard to detect species are less likely to appear in the sample, so communities in which rare species are more easily detectable are likely to yield richer samples.
Another way to estimate species richness is to standardise by coverage. The iNEXT package (Chao et al., 2014; Hsieh et al., 2016) for R is used to estimate species richness at an equal level of coverage (e.g. 0.95) for each cell or year in a biodiversity data cube. Coverage is the proportion of individuals in the community belonging to species in the sample. So, at a coverage of 0.95, 95% of individuals in the community belong to species detected in the sample while 5% belong to species that are not detected in the sample. Coverage is estimated based on the frequencies of species already in the sample. It can be illustrated using a species accumulation curve, the slope of which represents the probability of detecting a new species with the next individual you sample from a community. At a sample size of zero, the slope would be one, meaning the next individual sampled has a 100% probability of being a species not already in the sample. Therefore, a coverage value of one corresponds to the asymptote of a species accumulation curve (slope of zero), meaning no new species would be uncovered through further sampling.
References
Hill, M. O. (1973). Diversity and evenness: a unifying notation and its consequences. Ecology, 54(2), 427-432.
Roswell, M., Shipley, J., & Ewers, R. M. (2019). A conceptual guide to measuring and interpreting functional diversity. Journal of Applied Ecology, 56(12), 2533-2543.
Chao, A., Gotelli, N. J., Hsieh, T. C., Sander, E. L., Ma, K. H., Colwell, R. K., & Ellison, A. M. (2014). Rarefaction and extrapolation with Hill numbers: a framework for sampling and estimation in species diversity studies. Ecological monographs, 84(1), 45-67.
Hsieh, T. C., Ma, K. H., & Chao, A. (2016). iNEXT: an R package for rarefaction and extrapolation of species diversity (Hill numbers). Methods in Ecology and Evolution, 7(12), 1451-1456.
Examples
if (FALSE) { # \dontrun{
h0_map <- hill0_map(example_cube_1, level = "country", region = "Denmark")
plot(h0_map)
} # }
if (FALSE) { # \dontrun{
h0_ts <- hill0_ts(example_cube_1, first_year = 1985)
plot(h0_ts)
} # }
if (FALSE) { # \dontrun{
h1_map <- hill1_map(example_cube_1, level = "country", region = "Denmark")
plot(h1_map)
} # }
if (FALSE) { # \dontrun{
h1_ts <- hill1_ts(example_cube_1, first_year = 1985)
plot(h1_ts)
} # }
if (FALSE) { # \dontrun{
h2_map <- hill2_map(example_cube_1, level = "country", region = "Denmark")
plot(h2_map)
} # }
if (FALSE) { # \dontrun{
h2_ts <- hill2_ts(example_cube_1, first_year = 1985)
plot(h2_ts)
} # }