MarineSPEED is a benchmark dataset for presence-only species distribution modelling. It contains a collection of 514 marine species linked to environmental data for the 71 current climate layers of Bio-ORACLE and MARSPEC.
An R package for downloading and working with MarineSPEED is available on CRAN and on GitHub. For more information see the R package documentation, the README or contact me at mail@samuelbosch.com .
# Installation from CRAN install.packages("marinespeed") # or from GitHub devtools::install_github("samuelbosch/marinespeed")
Example usage:
library(marinespeed) ## list of all species species <- list_species() View(species) ## count number of occurrences for all species get_occ_count <- function(speciesname, occ) { nrow(occ) } record_counts <- lapply_species(get_occ_count) print(sum(unlist(record_counts))) ## plot first 2 folds for the first 10 species plot_occurrences <- function(speciesname, data, k) { title <- paste0(speciesname, " (fold = ", k, ")") plot(data$occurrence_train[,c("longitude", "latitude")], pch=".", col="blue", main = title) points(data$occurrence_test[,c("longitude", "latitude")], pch=".", col="red") } # plot training (blue) and test (red) occurrences of the first 2 disc folds # for the first 10 species species <- list_species() lapply_kfold_species(plot_occurrences, species=species[1:10,], fold_type = "disc", k = 1:2)
You can also directly download all MarineSPEED data with the following links.
The occurrence records where originally sourced from GBIF ( datasets ), OBIS ( datasets ), Reef Life Survey, INVASIVES project and personal communications.
The associated environmental data was extracted from Bio-ORACLE and MARSPEC. The appropriate citations are:
Taxonomic information was retrieved from the World Register of Marine Species (WoRMS).
Sampling bias information was visually assessed.
Ecoregion data was generated based on: Spalding M.D., Fox H.E., Allen G.R., Davidson N., FerdaƱa Z. a., Finlayson M., Halpern B.S., Jorge M. a., Lombana A., Lourie S. a., Martin K.D., Mcmanus E., Molnar J., Recchia C. a., & Robertson J. (2007) Marine Ecoregions of the World: A Bioregionalization of Coastal and Shelf Areas. BioScience, 57, 573. http://dx.doi.org/10.1641/B570707