It brings together several aspects of biodiversity data-cleaning in one place. 'bdc' is organized in thematic modules related to different biodiversity dimensions, including 1) Merge datasets: standardization and integration of different datasets; 2) Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments; 3) Taxonomy: cleaning, parsing, and harmonization of scientific names from several taxonomic groups against taxonomic databases locally stored through the application of exact and partial matching algorithms; 4) Space: flagging of erroneous, suspect, and low-precision geographic coordinates; and 5) Time: flagging and, whenever possible, correction of inconsistent collection date. In addition, it contains features to visualize, document, and report data quality – which is essential for making data quality assessment transparent and reproducible. The reference for the methodology is Bruno et al. (2022) <doi:10.1111/2041-210X.13868>.
Version: | 1.1.1 |
Imports: | CoordinateCleaner, dplyr, DT, foreach, fs, ggplot2, here, magrittr, purrr, qs, readr, rgnparser, rnaturalearth, sf (≥ 1.0.5), stringdist, stringi, stringr, taxadb (≥ 0.1.3), tibble, tidyselect |
Suggests: | contentid (≥ 0.0.15), countrycode, covr, cowplot, DBI, doParallel, duckdb (≥ 0.3.2), knitr (≥ 1.31), maps, markdown, rangeBuilder, rappdirs, raster, remotes, rlang (≥ 1.0.1), rmarkdown, rnaturalearthdata, rvest, sp, testthat (≥ 3.0.0), xml2 |
Published: | 2022-05-27 |
Author: | Bruno Ribeiro [aut, cre], Santiago Velazco [aut], Karlo Guidoni-Martins [aut], Geiziane Tessarolo [aut], Lucas Jardim [aut], Steven Bachman [ctb], Rafael Loyola [ctb] |
Maintainer: | Bruno Ribeiro <ribeiro.brr at gmail.com> |
BugReports: | https://github.com/brunobrr/bdc/issues |
License: | GPL (≥ 3) |
URL: | https://brunobrr.github.io/bdc/ (website) https://github.com/brunobrr/bdc |
NeedsCompilation: | no |
Language: | en-gb |
Materials: | README NEWS |
CRAN checks: | bdc results |
Reference manual: | bdc.pdf |
Vignettes: |
Standardization and integration of different datasets Pre-filter Space Taxonomy Time |
Package source: | bdc_1.1.1.tar.gz |
Windows binaries: | r-devel: bdc_1.1.1.zip, r-release: bdc_1.1.1.zip, r-oldrel: bdc_1.1.1.zip |
macOS binaries: | r-release (arm64): bdc_1.1.1.tgz, r-oldrel (arm64): bdc_1.1.1.tgz, r-release (x86_64): bdc_1.1.1.tgz, r-oldrel (x86_64): bdc_1.1.1.tgz |
Old sources: | bdc archive |
Please use the canonical form https://CRAN.R-project.org/package=bdc to link to this page.