reclin2: Record Linkage Toolkit

Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, em-algorithm for estimating m- and u-probabilities (I. Fellegi & A. Sunter (1969) <doi:10.1080/01621459.1969.10501049>, T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. Focus is on memory, CPU performance and flexibility.

Version: 0.1.1
Depends: data.table, R (≥ 3.6.0)
Imports: stringdist, stats, utils, lpSolve, Rcpp, parallel
LinkingTo: Rcpp
Suggests: simplermarkdown
Published: 2022-01-07
Author: Jan van der Laan ORCID iD [aut, cre]
Maintainer: Jan van der Laan <r at eoos.dds.nl>
BugReports: https://github.com/djvanderlaan/reclin2/issues
License: GPL-3
URL: https://github.com/djvanderlaan/reclin2
NeedsCompilation: yes
SystemRequirements: C++11
Materials: README
CRAN checks: reclin2 results

Documentation:

Reference manual: reclin2.pdf
Vignettes: Deduplication using reclin2
Introduction to reclin2
Record linkage using machine learning
Using a cluster for record linkage

Downloads:

Package source: reclin2_0.1.1.tar.gz
Windows binaries: r-devel: reclin2_0.1.1.zip, r-release: reclin2_0.1.1.zip, r-oldrel: reclin2_0.1.1.zip
macOS binaries: r-release (arm64): reclin2_0.1.1.tgz, r-oldrel (arm64): reclin2_0.1.1.tgz, r-release (x86_64): reclin2_0.1.1.tgz, r-oldrel (x86_64): reclin2_0.1.1.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=reclin2 to link to this page.