Record Linkage for Empirically Motivated Priors
Check whether 2 records which are estimated to be linked have the same...
This function takes a set of pairwise links and identifies correct, in...
Function that returns the shared MPMMS (except with an easier conditio...
Function to compute a record's Maximal Matching Set (MMS) based on a s...
Function to compute a record's MPMMS based on a Gibbs sampler. Note: I...
Function to take links list that may contain 3-way, 4-way, etc. and re...
Gibbs sampler for empirically motivated Bayesian record linkage
An implementation of the model in Steorts (2015) <DOI:10.1214/15-BA965SI>, which performs Bayesian entity resolution for categorical and text data, for any distance function defined by the user. In addition, the precision and recall are in the package to allow one to compare to any other comparable method such as logistic regression, Bayesian additive regression trees (BART), or random forests. The experiments are reproducible and illustrated using a simple vignette. LICENSE: GPL-3 + file license.