Create Representative Records After Entity Resolution
Composite record from a cluster using a weighted average of each colum...
Prototype record from a cluster.
The distance between two records
dist_col_type Inner column type record distance function
Calculate the empirical KL divergence for a representative dataset as ...
Get posterior weights for each record post record-linkage using poster...
Create a representative dataset post record-linkage.
representr: A package for creating representative records post-record ...
within_category_compare_cpp Inner column type record distance function
An implementation of Kaplan, Betancourt, Steorts (2022) <doi:10.1080/00031305.2022.2041482> that creates representative records for use in downstream tasks after entity resolution is performed. Multiple methods for creating the representative records (data sets) are provided.