RecordLinkage0.4-12.4 package

Record Linkage Functions for Linking and Deduplicating Data Sets

mrl

Mean Residual Life Plot

mygllm

Generalized Log-Linear Fitting

isFALSE

Check for FALSE

makeBlockingPairs

Create record pairs from blocks of ids.

append-methods

Concatenate comparison patterns or classification results

classifySupv

Supervised Classification

classifyUnsup

Unsupervised Classification

clone

Serialization of record linkage object.

compare.rd

Compare Records

deleteNULLs

Remove NULL Values

editMatch

Edit Matching Status

emClassify

Weight-based Classification of Data Pairs

emWeights

Calculate weights

epiClassify

Classify record pairs with EpiLink weights

epiWeights

Calculate EpiLink weights

ff_vector-class

Class "ff_vector"

ffdf-class.rd

Class "ffdf"

genSamples

Generate Training Set

getErrorMeasures-methods.rd

Calculate Error Measures

getExpectedSize

Estimate number of record pairs.

getFrequencies-methods

Get attribute frequencies

getMinimalTrain

Create a minimal training set

getPairs-methods.rd

Extract Record Pairs

getPairsBackend

Backend function for getPairs

getParetoThreshold

Estimate Threshold from Pareto Distribution

getTable-methods

Build contingency table

gpdEst

Estimate Threshold from Pareto Distribution

internals

Internal functions and methods

optimalThreshold

Optimal Threshold for Record Linkage

phonetics

Phonetic Code

RecLinkClassif-class

Class "RecLinkClassif"

RecLinkData-class.rd

Class "RecLinkData"

RecLinkData.object.rd

Record Linkage Data Object

RecLinkResult-class

Class "RecLinkResult"

RecLinkResult.object.rd

Record Linkage Result Object

resample

Safe Sampling

RLBigData-class

Class "RLBigData"

RLBigData-constructors.rd

Constructors for big data objects.

RLBigDataDedup-class

Class "RLBigDataDedup"

RLBigDataLinkage-class

Class "RLBigDataLinkage"

RLdata.rd

Test data for Record Linkage

RLResult-class

Class "RLResult"

show

Show a RLBigData object

splitData

Split Data

stochastic.rd

Stochastic record linkage.

strcmp.rd

String Metrics

subset

Subset operator for record linkage objects

summary.rd

Print Summary of Record Linkage Data

summary.RLBigData.rd

summary methods for "RLBigData" objects.

summary.RLResult.rd

Summary method for "RLResult" objects.

texSummary

LaTeX Summary of linkage results

trainSupv

Train a Classifier

unorderedPairs

Create Unordered Pairs

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain. For details, see our paper "The RecordLinkage Package: Detecting Errors in Data" Sariyar M / Borg A (2010) <doi:10.32614/RJ-2010-017>.

  • Maintainer: Murat Sariyar
  • License: GPL (>= 2)
  • Last published: 2022-11-08