RecordLinkage 0.4-12.6 package

Record Linkage Functions for Linking and Deduplicating Data Sets

optimalThreshold.rd

Optimal Threshold for Record Linkage

phonetics.rd

Phonetic Code

RecLinkClassif-class.rd

Class "RecLinkClassif"

RecLinkData-class.rd

Class "RecLinkData"

RecLinkData.object.rd

Record Linkage Data Object

RecLinkResult-class.rd

Class "RecLinkResult"

RecLinkResult.object.rd

Record Linkage Result Object

resample.rd

Safe Sampling

RLBigData-class.rd

Class "RLBigData"

RLBigData-constructors.rd

Constructors for big data objects.

RLBigDataDedup-class.rd

Class "RLBigDataDedup"

RLBigDataLinkage-class.rd

Class "RLBigDataLinkage"

RLdata.rd

Test data for Record Linkage

RLResult-class.rd

Class "RLResult"

show.rd

Show a RLBigData object

splitData.rd

Split Data

stochastic.rd

Stochastic record linkage.

unorderedPairs.rd

Create Unordered Pairs

append-methods.rd

Concatenate comparison patterns or classification results

classifySupv.rd

Supervised Classification

classifyUnsup.rd

Unsupervised Classification

clone.rd

Serialization of record linkage object.

compare.rd

Compare Records

deleteNULLs.rd

Remove NULL Values

editMatch.rd

Edit Matching Status

emClassify.rd

Weight-based Classification of Data Pairs

emWeights.rd

Calculate weights

epiClassify.rd

Classify record pairs with EpiLink weights

epiWeights.rd

Calculate EpiLink weights

ff_vector-class.rd

Class "ff_vector"

ffdf-class.rd

Class "ffdf"

genSamples.rd

Generate Training Set

getErrorMeasures-methods.rd

Calculate Error Measures

getExpectedSize.rd

Estimate number of record pairs.

getFrequencies-methods.rd

Get attribute frequencies

getMinimalTrain.rd

Create a minimal training set

getPairs-methods.rd

Extract Record Pairs

getPairsBackend.rd

Backend function for getPairs

getParetoThreshold.rd

Estimate Threshold from Pareto Distribution

getTable-methods.rd

Build contingency table

gpdEst.rd

Estimate Threshold from Pareto Distribution

internals.rd

Internal functions and methods

isFALSE.rd

Check for FALSE

makeBlockingPairs.rd

Create record pairs from blocks of ids.

mrl.rd

Mean Residual Life Plot

mygllm.rd

Generalized Log-Linear Fitting

strcmp.rd

String Metrics

subset.rd

Subset operator for record linkage objects

summary.rd

Print Summary of Record Linkage Data

summary.RLBigData.rd

summary methods for "RLBigData" objects.

summary.RLResult.rd

Summary method for "RLResult" objects.

texSummary.rd

LaTeX Summary of linkage results

trainSupv.rd

Train a Classifier

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented, as well as classification algorithms from the machine learning domain. For details, see our paper "The RecordLinkage Package: Detecting Errors in Data", Sariyar M and Borg A (2010), <doi:10.32614/RJ-2010-017>.

  • Maintainer: Murat Sariyar
  • License: GPL (>= 2)
  • Last published: 2026-01-25