Tiny example dataset for probabilistic linkage
Contains fictional records of 7 persons. data
Two data frames with resp. 6 and 5 records and 6 columns.
id
the id of the person; this contains no errors and can be used to validate the linkage.lastname
the last name of the person; contains errors.firstname
the first name of the persons; contains errors.address
the address; contains errors.sex
the sex; contains errors and missing values.postcode
the postcode; contains no errors.