fuzzylink R package [Documentation]

check_match

Test whether two strings match with an LLM prompt.

dot

Compute the dot product between two vectors

fuzzylink

Probabilistic Record Linkage Using Pretrained Text Embeddings

get_embeddings

Get pretrained text embeddings

get_similarity_matrix

Create matrix of embedding similarities

get_training_set

Create a training set

hand_label

Hand Label A Dataset

mistral_api_key

Install a MISTRAL API KEY in Your .Renviron File for Repeated Use

openai_api_key

Install an OPENAI API KEY in Your .Renviron File for Repeated Use


Links datasets through fuzzy string matching using pretrained text embeddings. Produces more accurate record linkage when lexical string distance metrics are a poor guide to match quality (e.g., "Patricia" is more lexically similar to "Patrick" than it is to "Trish"). Capable of performing multilingual record linkage. Methods are described in Ornstein (2025) <doi:10.1017/pan.2025.10016>.
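The "Patricia" example can be checked with a purely lexical metric. The sketch below is not part of fuzzylink and calls none of its functions. It uses only base R's `utils::adist()` (generalized Levenshtein distance) as a stand-in for "lexical string distance", and the variable names are illustrative assumptions.

```r
# Base R only: utils::adist() computes generalized Levenshtein (edit) distance.
# This illustrates the lexical baseline, not fuzzylink's embedding-based method.
name <- "Patricia"
candidates <- c("Patrick", "Trish")

# adist() returns a 1 x 2 matrix of edit distances from `name` to each candidate.
lexical_distance <- utils::adist(name, candidates)
colnames(lexical_distance) <- candidates

print(lexical_distance)

# The candidate with the smallest edit distance is the lexical "best match".
closest <- candidates[which.min(lexical_distance)]
print(closest)
```

Edit distance ranks "Patrick" as the closer string, even though "Trish" is a common nickname for "Patricia". This is the kind of case the description refers to when it says lexical metrics can be a poor guide to match quality.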

Maintainer: Joe Ornstein
License: MIT + file LICENSE
Last published: 2025-08-29

Useful links

fuzzylink 0.2.5 package
