Fuzzy String Comparison
Matching based on common tokens
String similarity ratio
Score multiple candidate strings against a single input
Simplify a matrix
Ordered token matching
Weighted token similarity measure
Weighted version of lev_token_set_ratio()
Weighted version of lev_token_sort_ratio()
Find all substrings of a given length
Tokenise and sort a string
Tokenise a string
Default parameters inherited by other documentation
Internal functions
Get the best matched string from a list of candidates
String distance metrics
Ratio of the best-matching substring
Provides string similarity calculations inspired by the Python 'thefuzz' package. Compare strings by edit distance, similarity ratio, best matching substring, ordered token matching and set-based token matching. A range of edit distance measures are available thanks to the 'stringdist' package.
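
The measures named above can be illustrated with a minimal, self-contained Python sketch. It is not the package's implementation: the function names (levenshtein, ratio, partial_ratio, token_sort_ratio, best_match) are hypothetical, only plain Levenshtein distance is used, and the ratio is assumed to be one minus the edit distance divided by the length of the longer string. Set-based token matching and the weighted variants are left out for brevity.

```python
import re


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def ratio(a: str, b: str) -> float:
    """Similarity in [0, 1]; assumed here to be 1 - distance / longer length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1 - levenshtein(a, b) / longest


def partial_ratio(a: str, b: str) -> float:
    """Ratio of the shorter string against its best-matching substring."""
    shorter, longer = sorted((a, b), key=len)
    n = len(shorter)
    return max(ratio(shorter, longer[i:i + n])
               for i in range(len(longer) - n + 1))


def tokenise(s: str) -> list[str]:
    """Lower-case the string and split it into word tokens."""
    return re.findall(r"\w+", s.lower())


def token_sort_ratio(a: str, b: str) -> float:
    """Ratio after tokenising and sorting, so word order is ignored."""
    return ratio(" ".join(sorted(tokenise(a))),
                 " ".join(sorted(tokenise(b))))


def best_match(text: str, candidates: list[str], scorer=ratio) -> str:
    """Return the candidate with the highest score against the input."""
    return max(candidates, key=lambda candidate: scorer(text, candidate))


print(levenshtein("kitten", "sitting"))                     # 3
print(token_sort_ratio("new york mets", "mets new york"))   # 1.0
print(partial_ratio("yankees", "new york yankees"))         # 1.0
```

Passing a different scorer to best_match, for example best_match("bilbo", names, scorer=partial_ratio), changes how candidates are ranked, which is the idea behind scoring several candidate strings against a single input.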