CategoricalDistances function

Distances among individuals using nominal variables.

CategoricalDistances(x, y = NULL, coefficient = "GOW", transformation = "sqrt(1-S)")

Arguments

  • x: A matrix of categorical data
  • y: A second matrix of categorical data with the same variables as x
  • coefficient: Similarity coefficient to use (see details)
  • transformation: Transformation of the similarity into a distance

Details

The function calculates similarities and dissimilarities among a set of objects characterized by a set of nominal variables. It first computes similarities and then converts them into dissimilarities using one of several transformations controlled by the user.
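The similarity-then-transformation idea can be sketched in base R. This is only an illustration under stated assumptions, not the package's implementation: the helper name simple_matching_distance is hypothetical, the similarity used is the simple-matching proportion (which is what Gower's coefficient reduces to for purely nominal variables without missing values), and the conversion is the default "sqrt(1-S)" transformation from the usage line.

```r
# Hypothetical helper (not part of the package): simple-matching similarity
# between the rows of x and the rows of y, converted with sqrt(1 - S).
simple_matching_distance <- function(x, y = NULL) {
  x <- as.matrix(x)
  # With y = NULL, distances are computed among the rows of x itself
  if (is.null(y)) y <- x else y <- as.matrix(y)
  stopifnot(ncol(x) == ncol(y))
  p <- ncol(x)
  S <- matrix(0, nrow(x), nrow(y))
  for (i in seq_len(nrow(x))) {
    for (j in seq_len(nrow(y))) {
      # Proportion of variables on which the two individuals agree
      S[i, j] <- sum(x[i, ] == y[j, ]) / p
    }
  }
  # Transform the similarity into a distance
  sqrt(1 - S)
}

# Three individuals described by three nominal variables
x <- matrix(c("a", "b", "c",
              "a", "b", "d",
              "e", "f", "g"), nrow = 3, byrow = TRUE)

D  <- simple_matching_distance(x)                          # 3 x 3, rows of x
D2 <- simple_matching_distance(x, x[1:2, , drop = FALSE])  # 3 x 2, x against y
```

Individuals that agree on every variable get distance 0 and individuals that agree on none get distance 1. Other coefficients would change only how S is computed, and other transformations only the last line.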

Returns

A matrix of distances between the rows of x and the rows of y. If y is NULL, the distances among the rows of x are calculated.

References

dos Santos, T. R., & Zarate, L. E. (2015). Categorical data clustering: What similarity measure to recommend? Expert Systems with Applications, 42(3), 1247-1260.

Boriah, S., Chandola, V., & Kumar, V. (2008). Similarity measures for categorical data: A comparative evaluation. In Proceedings of the 2008 SIAM International Conference on Data Mining (pp. 243-254).

Author(s)

Jose Luis Vicente Villardon

Examples

##---- Should be DIRECTLY executable !! ----
  • Maintainer: Jose Luis Vicente Villardon
  • License: GPL (>= 2)
  • Last published: 2023-11-21
