Category Variable Encodings
Encode a given factor variable automatically
Encode a given factor variable using deviation encoding
Encode a given factor variable using difference encoding
Encode a given factor variable using dummy variables
Encode a given factor variable using helmert encoding
Encode a given factor variable using low rank encoding
Encode a given factor variable using means encoding
Encode a given factor variable using median encoding
Encode a given factor variable using a multinomial logit representatio...
Encode a given factor variable using a repeated effect encoding
Encode a given factor variable using a simple effect encoding
Encode a given factor variable using a sparse PCA representation
Simple, fast, and automatic encodings for category data using a data.table backend. Most of the methods are an implementation of "Sufficient Representation for Categorical Variables" by Johannemann, Hadad, Athey, Wager (2019) <arXiv:1908.09874>, particularly their mean, sparse principal component analysis, low rank representation, and multinomial logit encodings.
Useful links