mixdir R package [Documentation]

find_defining_features

Find the n defining features

find_predictive_features

Find the top predictive features and values for each latent class

find_typical_features

Find the most typical features and values for each latent class

mixdir

Cluster high dimensional categorical datasets

plot_features

Plot cluster distribution for a subset of features

predict.mixdir

Predict the class of a new observation

Scalable Bayesian clustering of categorical datasets. The package implements a hierarchical Dirichlet (Process) mixture of multinomial distributions. It is thus a probabilistic latent class model (LCM) and can be used to reduce the dimensionality of hierarchical data and cluster individuals into latent classes. It can automatically infer an appropriate number of latent classes or find k classes, as defined by the user. The model is based on a paper by Dunson and Xing (2009) <doi:10.1198/jasa.2009.tm08439>, but implements a scalable variational inference algorithm so that it is applicable to large datasets. It is described and tested in the accompanying paper by Ahlmann-Eltze and Yau (2018) <doi:10.1109/DSAA.2018.00068>.

Maintainer: Constantin Ahlmann-Eltze
License: GPL-3
Last published: 2019-09-20
https://github.com/const-ae/mixdir
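
The two modes mentioned in the description (a user-defined number of classes versus an automatically inferred one) might be used roughly as follows. This is a minimal sketch, not taken from this page: it assumes the package is installed, that the bundled `mushroom` dataset is available, and that the `n_latent` and `select_latent` arguments and the `pred_class` and `class_prob` result fields behave as described in the package README. The row and column subset is arbitrary and chosen only to keep the fit fast.

```r
# Sketch under the assumptions stated above; check ?mixdir before relying on it.
library(mixdir)

data("mushroom")
X <- mushroom[1:1000, 1:5]   # small categorical subset (arbitrary choice)

set.seed(1)                  # initialization is random, so fix the seed

# Mode 1: ask for exactly k = 3 latent classes.
fit_fixed <- mixdir(X, n_latent = 3)

# Mode 2: allow up to 10 classes and let the model decide how many are used.
fit_auto <- mixdir(X, n_latent = 10, select_latent = TRUE)

# Most likely class for each row, and the per-row class probabilities.
table(fit_fixed$pred_class)
head(fit_fixed$class_prob)

# Class probabilities for rows that were not used for fitting.
predict(fit_fixed, mushroom[1001:1005, 1:5])
```

With `select_latent = TRUE`, `n_latent` acts as an upper bound rather than a target, so some of the classes may end up essentially empty.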

mixdir 0.3.0 package
