Parallel Model-Based Clustering using Expectation-Gathering-Maximization Algorithm for Finite Mixture Gaussian Model
Parallel Model-Based Clustering
Parallel Model-Based Clustering and Parallel K-means Algorithm
Read Me First Function
Set Global Variables According to the Global Matrix X.gbd (X.spmd)
A Set of Parameters in Model-Based Clustering
A Set of Controls in Model-Based Clustering
Obtain a Set of Random Samples for X.spmd
Compute One E-step and Log Likelihood Based on Current Parameters
Compute One M-Step Based on Current Posterior Probabilities
One EM Step for GBD
Initialization for EM-like Algorithms
EM-like Steps for GBD
Generate Examples for Testing
Generate MixSim Examples for Testing
Obtain Total Elements for Every Cluster
Independent Function for Log Likelihood
Print Results of Model-Based Clustering
Update CLASS.spmd Based on the Final Iteration
Functions for Printing or Summarizing Objects According to Classes
All Internal Functions
Aims to utilize model-based clustering (unsupervised) for high-dimensional and ultra-large data, especially in a distributed manner. The code employs 'pbdMPI' to perform an expectation-gathering-maximization algorithm for finite mixture Gaussian models. Unstructured dispersion matrices are assumed in the Gaussian models. By default, the implementation follows the single program multiple data (SPMD) programming model. The code can be executed through 'pbdMPI' and MPI implementations such as 'OpenMPI' and 'MPICH'. See the High Performance Statistical Computing website <https://snoweye.github.io/hpsc/> for more information, documents, and examples.
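To illustrate the expectation-maximization iteration that the package distributes across MPI ranks, here is a minimal serial sketch of one EM step for a univariate Gaussian mixture. This is not the package's implementation (which operates on distributed X.spmd blocks and gathers sufficient statistics with 'pbdMPI'); the data, starting values, and variable names below are hypothetical and chosen only for illustration.

```r
# Hypothetical two-component data: not from the package.
set.seed(123)
x <- c(rnorm(100, mean = 0), rnorm(100, mean = 5))

# Hypothetical starting parameter guesses.
pi.k <- c(0.5, 0.5)   # mixing proportions
mu.k <- c(-1, 4)      # component means
sd.k <- c(1, 1)       # component standard deviations

# E-step: posterior probabilities (responsibilities) per observation;
# in the distributed setting each rank computes this on its own rows.
dens <- sapply(1:2, function(k) pi.k[k] * dnorm(x, mu.k[k], sd.k[k]))
z <- dens / rowSums(dens)

# Log likelihood under the current parameters.
ll <- sum(log(rowSums(dens)))

# M-step: update parameters from the responsibilities; the distributed
# version gathers these column sums across ranks before updating.
n.k <- colSums(z)
pi.k <- n.k / length(x)
mu.k <- colSums(z * x) / n.k
sd.k <- sqrt(colSums(z * outer(x, mu.k, "-")^2) / n.k)
```

Iterating the E- and M-steps until the log likelihood stabilizes yields the fitted mixture; the gathering of per-rank sufficient statistics is what distinguishes the expectation-gathering-maximization variant used here.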