cluster-statistics function

Cluster size statistics

Cluster size statistics

SumOfRanges(x, cluster = 1) SumOfVariances(x, cluster = 1) SumOfVars(x, cluster = 1) MeanCentroidDistance(x, cluster = 1, Average = mean) MeanCentDist(x, cluster = 1, Average = mean) MeanCentroidDist(x, cluster = 1, Average = mean) DistanceFromMedian(x, cluster = 1, Average = mean) DistFromMed(x, cluster = 1, Average = mean) MeanNN(x, cluster = 1, Average = mean) MeanMSTEdge(x, cluster = 1)

Arguments

  • x: Matrix in which each row lists the coordinates of a point in a Euclidian space; or, where supported, dist object specifying distances between each pair of points.
  • cluster: Optional integer vector specifying the cluster or group to which each row in x belongs.
  • Average: Function to use to summarize distances. Defaults to mean; specifying median returns a value akin to the median absolute divergence (see mad).

Returns

SumOfRanges() returns a numeric specifying the sum of ranges within each cluster across all dimensions.

SumOfVariances() returns a numeric specifying the sum of variances within each cluster across all dimensions.

MeanCentroidDistance() returns a numeric specifying the mean distance from the centroid to points in each cluster.

DistanceFromMedian() returns a numeric specifying the mean distance of each point (except the median) from the median point of its cluster.

MeanNN() returns a numeric specifying the mean distance from each point within a cluster to its nearest neighbour.

MeanMSTEdge() returns a numeric specifying the mean length of an edge in the minimum spanning tree of points within each cluster.

Examples

points <- rbind(matrix(1:16, 4), rep(1, 4), matrix(1:32, 8, 4) / 10) cluster <- rep(1:3, c(4, 1, 8)) plot( points[, 1:2], # Plot first two dimensions of four-dimensional space col = cluster, pch = cluster, # Style by cluster membership asp = 1, # Fix aspect ratio to avoid distortion ann = FALSE, frame = FALSE # Simple axes ) SumOfRanges(points, cluster) SumOfVariances(points, cluster) MeanCentroidDistance(points, cluster) DistanceFromMedian(points, cluster) MeanNN(points, cluster) MeanMSTEdge(points, cluster)

See Also

Other tree space functions: Islands(), MSTSegments(), MapTrees(), MappingQuality(), SpectralEigens(), median.multiPhylo()

Other cluster functions: KMeansPP()

Author(s)

Martin R. Smith

(martin.smith@durham.ac.uk)