Clustering 1.7.10 package

Techniques for Evaluating Clustering

metrics_validate

Method that returns the list of used metrics

mini_kmeans_method

Method that runs the mini_kmeans algorithm using the Euclidean metric ...

mona_method

Method that runs the mona algorithm using external or internal validat...

number_columnas_external

Method that returns how many external metrics there are in the array o...

number_columnas_internal

Method that returns how many internal metrics there are in the array o...

number_variables_dataset

Method that returns the number of variables in a dataset directory

packages

Method that returns the list of used packages

pam_euclidean_method

Method that runs the pam algorithm using the Euclidean metric to make ...

pam_manhattan_method

Method that runs the pam algorithm using the Manhattan metric to make ...

path_dataset

Method that returns a list of the files that exist in a directory

plot_clustering

Graphic representation of the evaluation measures.

precision_metric

Method to calculate the precision.

pvclust_correlation_method

Method that runs the pvclust algorithm using the Correlation metric to...

pvclust_euclidean_method

Method that runs the pvclust algorithm using the Euclidean metric to m...

pvpick_method

Method that runs the pvpick algorithm using an external or internal va...

read_file

Method that converts a dataset into a matrix

recall_metric

Method to calculate the recall.

refactorName

Method for refactoring the distance measurement name.

result_external_algorithm_by_metric

External results by algorithm.

result_internal_algorithm_by_metric

Internal results by algorithm

resultClustering

Method for filtering clustering results.

row_name_df_external

Method in charge of obtaining those metrics that are external from tho...

row_name_df_internal

Method in charge of obtaining those metrics that are internal from tho...

show_result_external_algorithm_by_metric

Method that returns a table with the algorithm and the metric indicate...

show_result_external_algorithm_group_by_clustering

Method in charge of obtaining a table with the results of the algorith...

show_result_internal_algorithm_by_metric

Method that returns a table with the algorithm and the metric indicate...

show_result_internal_algorithm_group_by_clustering

Method in charge of obtaining a table with the results of the algorith...

silhouette_metric

Method to calculate the silhouette.

sort.clustering

Returns the clustering result sorted by a set of metrics.

specify_decimal

Method that formats a number with four digits

sub-.clustering

Filter metrics in a clustering object returning a new clustering o...

transform_dataset

Method for filtering external columns of a dataset.

transform_dataset_internal

Method for filtering internal columns of a dataset.

variation_information_metric

Method to calculate the variation of information.

aggExCluster_euclidean

Method that runs the aggExCluster algorithm using the Euclidean metric...

agnes_euclidean_method

Method that runs the agnes algorithm using the Euclidean metric to mak...

agnes_manhattan_method

Method that runs the agnes algorithm using the Manhattan metric to mak...

algorithm_amap

amap package algorithms

algorithm_apcluster

apcluster package algorithms

algorithm_cluster

cluster package algorithms

algorithm_clusterr

ClusterR package algorithms

algorithm_pvclust

pvclust package algorithms

algorithms

Method that returns the list of used algorithms

algorithms_package

Method that returns all the algorithms executed by the package

apclusterK_euclidean

Method that runs the apclusterK algorithm using the Euclidean metric t...

apclusterK_manhattan

Method that runs the apclusterK algorithm using the Manhattan metric t...

apclusterK_minkowski

Method that runs the apclusterK algorithm using the Minkowski metric t...

appClustering

Clustering GUI.

best_ranked_external_metrics

Best rated external metrics.

best_ranked_internal_metrics

Best rated internal metrics.

calculate_best_external_variables_by_metrics

Method that calculates the best rated external metrics.

calculate_best_internal_variables_by_metrics

Method that calculates the best rated internal metrics.

calculate_best_validation_external_by_metrics

Method that calculates which algorithm and which metric behaves best f...

calculate_best_validation_internal_by_metrics

Method that calculates which algorithm and which metric behaves best f...

calculate_connectivity

Method to calculate the connectivity.

calculate_dunn

Method to calculate the Dunn index.

calculate_result

Method that returns the value or variable depending on where it is in ...

calculate_result_internal

Method that returns the value or variable depending on where it is in ...

calculate_validation_external_by_metrics

Method that calculates which algorithm behaves best for the datasets p...

calculate_validation_internal_by_metrics

Method that calculates which algorithm behaves best for the datasets p...

clara_euclidean_method

Method that runs the clara algorithm using the Euclidean metric to mak...

clara_manhattan_method

Method that runs the clara algorithm using the Manhattan metric to mak...

clustering

Clustering algorithm.

connectivity_metric

Method to calculate the connectivity.

convert_numeric_matrix

Method that converts a matrix into numerical format.

convert_table

Method in charge of creating a table from an array with the values of ...

convert_toOrdinal

Method to convert columns to ordinal.

daisy_euclidean_method

Method that runs the daisy algorithm using the Euclidean metric to mak...

daisy_gower_method

Method that runs the daisy algorithm using the Gower metric to make an...

daisy_manhattan_method

Method that runs the daisy algorithm using the Manhattan metric to mak...

dataframe_by_metrics_evaluation

Method to filter only the external measurement columns

detect_definition_attribute

Method in charge of detecting the limit of a dataset header.

diana_euclidean_method

Method that runs the diana algorithm using the Euclidean metric to mak...

dunn_metric

Method to calculate the Dunn index.

entropy_formula

Method for calculating entropy.

entropy_metric

Method to calculate the entropy.

evaluate_all_column_dataset

Method in charge of calculating the average for all datasets using all...

evaluate_best_validation_external_by_metrics

Evaluates algorithms by measures of dissimilarity based on a metric.

evaluate_best_validation_internal_by_metrics

Evaluates algorithms by measures of dissimilarity based on a metric.

evaluate_validation_external_by_metrics

Evaluate external validations by algorithm.

evaluate_validation_internal_by_metrics

Evaluate internal validations by algorithm.

execute_datasets

Evaluation clustering algorithm.

execute_package_parallel

Evaluation clustering algorithm.

export_file_external

Export the results of external metrics to LaTeX.

export_file_internal

Export the results of internal metrics to LaTeX.

extension_file

Method that returns the extension of a file

external_validation

Method that applies different external metrics to a data frame o...

fanny_euclidean_method

Method that runs the fanny algorithm using the Euclidean metric to mak...

fanny_manhattan_method

Method that runs the fanny algorithm using the Manhattan metric to mak...

fill_cluster_vector

Method that fills a vector

fmeasure_metric

Method to calculate the f_measure.

fowlkes_mallows_index_metric

Method to calculate the Fowlkes-Mallows index.

gmm_euclidean_method

Method that runs the gmm algorithm using the Euclidean metric to make ...

gmm_manhattan_method

Method that runs the gmm algorithm using the Manhattan metric to make ...

hclust_euclidean

Method that runs the hcluster algorithm using the Euclidean metric to ...

information_external

Method that returns an array with the external information of the clus...

information_internal

Method that returns an array with the internal information of the clus...

initializeExternalValidation

Method that returns a list of external validation metrics initialized to zero.

initializeInternalValidation

Method that returns a list of internal validation metrics initialized to zero.

internal_validation

Method that applies different internal metrics to a data frame o...

is_External_Metrics

Method that checks for external metrics

is_Internal_Metrics

Method that checks for internal metrics

kmeans_arma_method

Method that runs the kmeans_arma algorithm using the Euclidean metric ...

kmeans_rcpp_method

Method that runs the kmeans_rcpp algorithm using the Euclidean metric ...

max_value_metric

Method that returns the maximum value of a metric.

measure_amap

Metrics of the amap algorithm

measure_apcluster

Metrics of the apcluster algorithm

measure_calculate

Method that returns all the measures executed by the package from the ...

measure_cluster

Metrics of the cluster algorithm

measure_clusterr

Metrics of the ClusterR algorithm

measure_package

Method that returns all the measures executed by the package

measure_pvclust

Metrics of the pvclust algorithm

metrics_calculate

Method in charge of verifying the implemented metrics

metrics_external

Method that returns the list of used external metrics

metrics_internal

Method that returns the list of used internal metrics

This package is designed to run algorithms from several clustering packages and compare their results, in order to determine which algorithm performs best on the data provided. See Martos, L.A.P., García-Vico, Á.M., González, P. et al. (2023), "Clustering: an R library to facilitate the analysis and comparison of cluster algorithms" <doi:10.1007/s13748-022-00294-2>; Martos, L.A.P., García-Vico, Á.M., González, P. et al., "A Multiclustering Evolutionary Hyperrectangle-Based Algorithm" <doi:10.1007/s44196-023-00341-3>; and Martos, L.A.P., García-Vico, Á.M., González, P. et al., "An Evolutionary Fuzzy System for Multiclustering in Data Streaming" <doi:10.1016/j.procs.2023.12.058>.
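
The idea of comparing algorithms by an external metric can be illustrated with a minimal base R sketch. It does not call any function of this package: pair_precision_recall() is a hypothetical helper written for this example, it uses a pair-counting definition of precision and recall, and the package's own precision_metric and recall_metric may be defined differently. The iris data and the choice of k = 3 are likewise assumptions made only for illustration.

```r
# Illustrative sketch in base R only; it does not use the Clustering package.
# pair_precision_recall() is a hypothetical helper, not a package function.
pair_precision_recall <- function(truth, pred) {
  idx <- utils::combn(length(truth), 2)        # all unordered pairs of observations
  same_truth <- truth[idx[1, ]] == truth[idx[2, ]]
  same_pred  <- pred[idx[1, ]]  == pred[idx[2, ]]
  tp <- sum(same_truth & same_pred)            # pairs grouped together in both
  c(precision = tp / sum(same_pred),
    recall    = tp / sum(same_truth))
}

x     <- scale(iris[, 1:4])                    # numeric variables, standardized
truth <- as.integer(iris$Species)              # known class labels

set.seed(1)                                    # kmeans uses random starts
candidates <- list(
  kmeans = kmeans(x, centers = 3, nstart = 10)$cluster,
  hclust = cutree(hclust(dist(x, method = "euclidean"), method = "average"), k = 3)
)

# One row per algorithm, one column per metric.
scores <- t(sapply(candidates, pair_precision_recall, truth = truth))
print(round(scores, 4))

# Name of the algorithm with the highest pair-counting precision.
best <- rownames(scores)[which.max(scores[, "precision"])]
print(best)
```

The resulting table has one row per algorithm and one column per metric, which is the kind of side-by-side comparison the description above refers to.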