Miscellaneous Functions from Alexey Shipunov
Adjusted Rand index
Aggregates by one vector and uses it for row names
Finds all duplicates
Plot bootstrap values
Bootstrapped hclust
Calculates the best overlap
Adds correlation arrows to the scatterplot
Convert diagnostic keys and classification lists
Bootstrap clustering
Bootstrap with kNN
Bootstrap with 'randomForest()'
Grouped boxplots
System date, time plus easy save history
Adds confidence bands to the simple linear model plots
Samples along the class labels
Class projection
Clustergram: visualize the cluster structure
Average coefficients of determination for each variable
Compare checklists
Correlation matrix with p-values
Calculates correlation and converts results into the named long vector
Coefficients of variation
Removes duplicated data values downstream
Distance matrix based kNN classification
Improved dotcharts
Plot ellipse
Confidence ellipses
Boxplot explanation
Examples of colors
Examples of fonts
Examples of line types
Example of plot margins
Point examples
Examples of plot types
Colorize tips of 'hclust' plot
Textual file system browser
Fill data values downstream, like in spreadsheets
Gap coding
Generates datasets for clustering
Compute the simple Gini coefficient
Gower distance
Classification grid and decision boundaries
Draw with 'R'
Clustering to matrix
Counts matches between two hierarchical clusterings
Calculates coordinates of nodes from 'hclust' plot
Histogram with overlaid curve
Convex hulls for multiple groups
Rarefaction curves
Simple bootstrap and jackknife clustering
Coefficient of divergence
Game of Life
Dotchart-like plot sfor every scaled variable grouped by factor
Ls
Interpreter for effect sizes
MDS: dimension importance ("explained variance" surrogate)
Miney game
Minus names
Misclassification (confusion) table
Textual plot of missing data
Calls MrBayes
Matrix Representation of Hierarchical Clustering
Normalized Compression Distance
Check normality
Calculates overlap between polygons
Pairwise table of effects with magnitudes
Robust rank order test post hoc derivative
Pairwise Chi-squared or Fisher test for 2-dimensional tables
Plant phyllotaxis
Point in hull
Correlation circles (correlation pleiads)
Plot phylogenetic tree with clades collapsed
Dotchart which reflects the "best" base distance method
Plots dotchart with best clustering method
Dotchart which reflects the "best" of non-base distances
Changes the appearance of cluster dendrogram
Number of cases in each location reflected in the point size
Area of the polygon
Center of the polygon
Select rows from data frame
Imitation (!) of the modern 'R' logo
Read 'FASTA' files
Read 'NTSYSpc' files
Basic multiple recoding
Roots phylogenetic trees even if outgroup is not monophyletic
Calculates multiple correlation matrices (via 'factor1') and stacks th...
Converts 'rpart' object into Newick tree
Rresults shell script
Robust rank order test
S-value
salix_leaves
Say ``no'' to dynamite plots!
Simple Match distance
Areas of polygons
'str()' enchanced for data frames
Convert table to data frame saving structure
Calculates coordinates of tips from 'hclust' plot
Easy way to add text labels to 'hclust' plot
Binarize (make dummy variables)
Insert content to Linux X11 clipboard
Stacks correlation matrix
Text-to-columns
Educated distances for semi-supervised clustering
Arrange observations by the distance from center
Effect sizes of association between categorical variables
Write 'FASTA' files
Separate terminal pager for Linux
A collection of functions for data manipulation, plotting and statistical computing, to use separately or with the book "Visual Statistics. Use R!": Shipunov (2020) <http://ashipunov.info/shipunov/software/r/r-en.htm>. Dr Alexey Shipunov died in December 2022. Most useful functions: Bclust(), Jclust() and BootA() which bootstrap hierarchical clustering; Recode() which does multiple recoding in a fast, simple and flexible way; Misclass() which outputs confusion matrix even if classes are not concerted; Overlap() which measures group separation on any projection; Biarrows() which converts any scatterplot into biplot; and Pleiad() which is fast and flexible correlogram.