Percentage of missing values per feature are visualized as a bar plot.
PlotMissingvalues(Data,Names,WhichDefineMissing=c('NA','NaN','DUMMY','.',' '),PlotIt=TRUE,xlab='Amount Of Missing Values in Percent',xlim=c(0,100),...)
Arguments
Data: [1:n,1:d] data cases in rows, variables/features in columns
Names: [1:d] optional vector of string describing the names of the features
WhichDefineMissing: [1:d] optional vector of string describing missing values, usefull for character features. Currently up to five different options are possible.
PlotIt: If FALES: Does not plot
xlab: x label of bar plot
xlim: x axis limits in percent
...: Further arguments passed on to barplot, such as main for title
Returns
plots not finite and missing values as a bar plot for each feature d and returns with invisible the amount of missing values as a vector. Works even with character variables, but WhichDefineMissing cannot be changed at the current version. Please make a suggestion on GitHub how to improve this.
Author(s)
Michael Thrun
Note
Does not work with the tibble format, in such a case please call as.data.frame(as.matrix(Data))