GetNumTrainingValues Calculates the number of values to be used as a training set.
GetNumTrainingValues(n.row, prob.percent =0.15)
Arguments
n.row: Number of rows of the all dataset with training and test values.
prob.percent: Percentage of training values
Returns
Number of training values.
Details
the number of values to be used as a training set is calculated as a minimum between 15% of the number of rows in the dataset and 15% of 5000.
Examples
## Generate dataset.seed(100)n <-180x <- sample(1:100, n, replace =TRUE)x[70:90]<- sample(110:115,21, replace =TRUE)x[25]<-200x[150]<-170df <- data.frame(timestamp =1:n, value = x)# Get number of instances to train phaseGetNumTrainingValues(nrow(df))
References
A. Lavin and S. Ahmad, “Evaluating Real-time Anomaly Detection Algorithms – the Numenta Anomaly Benchmark,” in 14th International Conference on Machine Learning and Applications (IEEE ICMLA’15), 2015.