partition() R function from [mlr3]

Manually Partition into Training, Test and Validation Set

Creates a split of the row ids of a Task into a training and a test set, and optionally a validation set.


partition(task, ratio = 0.67)

Arguments

task: (Task )

Task to operate on.
ratio: (numeric())

Ratio of observations to put into the training set. If a 2 element vector is provided, the first element is the ratio for the training set, the second element is the ratio for the test set. The validation set will contain the remaining observations.

Examples


# regression task partitioned into training and test set
task = tsk("california_housing")
split = partition(task, ratio = 0.5)
data = data.frame(
  y = c(task$truth(split$train), task$truth(split$test)),
  split = rep(c("train", "predict"), lengths(split[c("train", "test")]))
)
boxplot(y ~ split, data = data)

# classification task partitioned into training, test and validation set
task = tsk("pima")
split = partition(task, c(0.66, 0.14))

mlr3 package Read PDF manual

Maintainer: Marc Becker
License: LGPL-3
Last published: 2025-03-12

Useful links

partition function

Manually Partition into Training, Test and Validation Set

Arguments

Examples