Spam Classification Task
Spam data set from the UCI Machine Learning Repository (http://archive.ics.uci.edu/dataset/94/spambase). The data set was collected at Hewlett-Packard Labs to classify emails as spam or non-spam. 57 variables indicate the frequency of certain words and characters in the email. The positive class is set to "spam".
R6::R6Class inheriting from TaskClassif.
Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt. Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304
Donor: George Forman (gforman at nospam hpl.hp.com) 650-857-7835
Preprocessing: Columns have been renamed. Preprocessed data taken from the kernlab package.
This Task can be instantiated via the dictionary mlr_tasks or with the associated sugar function tsk():
mlr_tasks$get("spam")
tsk("spam")
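As a minimal sketch of what can be done after instantiation, the snippet below inspects the task's metadata and fits a simple learner. It assumes the mlr3 and rpart packages are installed; the choice of the "classif.rpart" learner, the random split via partition(), and the seed are illustrative assumptions and not part of this page.

```r
library(mlr3)

# Construct the task with the sugar function; mlr_tasks$get("spam") is equivalent.
task = tsk("spam")

# Basic metadata: dimensions, target column, class labels, positive class.
print(task$nrow)
print(task$ncol)
print(task$target_names)
print(task$class_names)
print(task$positive)

# Illustrative follow-up (assumption): decision tree on a random train/test split.
set.seed(1)
split = partition(task)
learner = lrn("classif.rpart")
learner$train(task, row_ids = split$train)
prediction = learner$predict(task, row_ids = split$test)

# Classification accuracy on the held-out rows.
acc = prediction$score(msr("classif.acc"))
print(acc)
```

The accuracy depends on the split and the learner, so no particular value should be expected.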
Dua, Dheeru and Graff, Casey (2017). UCI Machine Learning Repository. http://archive.ics.uci.edu/datasets.
Chapter in the mlr3book: https://mlr3book.mlr-org.com/chapters/chapter2/data_and_basic_modeling.html
Package mlr3data for more toy tasks.
Package mlr3oml for downloading tasks from https://www.openml.org.
Package mlr3viz for some generic visualizations.
Dictionary of Tasks: mlr_tasks. Use as.data.table(mlr_tasks) for a table of available Tasks in the running session (depending on the loaded packages).
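The dictionary lookup can be sketched as follows, assuming mlr3 and its data.table dependency are installed. The variable names keys and tasks_table are illustrative; the table's exact columns may differ between mlr3 versions.

```r
library(mlr3)

# Keys of all tasks registered in the current session.
keys = mlr_tasks$keys()
print("spam" %in% keys)

# Convert the dictionary to a table and show the row describing the spam task.
tasks_table = data.table::as.data.table(mlr_tasks)
print(tasks_table[tasks_table$key == "spam", ])
```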
mlr3fselect and mlr3filters for feature selection and feature filtering.
Extension packages for additional task types:
Other Task: Task, TaskClassif, TaskRegr, TaskSupervised, TaskUnsupervised, california_housing, mlr_tasks, mlr_tasks_breast_cancer, mlr_tasks_german_credit, mlr_tasks_iris, mlr_tasks_mtcars, mlr_tasks_penguins, mlr_tasks_pima, mlr_tasks_sonar, mlr_tasks_wine, mlr_tasks_zoo