Diabetes dataset

Diabetes—A logistic data set, determining whether a woman tested positive for diabetes. 100 percent accurate results are possible using the logistic function in the Ensembles package.

  • Maintainer: Russ Conte
  • License: MIT + file LICENSE
  • Last published: 2025-10-12

About the dataset

  • Number of rows: 768
  • Number of columns: 9
  • Class: data.frame

Column names and types

  • Pregnancies:integer
  • Glucose:integer
  • BloodPressure:integer
  • SkinThickness:integer
  • Insulin:integer
  • BMI:numeric
  • DiabetesPedigreeFunction:numeric
  • Age:integer
  • Outcome:integer