A cleaned version of the original Pima Indians Diabetes dataset from the `mlbench` package. Useful for demonstrating regression approaches for binary outcomes.
Format
A data frame with 768 observations and 9 variables:
- pregnant
Number of times pregnant
- glucose
Plasma glucose concentration (glucose tolerance test)
- pressure
Diastolic blood pressure (mm Hg)
- triceps
Triceps skin fold thickness (mm)
- insulin
2-Hour serum insulin (mu U/ml)
- mass
Body mass index (BMI)
- pedigree
Diabetes pedigree function
- age
Age in years
- diabetes
Factor indicating diabetes status (pos/neg)