Datasets for DSCI 425
These datasets are comma-delimited (.csv) files, a format that both R and JMP read directly.
Datasets from Sections 2 and 3
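As a rough illustration of reading a comma-delimited file into R, the sketch below writes a tiny stand-in CSV to a temporary location and reads it back with read.csv. The column names and values are made up for the example and are not taken from any course dataset; in practice csv_path would point at a downloaded file.

```r
# Hypothetical stand-in for a course file: write a tiny CSV to a temp
# location so the example runs without downloading anything.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("x,y,group",
             "1.0,2.5,a",
             "2.0,3.7,b",
             "3.0,5.1,a"), csv_path)

# Read the comma-delimited file into a data frame.
# header = TRUE treats the first row as column names.
dat <- read.csv(csv_path, header = TRUE, stringsAsFactors = FALSE)

# Quick checks that the import looks sensible.
str(dat)
head(dat)
```

Setting stringsAsFactors = FALSE keeps text columns as character vectors; convert them with factor() where a method expects a categorical variable.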
Datasets from Section 4 - ACE/AVAS
Assignment 1 - Datasets
Datasets from Section 5 - MARS
Datasets from Section 6 - Projection Pursuit Regression
Datasets from Section 7 - Neural Networks
Assignment 2 - Dataset
Datasets from Section 8 - Regularized/Penalized Regression Methods
Assignment 3 - Datasets
Datasets from Section 9 - Dimension Reduction Methods - PCR and PLS Regression
yarn - contained in the pls package.
Assignment 4 - Datasets
College - contained in the ISLR package, which must be installed from CRAN.
Datasets from Section 10 - Tree-based Regression Models
QSAR Melting Point - QSARmtp.csv (XGBoost Example)
Assignment 5 - Datasets
cars - contained in the caret package, available from CRAN.
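Three datasets on this page (yarn, College, and cars) ship inside R packages rather than as .csv files. One possible way to load them is sketched below. The helper load_pkg_data is a hypothetical convenience function written for this example, not part of any package, and it assumes the packages have already been installed with install.packages().

```r
# Hypothetical helper (not from any package): load one dataset from an
# installed package and return it, or NULL if the package is missing.
load_pkg_data <- function(name, pkg) {
  # system.file() returns "" when the package is not installed.
  if (!nzchar(system.file(package = pkg))) {
    message("Package '", pkg, "' is not installed; try install.packages(\"",
            pkg, "\")")
    return(NULL)
  }
  env <- new.env()
  # data() copies the named dataset into the scratch environment.
  data(list = name, package = pkg, envir = env)
  get(name, envir = env)
}

yarn    <- load_pkg_data("yarn", "pls")      # Section 9
College <- load_pkg_data("College", "ISLR")  # Assignment 4
cars    <- load_pkg_data("cars", "caret")    # Assignment 5
```

Base R also ships a different dataset named cars in the datasets package, so asking for the caret version by package name, as above, avoids picking up the wrong one.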
MIDTERM PROJECT DATASETS
Solubility Training Data - SoluTrain.csv
Solubility Test Data - SoluTest.csv
Datasets from Section 11 - Nearest Neighbor Regression
City77.csv - used in example regarding statistical distance
Datasets from Section 13 - Nearest Neighbor Classification
Datasets from Section 14 - Naive Bayes Classification
Assignment 6 - Datasets
Oil Identification - Oils.csv
Assignment 7 - Datasets
Datasets from Section 15 - Tree-based Models for Classification
Cleveland Heart Disease Data - Cleveland.csv
Assignment 8 - Datasets
Assignment 9 - Datasets
Alzheimers - Alzheimers.csv
Final Project Datasets