DATASETS FOR DSCI 415 (these will be updated regularly!)

Datasets in Book

Section 1 - Graphics in R

Section 2 - Measuring Distance

Section 3 - Multidimensional Scaling

Section 4 - Principal Components and Other Dimension Reduction Methods

Section 5 - Cluster Analysis

Section 6 - Correspondence Analysis

Section 7 - Association Rules

Section 8 - Recommender Systems

Section 9 - Text Mining and Sentiment Analysis

Datasets for Assignments

Assignment 1 - Matrix Algebra and Graphics in R

Assignment 2 - Distance and Multidimensional Scaling

Assignment 3 - Principal Component Analysis (PCA)

Assignment 4 - Independent Component Analysis (ICA)

Assignment 5 - Cluster Analysis

Assignment 6 - MCA

Assignment 7 - Association Rules

Assignment 8 - Recommender Systems

===============================================================================================================================

 

Italian Olive Oils

Olives.JMP
Olives.txt

Brazil Faces

Brazil.csv ( each column are the pixels to create one of the 200 faces in the database)

Car Images

Cars.csv

Letter Recognition Data

Letter-recognition.JMP

Milk Truck Datasets (Milktruck, Milkdiesel, Milkgasoline in R)

Milktruck.JMP, Milkdiesel.JMP, Milkgasoline.JMP

Minnesota Districts, Teachers, and 8th Grade Test Scores (MNteachtest in R)

MNteachtest.JMP

NHL Skater Stats

PuckAnalytics.JMP

PuckAnalytics.csv

 

NCI Data

NCI.JMP

NCI transpose.JMP

Nutritional Data on Fast Food (Nutritional.Small and Nutritional.Large in R)

Nutritional (small).JMP and Nutritional (large).JMP


Orthopedic Sales Data

Orthopedic Sales.JMP

Orthopedic Sales.txt

Radiotherapy (Radiotherapy in R)

Radiotherapy.JMP

Salespeople

Salespeople.JMP

Schlerosis

Schlerosis.JMP

Sports Difficulty

Sports Difficulty.JMP (Description File click here)

Sports Difficulty.txt

Trackwomen (Trackwomen in R)

 

Trackwomen.JMP

Trackmen (Trackmen in R)

Trackmen.JMP

ZIP Code (Digit Recognition Data - zip.train and zip.test in ElemStatLearn package)

ZipTrain.JMP
ZipTest.JMP