Column

Introduction

The task at hand was to examine the Census data and using that data predict whether the income exceeds 50K per year for new individuals. The predictive models used to achieve the goal are; Random Forest and Naive Bayes. The models predict how many individuals earn over 50K per year.

In the top right, it is the random forest model that shows the confusion matrix of the census data frame, and the bottom right is the confusion matrix for the Naive Bayes model. The accuracy for the Random Forest confusion matrix is higher than of the Naive Bayes model.

Therefore, the bottom left bar graph shows the random forest prediction model on the new individuals data set to predict which of the 10 individuals earn over 50K per year, and which do not.

New Individuals

Column

Random Forest

Naive Bayes