Introduction

The data we are using is trying to predict the sales price of a house (in thousands). The model created shows that the year a remodel was added, the size of the ground floor living area, the number of cars the garage can hold, and the overall quality of the house contributes to the sales price of the home.

Data Description

Method

My method was to look at how different variables affected sales price. I converted sales price into thousands to make a smaller number that was easier for me to follow. I thought quality would be a good factor, so I chose that as my categorical variable, and decided to make it more concise by showing low quality vs high quality. I chose to do this because it was similar to what we did in class when looking a different grades. I tried to use variables that did not correlate much with each other but at least had some sort of correlation with sales price in thousands. I made a ggcorrplot to see these numbers and how they correlated with sales price.

Key Findings