Name

  1. The data set “Price” contains data on over 400 housing transactions in Jacksonville. The city is separated into five distinct wards that describe the location of the house. Price measures the sales price of the house, Sqft measures the size of the house in square feet, and quality rates the condition of the house.
  a. Describe the entities on which the data are collected for this study. Does this represent a population or a sample of houses in Jacksonville?
  b. Characterize each variable in the data set as categorical (qualitative) or quantitative. If the variable is categorical, determine if it is nominal or ordinal. If the variable is quantitative, determine if it is discrete or continuous.
Ward     Ward_Count   Percent
Ward1    42           9.6
Ward2    106          24.1
Ward3    28           6.4
Ward4    171          39.0
Ward5    92           21.0

The wards are not represented equally in the sample. Ward 4 is the most represented, with 39.0% of the houses, and Ward 3 is the least represented, with 6.4%.
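
The percent column in the Ward table can be reproduced from the counts alone. The following is a minimal sketch in Python, which is an assumption since the assignment does not name a software package. The dictionary `ward_counts` and the helper `percent_table` are names introduced here for illustration, and the counts are copied from the table.

```python
# Counts copied from the Ward table above.
ward_counts = {"Ward1": 42, "Ward2": 106, "Ward3": 28, "Ward4": 171, "Ward5": 92}


def percent_table(counts):
    """Return {category: percent of total}, rounded to one decimal place."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


ward_percents = percent_table(ward_counts)
for ward, pct in ward_percents.items():
    print(f"{ward} {ward_counts[ward]} {pct}")
```

The counts sum to 439 houses, which agrees with the description of "over 400 housing transactions".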

  d. Create a bar plot for the variable Ward, expressed as percents, and include it in this document. Add appropriate labels to the chart.
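
One way to draw this chart is with matplotlib, which is an assumption here because the assignment does not say which tool to use. The sketch below hard-codes the percents from the Ward table. The function name `plot_ward_percents` and the file name `ward_percent.png` are illustrative choices, not part of the assignment.

```python
# Percents copied from the Ward table.
ward_percents = {"Ward1": 9.6, "Ward2": 24.1, "Ward3": 6.4, "Ward4": 39.0, "Ward5": 21.0}


def plot_ward_percents(percents, path="ward_percent.png"):
    """Draw a labeled bar plot of percent by ward and save it to `path`."""
    # Imported inside the function so the data above can be inspected
    # even where matplotlib is not installed.
    import matplotlib
    matplotlib.use("Agg")  # render to a file; no display needed
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.bar(list(percents.keys()), list(percents.values()))
    ax.set_xlabel("Ward")
    ax.set_ylabel("Percent of houses")
    ax.set_title("Percent of sampled houses by ward")
    fig.savefig(path)
    plt.close(fig)
    return path
```

Calling `plot_ward_percents(ward_percents)` writes the image file, which can then be pasted into the document.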

e. Create a percent frequency table for the variable quality. How would you describe the overall quality of homes in Jacksonville?

According to the table below, 38.2% of the houses are rated 4 or 5, and 64.2% are rated 3 or better. This suggests that a large percentage of houses are rated highly.

Quality   quality_Count   Percent
1         48              10.9
2         109             24.8
3         114             26.0
4         109             24.8
5         59              13.4
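
The "rated 4 or 5" and "rated 3 or better" figures are cumulative percents, and they can be checked from the counts in the quality table. The sketch below is one way to do that in Python. The helper name `percent_rated_at_least` is hypothetical. Working from the raw counts rather than adding the rounded percents can shift the last digit slightly.

```python
# Counts copied from the quality table above.
quality_counts = {1: 48, 2: 109, 3: 114, 4: 109, 5: 59}


def percent_rated_at_least(counts, rating):
    """Percent of houses whose quality rating is `rating` or higher."""
    total = sum(counts.values())
    at_least = sum(n for r, n in counts.items() if r >= rating)
    return 100 * at_least / total


for rating in (4, 3):
    pct = percent_rated_at_least(quality_counts, rating)
    print(f"rated {rating} or higher: {pct:.1f}%")
```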

  f. Create a histogram for the variable Price. Describe the distribution of sales prices in Jacksonville.

The histogram is right-skewed: a few high-priced houses stretch the right tail of the distribution.
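
A quick numeric check for right skew is to compare the mean with the median, since a few large values pull the mean upward. The sketch below illustrates the idea with a short list of made-up prices, because the actual Price data are not reproduced in this document. The helper `text_histogram` and the bin width are assumptions chosen for illustration.

```python
from collections import Counter
from statistics import mean, median

# Made-up sale prices in dollars, for illustration only; NOT the Jacksonville data.
prices = [95_000, 110_000, 120_000, 125_000, 130_000, 140_000,
          150_000, 165_000, 180_000, 240_000, 390_000]


def text_histogram(values, bin_width):
    """Count values per bin; each key is the lower edge of its bin."""
    bins = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(bins.items()))


hist = text_histogram(prices, 50_000)
for lower, count in hist.items():
    print(f"{lower:>7,}-{lower + 50_000 - 1:>7,} {'#' * count}")

# In a right-skewed sample the mean typically exceeds the median.
print("mean > median:", mean(prices) > median(prices))
```

With the real data, the same comparison could be run on the Price column to support what the histogram shows.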