Case study 1: Using the births data set part I
On our spreadsheet for today, go to the tab titled “BIRTHS”. In 2004, the state of North Carolina released to the public a large data set containing information on births recorded in this state. This data set has been of interest to medical researchers who are studying the relation between habits and practices of expectant mothers and the birth of their children. This is a random sample of 1,000 cases from this data set.
- Use google sheets to create a histogram of the variable mage which stands for “mother’s age”. Place this histogram in a separate tab and name it “HIST - mAGE”.
- Look at the histogram you just created which is a visual summary of the distribution of mother’s ages in the data set. When do most mothers seem to have children?
- Again using the histogram describing the distribution of mother’s age, do you happen to notice any outliers in the distribution? What are they? What do you think is going on with those observations?
- If you were to describe the shape of the distribution, how would you describe it?
- Using the first 10 observations within mage create a stem an leaf plot depicting the distribution of mage
Case study 3: Using the cars data set part I
On our spreadsheet for today, go to the tab titled “CARS”. This is a data matrix with 54 rows and 6 columns. The columns represent the variables type, price, mpgCity, driveTrain, passengers, weight for a sample of 54 cars from 1993.
- Use google sheets to create a histogram of the variable mpgCity which stands for “Miles per gallon, city”. Place this histogram in a separate tab and name it “HIST - MPG”.
- Look at the histogram you just created which is a visual summary of the distribution of the car’s MPG in the data set. What sort of MPG does it seem like most cars get in the distribution?
- Again using the histogram describing the distribution of car MPG, do you happen to notice any outliers in the distribution? What are they? What do you think is going on with those observations?
- If you were to describe the shape of the distribution, how would you describe it?
- Using the first 10 observations within mpgCity create a stem an leaf plot depicting the distribution of mpgCity