library(tidyverse)
library(openintro)

Exercise 1

The calculations below were done in the documentation for this lab. Note that all calculations occur between three ` marks that open and close the “code chuck”

2 + 2
## [1] 4
2^127 -1
## [1] 1.701412e+38
sqrt(9)
## [1] 3
log(10, 10)
## [1] 1

These are my calculations with code chunk

3*6
## [1] 18
sqrt(98)
## [1] 9.899495
8/4
## [1] 2
5-5
## [1] 0

This is the data set of Cars 93

?cars93
## starting httpd help server ... done
head(cars93)
## # A tibble: 6 × 6
##   type    price mpg_city drive_train passengers weight
##   <fct>   <dbl>    <int> <fct>            <int>  <int>
## 1 small    15.9       25 front                5   2705
## 2 midsize  33.9       18 front                5   3560
## 3 midsize  37.7       19 front                6   3405
## 4 midsize  30         22 rear                 4   3640
## 5 midsize  15.7       22 front                6   2880
## 6 large    20.8       19 front                6   3470

Exercise 2

We’re going to analyze the cars93 data set.

The data set has a handful of variables. How many? List them below along with their classification (eg numerical, categorical, discrete, continuous, etc.). The first one is included for you.

  1. type, regular categorical

  2. price, continuous numerical

  3. mpg_city, continuous numerical

  4. drive_train, regular categorical

  5. passengers, discrete numerical

  6. weight, continuous numerical

Answers 2

There are six variables within this data set. The variables are type, price, mpg city, drive train, and weight. Following that order to classify the variable are regular categorical, numerical Now determine the number of observations in this data set, using the code chunk below: There are 54 observations within the data set.;;

unique(cars93$type)
## [1] small   midsize large  
## Levels: large midsize small
summary(cars93$type)
##   large midsize   small 
##      11      22      21
summary(cars93$type)/length(cars93$type)
##     large   midsize     small 
## 0.2037037 0.4074074 0.3888889

Are observations rows or columns?

Answers 2.5

Observations are rows

Exercise 3

Still using the cars93 data set, answer the following questions:

  • What types of drive trains do the cars in this data set have?
  • How many cars in this data set are rear wheel drive?
  • What proportion of cars in the data set are 4WD.
summary(cars93$drive_train)
##   4WD front  rear 
##     2    43     9
summary(cars93$drive_train)/length(cars93$drive_train)
##        4WD      front       rear 
## 0.03703704 0.79629630 0.16666667

Answers 3

There are rear, front wheel, and four wheel drive trains within the cars of the data set. The amount of cars in the data set that are rear wheel drive is nine cars. The proportion of 4WD cars in the data set is 2/54.

Exercise 4

p <- ggplot(cars93, aes(x=drive_train)) + geom_bar(stat="count", fill="darkgreen")
p

Price Analysis of Cars

mean(cars93$price)
## [1] 19.99259
## [1] 19.99259
sd(cars93$price)
## [1] 11.50645
## [1] 11.50645
summary(cars93$price)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    7.40   10.95   17.25   19.99   26.25   61.90

Exercise 5

Compare the mean and median weight of cars in cars93. Is there a difference between these measures of center? If so, why do you think that is the case?

p <- ggplot(cars93, aes(x=weight)) + geom_histogram(bins=17, fill="darkgreen")
p

mean(cars93$weight)
## [1] 3037.407
sd(cars93$weight)
## [1] 657.6643
summary(cars93$weight)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    1695    2452    3198    3037    3522    4105

Answers 5

There is a difference in measures of center. The median is 3,198 ponds while the mean is 3,037 pounds. One possible explanation for the difference in measures of center is an outlier, which can especially skew the mean in a small data set.

Exercise 6 Construct a histogram to visualize the distribution of the weight variable.

p <- ggplot(cars93, aes(x=weight)) + geom_histogram(bins=14, fill="darkgreen")
p

Describe the shape of this distribution here.

Answers 6

The shape of the distribution could be classified as multimodal.

Exercise 7

Is there any evidence of a correlation between the price of vehicle and it’s fuel economy? Construct a scatter plot to answer this question.

p <- ggplot(data=cars93, aes(x = price, y = mpg_city)) + geom_point(color="darkgreen")
p

  • If there is a relationship, try to describe it here.
  • cars93 gives information from cars from the year 1993. Do you think the relationship between price and mpg_city would be similar for cars from 2022? Write your answer here.

Answers 7

The data displays a negative correlation between price and miles per gallon. As price increases, miles per gallon decreases. Based on current knowledge of price of today’s cars and their miles per gallon one could conclude that high valued cars have a decreased miles per gallon. This reflects the data displayed in the cars93 data set.

Exercise 8

Describe, in your own words, what a violin plot shows. You may want to look this up; feel free to simply search “violin plot” on the web and see what you find. Write your answer here.

Answers 8

A violin plot consists of a box and whisker plot with reflective modal distribution through the middle of the box and whisker. The wider the flares are on the “violin” the greater the frequency of that event.

LS0tDQp0aXRsZTogIkFjdGl2aXR5IDE6IEludHJvIHRvIFJTdHVkaW8iDQphdXRob3I6ICJEeWxhbiBCYWtlciINCmRhdGU6ICIwOS8xMi8yMDIyIg0Kb3V0cHV0OiBvcGVuaW50cm86OmxhYl9yZXBvcnQNCi0tLQ0KDQpgYGB7ciBsb2FkLXBhY2thZ2VzLCBtZXNzYWdlPUZBTFNFLCB3YXJuaW5nPUZBTFNFfQ0KbGlicmFyeSh0aWR5dmVyc2UpDQpsaWJyYXJ5KG9wZW5pbnRybykNCmBgYA0KDQojIyMgRXhlcmNpc2UgMQ0KDQpUaGUgY2FsY3VsYXRpb25zIGJlbG93IHdlcmUgZG9uZSBpbiB0aGUgZG9jdW1lbnRhdGlvbiBmb3IgdGhpcyBsYWIuIE5vdGUgdGhhdCBhbGwgY2FsY3VsYXRpb25zIG9jY3VyIGJldHdlZW4gdGhyZWUgXGAgbWFya3MgdGhhdCBvcGVuIGFuZCBjbG9zZSB0aGUgImNvZGUgY2h1Y2siDQoNCmBgYHtyIFRoYW5rcyBCcmlhaH0NCjIgKyAyDQoyXjEyNyAtMQ0Kc3FydCg5KQ0KbG9nKDEwLCAxMCkNCmBgYA0KDQpUaGVzZSBhcmUgbXkgY2FsY3VsYXRpb25zIHdpdGggY29kZSBjaHVuaw0KDQpgYGB7ciB3aWdnbGV9DQozKjYNCnNxcnQoOTgpDQo4LzQNCjUtNQ0KYGBgDQoNClRoaXMgaXMgdGhlIGRhdGEgc2V0IG9mIENhcnMgOTMNCg0KYGBge3IgQ2Fyc30NCj9jYXJzOTMNCmhlYWQoY2FyczkzKQ0KYGBgDQoNCiMjIyBFeGVyY2lzZSAyDQoNCldlJ3JlIGdvaW5nIHRvIGFuYWx5emUgdGhlIGBjYXJzOTNgIGRhdGEgc2V0Lg0KDQpUaGUgZGF0YSBzZXQgaGFzIGEgaGFuZGZ1bCBvZiB2YXJpYWJsZXMuIEhvdyBtYW55PyBMaXN0IHRoZW0gYmVsb3cgYWxvbmcgd2l0aCB0aGVpciBjbGFzc2lmaWNhdGlvbiAoZWcgbnVtZXJpY2FsLCBjYXRlZ29yaWNhbCwgZGlzY3JldGUsIGNvbnRpbnVvdXMsIGV0Yy4pLiBUaGUgZmlyc3Qgb25lIGlzIGluY2x1ZGVkIGZvciB5b3UuDQoNCjEuICBgdHlwZWAsIHJlZ3VsYXIgY2F0ZWdvcmljYWwNCg0KMi4gIGBwcmljZWAsIGNvbnRpbnVvdXMgbnVtZXJpY2FsDQoNCjMuICBgbXBnX2NpdHlgLCBjb250aW51b3VzIG51bWVyaWNhbA0KDQo0LiAgYGRyaXZlX3RyYWluYCwgcmVndWxhciBjYXRlZ29yaWNhbA0KDQo1LiAgYHBhc3NlbmdlcnNgLCBkaXNjcmV0ZSBudW1lcmljYWwNCg0KNi4gIGB3ZWlnaHRgLCBjb250aW51b3VzIG51bWVyaWNhbA0KDQojIyMjIEFuc3dlcnMgMg0KDQpUaGVyZSBhcmUgc2l4IHZhcmlhYmxlcyB3aXRoaW4gdGhpcyBkYXRhIHNldC4gVGhlIHZhcmlhYmxlcyBhcmUgdHlwZSwgcHJpY2UsIG1wZyBjaXR5LCBkcml2ZSB0cmFpbiwgYW5kIHdlaWdodC4gRm9sbG93aW5nIHRoYXQgb3JkZXIgdG8gY2xhc3NpZnkgdGhlIHZhcmlhYmxlIGFyZSByZWd1bGFyIGNhdGVnb3JpY2FsLCBudW1lcmljYWwgTm93IGRldGVybWluZSB0aGUgbnVtYmVyIG9mIG9ic2VydmF0aW9ucyBpbiB0aGlzIGRhdGEgc2V0LCB1c2luZyB0aGUgY29kZSBjaHVuayBiZWxvdzogVGhlcmUgYXJlIDU0IG9ic2VydmF0aW9ucyB3aXRoaW4gdGhlIGRhdGEgc2V0Ljs7DQoNCmBgYHtyIGRpbX0NCnVuaXF1ZShjYXJzOTMkdHlwZSkNCnN1bW1hcnkoY2FyczkzJHR5cGUpDQpzdW1tYXJ5KGNhcnM5MyR0eXBlKS9sZW5ndGgoY2FyczkzJHR5cGUpDQpgYGANCg0KQXJlIG9ic2VydmF0aW9ucyByb3dzIG9yIGNvbHVtbnM/DQoNCiMjIyMgQW5zd2VycyAyLjUNCg0KT2JzZXJ2YXRpb25zIGFyZSByb3dzDQoNCiMjIyBFeGVyY2lzZSAzDQoNClN0aWxsIHVzaW5nIHRoZSBgY2FyczkzYCBkYXRhIHNldCwgYW5zd2VyIHRoZSBmb2xsb3dpbmcgcXVlc3Rpb25zOg0KDQotICAgV2hhdCB0eXBlcyBvZiBkcml2ZSB0cmFpbnMgZG8gdGhlIGNhcnMgaW4gdGhpcyBkYXRhIHNldCBoYXZlPw0KLSAgIEhvdyBtYW55IGNhcnMgaW4gdGhpcyBkYXRhIHNldCBhcmUgcmVhciB3aGVlbCBkcml2ZT8NCi0gICBXaGF0IHByb3BvcnRpb24gb2YgY2FycyBpbiB0aGUgZGF0YSBzZXQgYXJlIDRXRC4NCg0KYGBge3IgRHJpdmUgVHJhaW59DQpzdW1tYXJ5KGNhcnM5MyRkcml2ZV90cmFpbikNCnN1bW1hcnkoY2FyczkzJGRyaXZlX3RyYWluKS9sZW5ndGgoY2FyczkzJGRyaXZlX3RyYWluKQ0KYGBgDQoNCiMjIyMgQW5zd2VycyAzDQoNClRoZXJlIGFyZSByZWFyLCBmcm9udCB3aGVlbCwgYW5kIGZvdXIgd2hlZWwgZHJpdmUgdHJhaW5zIHdpdGhpbiB0aGUgY2FycyBvZiB0aGUgZGF0YSBzZXQuIFRoZSBhbW91bnQgb2YgY2FycyBpbiB0aGUgZGF0YSBzZXQgdGhhdCBhcmUgcmVhciB3aGVlbCBkcml2ZSBpcyBuaW5lIGNhcnMuIFRoZSBwcm9wb3J0aW9uIG9mIDRXRCBjYXJzIGluIHRoZSBkYXRhIHNldCBpcyAyLzU0Lg0KDQojIyMgRXhlcmNpc2UgNA0KDQpgYGB7ciBEcml2ZS1UcmFpbiBCYXIgQ2hhcnR9DQpwIDwtIGdncGxvdChjYXJzOTMsIGFlcyh4PWRyaXZlX3RyYWluKSkgKyBnZW9tX2JhcihzdGF0PSJjb3VudCIsIGZpbGw9ImRhcmtncmVlbiIpDQpwDQpgYGANCg0KUHJpY2UgQW5hbHlzaXMgb2YgQ2Fycw0KDQpgYGB7ciBQcmljZSBBbmFseXNpc30NCm1lYW4oY2FyczkzJHByaWNlKQ0KIyMgWzFdIDE5Ljk5MjU5DQpzZChjYXJzOTMkcHJpY2UpDQojIyBbMV0gMTEuNTA2NDUNCnN1bW1hcnkoY2FyczkzJHByaWNlKQ0KYGBgDQoNCiMjIyBFeGVyY2lzZSA1DQoNCkNvbXBhcmUgdGhlIG1lYW4gYW5kIG1lZGlhbiB3ZWlnaHQgb2YgY2FycyBpbiBgY2FyczkzYC4gSXMgdGhlcmUgYSBkaWZmZXJlbmNlIGJldHdlZW4gdGhlc2UgbWVhc3VyZXMgb2YgY2VudGVyPyBJZiBzbywgd2h5IGRvIHlvdSB0aGluayB0aGF0IGlzIHRoZSBjYXNlPw0KDQpgYGB7ciB3ZWlnaHR9DQpwIDwtIGdncGxvdChjYXJzOTMsIGFlcyh4PXdlaWdodCkpICsgZ2VvbV9oaXN0b2dyYW0oYmlucz0xNywgZmlsbD0iZGFya2dyZWVuIikNCnANCm1lYW4oY2FyczkzJHdlaWdodCkNCnNkKGNhcnM5MyR3ZWlnaHQpDQpzdW1tYXJ5KGNhcnM5MyR3ZWlnaHQpDQpgYGANCg0KIyMjIyBBbnN3ZXJzIDUNCg0KVGhlcmUgaXMgYSBkaWZmZXJlbmNlIGluIG1lYXN1cmVzIG9mIGNlbnRlci4gVGhlIG1lZGlhbiBpcyAzLDE5OCBwb25kcyB3aGlsZSB0aGUgbWVhbiBpcyAzLDAzNyBwb3VuZHMuIE9uZSBwb3NzaWJsZSBleHBsYW5hdGlvbiBmb3IgdGhlIGRpZmZlcmVuY2UgaW4gbWVhc3VyZXMgb2YgY2VudGVyIGlzIGFuIG91dGxpZXIsIHdoaWNoIGNhbiBlc3BlY2lhbGx5IHNrZXcgdGhlIG1lYW4gaW4gYSBzbWFsbCBkYXRhIHNldC4NCg0KIyMjIEV4ZXJjaXNlIDYgQ29uc3RydWN0IGEgaGlzdG9ncmFtIHRvIHZpc3VhbGl6ZSB0aGUgZGlzdHJpYnV0aW9uIG9mIHRoZSBgd2VpZ2h0YCB2YXJpYWJsZS4NCg0KYGBge3IgV2VpZ2h0IERpc3RyaWJ1dGlvbn0NCnAgPC0gZ2dwbG90KGNhcnM5MywgYWVzKHg9d2VpZ2h0KSkgKyBnZW9tX2hpc3RvZ3JhbShiaW5zPTE0LCBmaWxsPSJkYXJrZ3JlZW4iKQ0KcA0KYGBgDQoNCkRlc2NyaWJlIHRoZSBzaGFwZSBvZiB0aGlzIGRpc3RyaWJ1dGlvbiBoZXJlLg0KDQojIyMjIEFuc3dlcnMgNg0KDQpUaGUgc2hhcGUgb2YgdGhlIGRpc3RyaWJ1dGlvbiBjb3VsZCBiZSBjbGFzc2lmaWVkIGFzIG11bHRpbW9kYWwuDQoNCiMjIyBFeGVyY2lzZSA3DQoNCklzIHRoZXJlIGFueSBldmlkZW5jZSBvZiBhIGNvcnJlbGF0aW9uIGJldHdlZW4gdGhlIHByaWNlIG9mIHZlaGljbGUgYW5kIGl0J3MgZnVlbCBlY29ub215PyBDb25zdHJ1Y3QgYSBzY2F0dGVyIHBsb3QgdG8gYW5zd2VyIHRoaXMgcXVlc3Rpb24uDQoNCmBgYHtyIFByaWNlIGFuZCBGdWVsIEVjb25vbXkgQ29ycmVsYXRpb259DQpwIDwtIGdncGxvdChkYXRhPWNhcnM5MywgYWVzKHggPSBwcmljZSwgeSA9IG1wZ19jaXR5KSkgKyBnZW9tX3BvaW50KGNvbG9yPSJkYXJrZ3JlZW4iKQ0KcA0KYGBgDQoNCi0gICBJZiB0aGVyZSBpcyBhIHJlbGF0aW9uc2hpcCwgdHJ5IHRvIGRlc2NyaWJlIGl0IGhlcmUuDQotICAgYGNhcnM5M2AgZ2l2ZXMgaW5mb3JtYXRpb24gZnJvbSBjYXJzIGZyb20gdGhlIHllYXIgMTk5My4gRG8geW91IHRoaW5rIHRoZSByZWxhdGlvbnNoaXAgYmV0d2VlbiBgcHJpY2VgIGFuZCBgbXBnX2NpdHlgIHdvdWxkIGJlIHNpbWlsYXIgZm9yIGNhcnMgZnJvbSAyMDIyPyBXcml0ZSB5b3VyIGFuc3dlciBoZXJlLg0KDQojIyMjIEFuc3dlcnMgNw0KDQpUaGUgZGF0YSBkaXNwbGF5cyBhIG5lZ2F0aXZlIGNvcnJlbGF0aW9uIGJldHdlZW4gcHJpY2UgYW5kIG1pbGVzIHBlciBnYWxsb24uIEFzIHByaWNlIGluY3JlYXNlcywgbWlsZXMgcGVyIGdhbGxvbiBkZWNyZWFzZXMuIEJhc2VkIG9uIGN1cnJlbnQga25vd2xlZGdlIG9mIHByaWNlIG9mIHRvZGF5J3MgY2FycyBhbmQgdGhlaXIgbWlsZXMgcGVyIGdhbGxvbiBvbmUgY291bGQgY29uY2x1ZGUgdGhhdCBoaWdoIHZhbHVlZCBjYXJzIGhhdmUgYSBkZWNyZWFzZWQgbWlsZXMgcGVyIGdhbGxvbi4gVGhpcyByZWZsZWN0cyB0aGUgZGF0YSBkaXNwbGF5ZWQgaW4gdGhlIGNhcnM5MyBkYXRhIHNldC4NCg0KIyMjIEV4ZXJjaXNlIDgNCg0KRGVzY3JpYmUsIGluIHlvdXIgb3duIHdvcmRzLCB3aGF0IGEgdmlvbGluIHBsb3Qgc2hvd3MuIFlvdSBtYXkgd2FudCB0byBsb29rIHRoaXMgdXA7IGZlZWwgZnJlZSB0byBzaW1wbHkgc2VhcmNoICJ2aW9saW4gcGxvdCIgb24gdGhlIHdlYiBhbmQgc2VlIHdoYXQgeW91IGZpbmQuIFdyaXRlIHlvdXIgYW5zd2VyIGhlcmUuDQoNCiMjIyMgQW5zd2VycyA4DQoNCkEgdmlvbGluIHBsb3QgY29uc2lzdHMgb2YgYSBib3ggYW5kIHdoaXNrZXIgcGxvdCB3aXRoIHJlZmxlY3RpdmUgbW9kYWwgZGlzdHJpYnV0aW9uIHRocm91Z2ggdGhlIG1pZGRsZSBvZiB0aGUgYm94IGFuZCB3aGlza2VyLiBUaGUgd2lkZXIgdGhlIGZsYXJlcyBhcmUgb24gdGhlICJ2aW9saW4iIHRoZSBncmVhdGVyIHRoZSBmcmVxdWVuY3kgb2YgdGhhdCBldmVudC4NCg==