library(tidyverse)
library(openintro)

Exercise 1

This exercise uses the rain gauge data from the city of Seattle.

rain <- read.csv(url("https://data.seattle.gov/api/views/rdtp-hzy3/rows.csv"))
# The next line of code tells the computer how the date is formatted. This is needed for the graph below. We will also filter just the first two gauges.
rain$Date <- as.Date(rain$Date, format = "%m/%d/%Y")

The code below makes a scatterplot for Rain Gauge 1 and Rain Gauge 2 over time.

ggplot(data=rain)+
  geom_point( aes(x=Date, y=RG01), color = "blue")+
  geom_point( aes(x=Date, y=RG02), color = "red")+
  scale_x_date(date_labels = "%b %Y")+
  theme_classic()+
  labs(x="Date", y= "Rain Fall Amount")

This graph is hard to read but it looks like the blue and red points are close to each other on each date. An easier way to see this is to remove the date and to graph each rain gauge on its own axis. Create this graph in the code chunk below.

ggplot(data=rain, aes(x=RG02, y=RG01))+
  geom_point( color = "blue")+
  theme_classic()+
  labs(x="RG02", y= "RG01")+
  geom_smooth(method = "lm", se = FALSE)
## `geom_smooth()` using formula = 'y ~ x'

Does there seem to be a correlation between the rain gauges? To look at the best fit line, add the code to the scatterplot to the code for the regression line below.

  geom_smooth(method = "lm", se = FALSE)
## geom_smooth: na.rm = FALSE, orientation = NA, se = FALSE
## stat_smooth: na.rm = FALSE, orientation = NA, se = FALSE, method = lm
## position_identity

To see how close a fit this line is to the data, we can find Pearson’s r with the code below.

  cor(rain$RG01, rain$RG02)
## [1] 0.9619801

What does this tell us? The data is not far from the line. ### Exercise 2

We will use the Human Freedom Index dataframe which is already loaded into R Studio. Use the glimpse command to look at the data.

#  glimpse(hfi)+

We only want to look at the year 2016 so use the code chunk below to create a new dataframe by filtering for the year 2016. Call it hfi_2016.

# filter for 2016 
hfi_2016 <- filter(hfi, year==2016)

We want to examine the relationship between the personal freedom score of a country and the pf_expression_control, or its score out of 10, with 0 being the most, of political pressures and controls on media content.

To start, make a scatterplot with pf_expression_control as the explanatory variable and pf_score the response variable.

ggplot(data= hfi_2016, aes(x=pf_expression_control , y=pf_score))+
  geom_point()+
  theme_classic()

Does there appear to be a linear relationship between these variables? Yes. Positive Linear relationship.

To measure the strength of this relationship, we look at Pearson’s coefficient of r by using the code chunk below.

  cor(hfi_2016$pf_expression_control, hfi_2016$pf_score)
## [1] 0.8450646

To look at the best fit line, add the code to the scatterplot to the code for the regression line below.

ggplot(data=hfi_2016, aes(x=pf_expression_control, y=pf_score))+
  geom_point()+
  theme_classic()+
  geom_smooth(method = "lm", se = FALSE)

What does this tell us about the relationship between personal freedom and political media control? The higher the personal freedom the higher the media is free to report what they do. This is a positive linear relationship.

LS0tDQp0aXRsZTogIldvcmtzaGVldCAxOCINCmF1dGhvcjogIkVsZGVuIEh1YmlzeiINCmRhdGU6ICJgciBTeXMuRGF0ZSgpYCINCm91dHB1dDogb3BlbmludHJvOjpsYWJfcmVwb3J0DQotLS0NCg0KYGBge3IgbG9hZC1wYWNrYWdlcywgbWVzc2FnZT1GQUxTRX0NCmxpYnJhcnkodGlkeXZlcnNlKQ0KbGlicmFyeShvcGVuaW50cm8pDQoNCmBgYA0KDQojIyMgRXhlcmNpc2UgMQ0KDQpUaGlzIGV4ZXJjaXNlIHVzZXMgdGhlIHJhaW4gZ2F1Z2UgZGF0YSBmcm9tIHRoZSBjaXR5IG9mIFNlYXR0bGUuDQoNCmBgYHtyIHJhaW59DQoNCnJhaW4gPC0gcmVhZC5jc3YodXJsKCJodHRwczovL2RhdGEuc2VhdHRsZS5nb3YvYXBpL3ZpZXdzL3JkdHAtaHp5My9yb3dzLmNzdiIpKQ0KIyBUaGUgbmV4dCBsaW5lIG9mIGNvZGUgdGVsbHMgdGhlIGNvbXB1dGVyIGhvdyB0aGUgZGF0ZSBpcyBmb3JtYXR0ZWQuIFRoaXMgaXMgbmVlZGVkIGZvciB0aGUgZ3JhcGggYmVsb3cuIFdlIHdpbGwgYWxzbyBmaWx0ZXIganVzdCB0aGUgZmlyc3QgdHdvIGdhdWdlcy4NCnJhaW4kRGF0ZSA8LSBhcy5EYXRlKHJhaW4kRGF0ZSwgZm9ybWF0ID0gIiVtLyVkLyVZIikNCmBgYA0KDQpUaGUgY29kZSBiZWxvdyBtYWtlcyBhIHNjYXR0ZXJwbG90IGZvciBSYWluIEdhdWdlIDEgYW5kIFJhaW4gR2F1Z2UgMiBvdmVyIHRpbWUuDQoNCmBgYHtyLCBncmFwaF9yYWluX2dhdWdlczF9DQoNCmdncGxvdChkYXRhPXJhaW4pKw0KICBnZW9tX3BvaW50KCBhZXMoeD1EYXRlLCB5PVJHMDEpLCBjb2xvciA9ICJibHVlIikrDQogIGdlb21fcG9pbnQoIGFlcyh4PURhdGUsIHk9UkcwMiksIGNvbG9yID0gInJlZCIpKw0KICBzY2FsZV94X2RhdGUoZGF0ZV9sYWJlbHMgPSAiJWIgJVkiKSsNCiAgdGhlbWVfY2xhc3NpYygpKw0KICBsYWJzKHg9IkRhdGUiLCB5PSAiUmFpbiBGYWxsIEFtb3VudCIpDQpgYGANCg0KVGhpcyBncmFwaCBpcyBoYXJkIHRvIHJlYWQgYnV0IGl0IGxvb2tzIGxpa2UgdGhlIGJsdWUgYW5kIHJlZCBwb2ludHMgYXJlIGNsb3NlIHRvIGVhY2ggb3RoZXIgb24gZWFjaCBkYXRlLiBBbiBlYXNpZXIgd2F5IHRvIHNlZSB0aGlzIGlzIHRvIHJlbW92ZSB0aGUgZGF0ZSBhbmQgdG8gZ3JhcGggZWFjaCByYWluIGdhdWdlIG9uIGl0cyBvd24gYXhpcy4gQ3JlYXRlIHRoaXMgZ3JhcGggaW4gdGhlIGNvZGUgY2h1bmsgYmVsb3cuDQoNCmBgYHtyLCBncmFwaF9yYWluX2dhdWdlc30NCg0KZ2dwbG90KGRhdGE9cmFpbiwgYWVzKHg9UkcwMiwgeT1SRzAxKSkrDQogIGdlb21fcG9pbnQoIGNvbG9yID0gImJsdWUiKSsNCiAgdGhlbWVfY2xhc3NpYygpKw0KICBsYWJzKHg9IlJHMDIiLCB5PSAiUkcwMSIpKw0KICBnZW9tX3Ntb290aChtZXRob2QgPSAibG0iLCBzZSA9IEZBTFNFKQ0KIA0KYGBgDQoNCkRvZXMgdGhlcmUgc2VlbSB0byBiZSBhIGNvcnJlbGF0aW9uIGJldHdlZW4gdGhlIHJhaW4gZ2F1Z2VzPyBUbyBsb29rIGF0IHRoZSBiZXN0IGZpdCBsaW5lLCBhZGQgdGhlIGNvZGUgdG8gdGhlIHNjYXR0ZXJwbG90IHRvIHRoZSBjb2RlIGZvciB0aGUgcmVncmVzc2lvbiBsaW5lIGJlbG93Lg0KDQpgYGB7ciBiZXN0LWZpdC1saW5lX3JhaW4sIG1lc3NhZ2UgPSBGQUxTRX0NCiAgZ2VvbV9zbW9vdGgobWV0aG9kID0gImxtIiwgc2UgPSBGQUxTRSkNCg0KYGBgDQoNClRvIHNlZSBob3cgY2xvc2UgYSBmaXQgdGhpcyBsaW5lIGlzIHRvIHRoZSBkYXRhLCB3ZSBjYW4gZmluZCBQZWFyc29uJ3MgciB3aXRoIHRoZSBjb2RlIGJlbG93Lg0KDQpgYGB7ciBjYWxjdWxhdGUtcl9yYWlufQ0KICBjb3IocmFpbiRSRzAxLCByYWluJFJHMDIpDQpgYGANCg0KV2hhdCBkb2VzIHRoaXMgdGVsbCB1cz8gVGhlIGRhdGEgaXMgbm90IGZhciBmcm9tIHRoZSBsaW5lLiBcIyMjIEV4ZXJjaXNlIDINCg0KV2Ugd2lsbCB1c2UgdGhlIEh1bWFuIEZyZWVkb20gSW5kZXggZGF0YWZyYW1lIHdoaWNoIGlzIGFscmVhZHkgbG9hZGVkIGludG8gUiBTdHVkaW8uIFVzZSB0aGUgZ2xpbXBzZSBjb21tYW5kIHRvIGxvb2sgYXQgdGhlIGRhdGEuDQoNCmBgYHtyIEh1bWFuLUZyZWVkb20tSW5kZXh9DQojICBnbGltcHNlKGhmaSkrDQogIA0KYGBgDQoNCldlIG9ubHkgd2FudCB0byBsb29rIGF0IHRoZSB5ZWFyIDIwMTYgc28gdXNlIHRoZSBjb2RlIGNodW5rIGJlbG93IHRvIGNyZWF0ZSBhIG5ldyBkYXRhZnJhbWUgYnkgZmlsdGVyaW5nIGZvciB0aGUgeWVhciAyMDE2LiBDYWxsIGl0IGhmaV8yMDE2Lg0KDQpgYGB7ciBGaWx0ZXJfMjAxNn0NCiMgZmlsdGVyIGZvciAyMDE2IA0KaGZpXzIwMTYgPC0gZmlsdGVyKGhmaSwgeWVhcj09MjAxNikNCmBgYA0KDQpXZSB3YW50IHRvIGV4YW1pbmUgdGhlIHJlbGF0aW9uc2hpcCBiZXR3ZWVuIHRoZSBwZXJzb25hbCBmcmVlZG9tIHNjb3JlIG9mIGEgY291bnRyeSBhbmQgdGhlIHBmX2V4cHJlc3Npb25fY29udHJvbCwgb3IgaXRzIHNjb3JlIG91dCBvZiAxMCwgd2l0aCAwIGJlaW5nIHRoZSBtb3N0LCBvZiBwb2xpdGljYWwgcHJlc3N1cmVzIGFuZCBjb250cm9scyBvbiBtZWRpYSBjb250ZW50Lg0KDQpUbyBzdGFydCwgbWFrZSBhIHNjYXR0ZXJwbG90IHdpdGggcGZfZXhwcmVzc2lvbl9jb250cm9sIGFzIHRoZSBleHBsYW5hdG9yeSB2YXJpYWJsZSBhbmQgcGZfc2NvcmUgdGhlIHJlc3BvbnNlIHZhcmlhYmxlLg0KDQpgYGB7ciBncmFwaCwgbWVzc2FnZSA9IEZBTFNFfQ0KZ2dwbG90KGRhdGE9IGhmaV8yMDE2LCBhZXMoeD1wZl9leHByZXNzaW9uX2NvbnRyb2wgLCB5PXBmX3Njb3JlKSkrDQogIGdlb21fcG9pbnQoKSsNCiAgdGhlbWVfY2xhc3NpYygpDQpgYGANCg0KRG9lcyB0aGVyZSBhcHBlYXIgdG8gYmUgYSBsaW5lYXIgcmVsYXRpb25zaGlwIGJldHdlZW4gdGhlc2UgdmFyaWFibGVzPyBZZXMuIFBvc2l0aXZlIExpbmVhciByZWxhdGlvbnNoaXAuDQoNClRvIG1lYXN1cmUgdGhlIHN0cmVuZ3RoIG9mIHRoaXMgcmVsYXRpb25zaGlwLCB3ZSBsb29rIGF0IFBlYXJzb24ncyBjb2VmZmljaWVudCBvZiByIGJ5IHVzaW5nIHRoZSBjb2RlIGNodW5rIGJlbG93Lg0KDQpgYGB7ciBjYWxjdWxhdGUtcn0NCiAgY29yKGhmaV8yMDE2JHBmX2V4cHJlc3Npb25fY29udHJvbCwgaGZpXzIwMTYkcGZfc2NvcmUpDQpgYGANCg0KVG8gbG9vayBhdCB0aGUgYmVzdCBmaXQgbGluZSwgYWRkIHRoZSBjb2RlIHRvIHRoZSBzY2F0dGVycGxvdCB0byB0aGUgY29kZSBmb3IgdGhlIHJlZ3Jlc3Npb24gbGluZSBiZWxvdy4NCg0KYGBge3IgYmVzdC1maXQtbGluZSwgbWVzc2FnZSA9IEZBTFNFfQ0KDQpnZ3Bsb3QoZGF0YT1oZmlfMjAxNiwgYWVzKHg9cGZfZXhwcmVzc2lvbl9jb250cm9sLCB5PXBmX3Njb3JlKSkrDQogIGdlb21fcG9pbnQoKSsNCiAgdGhlbWVfY2xhc3NpYygpKw0KICBnZW9tX3Ntb290aChtZXRob2QgPSAibG0iLCBzZSA9IEZBTFNFKQ0KDQpgYGANCg0KV2hhdCBkb2VzIHRoaXMgdGVsbCB1cyBhYm91dCB0aGUgcmVsYXRpb25zaGlwIGJldHdlZW4gcGVyc29uYWwgZnJlZWRvbSBhbmQgcG9saXRpY2FsIG1lZGlhIGNvbnRyb2w/IFRoZSBoaWdoZXIgdGhlIHBlcnNvbmFsIGZyZWVkb20gdGhlIGhpZ2hlciB0aGUgbWVkaWEgaXMgZnJlZSB0byByZXBvcnQgd2hhdCB0aGV5IGRvLiBUaGlzIGlzIGEgcG9zaXRpdmUgbGluZWFyIHJlbGF0aW9uc2hpcC4gDQo=