Exercise 1
This exercise uses the rain gauge data from the city of Seattle.
rain <- read.csv(url("https://data.seattle.gov/api/views/rdtp-hzy3/rows.csv"))
# The next line of code tells the computer how the date is formatted. This is needed for the graph below. We will also filter just the first two gauges.
rain$Date <- as.Date(rain$Date, format = "%m/%d/%Y")
The code below makes a scatterplot for Rain Gauge 1 and Rain Gauge 2
over time.
ggplot(data=rain)+
geom_point( aes(x=Date, y=RG01), color = "blue")+
geom_point( aes(x=Date, y=RG02), color = "red")+
scale_x_date(date_labels = "%b %Y")+
theme_classic()+
labs(x="Date", y= "Rain Fall Amount")

This graph is hard to read but it looks like the blue and red points
are close to each other on each date. An easier way to see this is to
remove the date and to graph each rain gauge on its own axis. Create
this graph in the code chunk below.
ggplot(data=rain, aes(x=RG02, y=RG01))+
geom_point( color = "blue")+
theme_classic()+
labs(x="RG02", y= "RG01")+
geom_smooth(method = "lm", se = FALSE)
## `geom_smooth()` using formula = 'y ~ x'

Does there seem to be a correlation between the rain gauges? To look
at the best fit line, add the code to the scatterplot to the code for
the regression line below.
geom_smooth(method = "lm", se = FALSE)
## geom_smooth: na.rm = FALSE, orientation = NA, se = FALSE
## stat_smooth: na.rm = FALSE, orientation = NA, se = FALSE, method = lm
## position_identity
To see how close a fit this line is to the data, we can find
Pearson’s r with the code below.
cor(rain$RG01, rain$RG02)
## [1] 0.9619801
What does this tell us? The data is not far from the line. ###
Exercise 2
We will use the Human Freedom Index dataframe which is already loaded
into R Studio. Use the glimpse command to look at the data.
We only want to look at the year 2016 so use the code chunk below to
create a new dataframe by filtering for the year 2016. Call it
hfi_2016.
# filter for 2016
hfi_2016 <- filter(hfi, year==2016)
We want to examine the relationship between the personal freedom
score of a country and the pf_expression_control, or its score out of
10, with 0 being the most, of political pressures and controls on media
content.
To start, make a scatterplot with pf_expression_control as the
explanatory variable and pf_score the response variable.
ggplot(data= hfi_2016, aes(x=pf_expression_control , y=pf_score))+
geom_point()+
theme_classic()

Does there appear to be a linear relationship between these
variables? Yes. Positive Linear relationship.
To measure the strength of this relationship, we look at Pearson’s
coefficient of r by using the code chunk below.
cor(hfi_2016$pf_expression_control, hfi_2016$pf_score)
## [1] 0.8450646
To look at the best fit line, add the code to the scatterplot to the
code for the regression line below.
ggplot(data=hfi_2016, aes(x=pf_expression_control, y=pf_score))+
geom_point()+
theme_classic()+
geom_smooth(method = "lm", se = FALSE)

What does this tell us about the relationship between personal
freedom and political media control? The higher the personal freedom the
higher the media is free to report what they do. This is a positive
linear relationship.
LS0tDQp0aXRsZTogIldvcmtzaGVldCAxOCINCmF1dGhvcjogIkVsZGVuIEh1YmlzeiINCmRhdGU6ICJgciBTeXMuRGF0ZSgpYCINCm91dHB1dDogb3BlbmludHJvOjpsYWJfcmVwb3J0DQotLS0NCg0KYGBge3IgbG9hZC1wYWNrYWdlcywgbWVzc2FnZT1GQUxTRX0NCmxpYnJhcnkodGlkeXZlcnNlKQ0KbGlicmFyeShvcGVuaW50cm8pDQoNCmBgYA0KDQojIyMgRXhlcmNpc2UgMQ0KDQpUaGlzIGV4ZXJjaXNlIHVzZXMgdGhlIHJhaW4gZ2F1Z2UgZGF0YSBmcm9tIHRoZSBjaXR5IG9mIFNlYXR0bGUuDQoNCmBgYHtyIHJhaW59DQoNCnJhaW4gPC0gcmVhZC5jc3YodXJsKCJodHRwczovL2RhdGEuc2VhdHRsZS5nb3YvYXBpL3ZpZXdzL3JkdHAtaHp5My9yb3dzLmNzdiIpKQ0KIyBUaGUgbmV4dCBsaW5lIG9mIGNvZGUgdGVsbHMgdGhlIGNvbXB1dGVyIGhvdyB0aGUgZGF0ZSBpcyBmb3JtYXR0ZWQuIFRoaXMgaXMgbmVlZGVkIGZvciB0aGUgZ3JhcGggYmVsb3cuIFdlIHdpbGwgYWxzbyBmaWx0ZXIganVzdCB0aGUgZmlyc3QgdHdvIGdhdWdlcy4NCnJhaW4kRGF0ZSA8LSBhcy5EYXRlKHJhaW4kRGF0ZSwgZm9ybWF0ID0gIiVtLyVkLyVZIikNCmBgYA0KDQpUaGUgY29kZSBiZWxvdyBtYWtlcyBhIHNjYXR0ZXJwbG90IGZvciBSYWluIEdhdWdlIDEgYW5kIFJhaW4gR2F1Z2UgMiBvdmVyIHRpbWUuDQoNCmBgYHtyLCBncmFwaF9yYWluX2dhdWdlczF9DQoNCmdncGxvdChkYXRhPXJhaW4pKw0KICBnZW9tX3BvaW50KCBhZXMoeD1EYXRlLCB5PVJHMDEpLCBjb2xvciA9ICJibHVlIikrDQogIGdlb21fcG9pbnQoIGFlcyh4PURhdGUsIHk9UkcwMiksIGNvbG9yID0gInJlZCIpKw0KICBzY2FsZV94X2RhdGUoZGF0ZV9sYWJlbHMgPSAiJWIgJVkiKSsNCiAgdGhlbWVfY2xhc3NpYygpKw0KICBsYWJzKHg9IkRhdGUiLCB5PSAiUmFpbiBGYWxsIEFtb3VudCIpDQpgYGANCg0KVGhpcyBncmFwaCBpcyBoYXJkIHRvIHJlYWQgYnV0IGl0IGxvb2tzIGxpa2UgdGhlIGJsdWUgYW5kIHJlZCBwb2ludHMgYXJlIGNsb3NlIHRvIGVhY2ggb3RoZXIgb24gZWFjaCBkYXRlLiBBbiBlYXNpZXIgd2F5IHRvIHNlZSB0aGlzIGlzIHRvIHJlbW92ZSB0aGUgZGF0ZSBhbmQgdG8gZ3JhcGggZWFjaCByYWluIGdhdWdlIG9uIGl0cyBvd24gYXhpcy4gQ3JlYXRlIHRoaXMgZ3JhcGggaW4gdGhlIGNvZGUgY2h1bmsgYmVsb3cuDQoNCmBgYHtyLCBncmFwaF9yYWluX2dhdWdlc30NCg0KZ2dwbG90KGRhdGE9cmFpbiwgYWVzKHg9UkcwMiwgeT1SRzAxKSkrDQogIGdlb21fcG9pbnQoIGNvbG9yID0gImJsdWUiKSsNCiAgdGhlbWVfY2xhc3NpYygpKw0KICBsYWJzKHg9IlJHMDIiLCB5PSAiUkcwMSIpKw0KICBnZW9tX3Ntb290aChtZXRob2QgPSAibG0iLCBzZSA9IEZBTFNFKQ0KIA0KYGBgDQoNCkRvZXMgdGhlcmUgc2VlbSB0byBiZSBhIGNvcnJlbGF0aW9uIGJldHdlZW4gdGhlIHJhaW4gZ2F1Z2VzPyBUbyBsb29rIGF0IHRoZSBiZXN0IGZpdCBsaW5lLCBhZGQgdGhlIGNvZGUgdG8gdGhlIHNjYXR0ZXJwbG90IHRvIHRoZSBjb2RlIGZvciB0aGUgcmVncmVzc2lvbiBsaW5lIGJlbG93Lg0KDQpgYGB7ciBiZXN0LWZpdC1saW5lX3JhaW4sIG1lc3NhZ2UgPSBGQUxTRX0NCiAgZ2VvbV9zbW9vdGgobWV0aG9kID0gImxtIiwgc2UgPSBGQUxTRSkNCg0KYGBgDQoNClRvIHNlZSBob3cgY2xvc2UgYSBmaXQgdGhpcyBsaW5lIGlzIHRvIHRoZSBkYXRhLCB3ZSBjYW4gZmluZCBQZWFyc29uJ3MgciB3aXRoIHRoZSBjb2RlIGJlbG93Lg0KDQpgYGB7ciBjYWxjdWxhdGUtcl9yYWlufQ0KICBjb3IocmFpbiRSRzAxLCByYWluJFJHMDIpDQpgYGANCg0KV2hhdCBkb2VzIHRoaXMgdGVsbCB1cz8gVGhlIGRhdGEgaXMgbm90IGZhciBmcm9tIHRoZSBsaW5lLiBcIyMjIEV4ZXJjaXNlIDINCg0KV2Ugd2lsbCB1c2UgdGhlIEh1bWFuIEZyZWVkb20gSW5kZXggZGF0YWZyYW1lIHdoaWNoIGlzIGFscmVhZHkgbG9hZGVkIGludG8gUiBTdHVkaW8uIFVzZSB0aGUgZ2xpbXBzZSBjb21tYW5kIHRvIGxvb2sgYXQgdGhlIGRhdGEuDQoNCmBgYHtyIEh1bWFuLUZyZWVkb20tSW5kZXh9DQojICBnbGltcHNlKGhmaSkrDQogIA0KYGBgDQoNCldlIG9ubHkgd2FudCB0byBsb29rIGF0IHRoZSB5ZWFyIDIwMTYgc28gdXNlIHRoZSBjb2RlIGNodW5rIGJlbG93IHRvIGNyZWF0ZSBhIG5ldyBkYXRhZnJhbWUgYnkgZmlsdGVyaW5nIGZvciB0aGUgeWVhciAyMDE2LiBDYWxsIGl0IGhmaV8yMDE2Lg0KDQpgYGB7ciBGaWx0ZXJfMjAxNn0NCiMgZmlsdGVyIGZvciAyMDE2IA0KaGZpXzIwMTYgPC0gZmlsdGVyKGhmaSwgeWVhcj09MjAxNikNCmBgYA0KDQpXZSB3YW50IHRvIGV4YW1pbmUgdGhlIHJlbGF0aW9uc2hpcCBiZXR3ZWVuIHRoZSBwZXJzb25hbCBmcmVlZG9tIHNjb3JlIG9mIGEgY291bnRyeSBhbmQgdGhlIHBmX2V4cHJlc3Npb25fY29udHJvbCwgb3IgaXRzIHNjb3JlIG91dCBvZiAxMCwgd2l0aCAwIGJlaW5nIHRoZSBtb3N0LCBvZiBwb2xpdGljYWwgcHJlc3N1cmVzIGFuZCBjb250cm9scyBvbiBtZWRpYSBjb250ZW50Lg0KDQpUbyBzdGFydCwgbWFrZSBhIHNjYXR0ZXJwbG90IHdpdGggcGZfZXhwcmVzc2lvbl9jb250cm9sIGFzIHRoZSBleHBsYW5hdG9yeSB2YXJpYWJsZSBhbmQgcGZfc2NvcmUgdGhlIHJlc3BvbnNlIHZhcmlhYmxlLg0KDQpgYGB7ciBncmFwaCwgbWVzc2FnZSA9IEZBTFNFfQ0KZ2dwbG90KGRhdGE9IGhmaV8yMDE2LCBhZXMoeD1wZl9leHByZXNzaW9uX2NvbnRyb2wgLCB5PXBmX3Njb3JlKSkrDQogIGdlb21fcG9pbnQoKSsNCiAgdGhlbWVfY2xhc3NpYygpDQpgYGANCg0KRG9lcyB0aGVyZSBhcHBlYXIgdG8gYmUgYSBsaW5lYXIgcmVsYXRpb25zaGlwIGJldHdlZW4gdGhlc2UgdmFyaWFibGVzPyBZZXMuIFBvc2l0aXZlIExpbmVhciByZWxhdGlvbnNoaXAuDQoNClRvIG1lYXN1cmUgdGhlIHN0cmVuZ3RoIG9mIHRoaXMgcmVsYXRpb25zaGlwLCB3ZSBsb29rIGF0IFBlYXJzb24ncyBjb2VmZmljaWVudCBvZiByIGJ5IHVzaW5nIHRoZSBjb2RlIGNodW5rIGJlbG93Lg0KDQpgYGB7ciBjYWxjdWxhdGUtcn0NCiAgY29yKGhmaV8yMDE2JHBmX2V4cHJlc3Npb25fY29udHJvbCwgaGZpXzIwMTYkcGZfc2NvcmUpDQpgYGANCg0KVG8gbG9vayBhdCB0aGUgYmVzdCBmaXQgbGluZSwgYWRkIHRoZSBjb2RlIHRvIHRoZSBzY2F0dGVycGxvdCB0byB0aGUgY29kZSBmb3IgdGhlIHJlZ3Jlc3Npb24gbGluZSBiZWxvdy4NCg0KYGBge3IgYmVzdC1maXQtbGluZSwgbWVzc2FnZSA9IEZBTFNFfQ0KDQpnZ3Bsb3QoZGF0YT1oZmlfMjAxNiwgYWVzKHg9cGZfZXhwcmVzc2lvbl9jb250cm9sLCB5PXBmX3Njb3JlKSkrDQogIGdlb21fcG9pbnQoKSsNCiAgdGhlbWVfY2xhc3NpYygpKw0KICBnZW9tX3Ntb290aChtZXRob2QgPSAibG0iLCBzZSA9IEZBTFNFKQ0KDQpgYGANCg0KV2hhdCBkb2VzIHRoaXMgdGVsbCB1cyBhYm91dCB0aGUgcmVsYXRpb25zaGlwIGJldHdlZW4gcGVyc29uYWwgZnJlZWRvbSBhbmQgcG9saXRpY2FsIG1lZGlhIGNvbnRyb2w/IFRoZSBoaWdoZXIgdGhlIHBlcnNvbmFsIGZyZWVkb20gdGhlIGhpZ2hlciB0aGUgbWVkaWEgaXMgZnJlZSB0byByZXBvcnQgd2hhdCB0aGV5IGRvLiBUaGlzIGlzIGEgcG9zaXRpdmUgbGluZWFyIHJlbGF0aW9uc2hpcC4gDQo=