library(tidyverse)
library(openintro)

Exercise 1

What command would you use to extract just the counts of girls baptized?

arbuthnot$girls
##  [1] 4683 4457 4102 4590 4839 4820 4928 4605 4457 4952 4784 5332 5200 4910 4617
## [16] 3997 3919 3395 3536 3181 2746 2722 2840 2908 2959 3179 3349 3382 3289 3013
## [31] 2781 3247 4107 4803 4881 5681 4858 4319 5322 5560 5829 5719 6061 6120 5822
## [46] 5738 5717 5847 6203 6033 6041 6299 6533 6744 7158 7127 7246 7119 7214 7101
## [61] 7167 7302 7392 7316 7483 6647 6713 7229 7767 7626 7452 7061 7514 7656 7683
## [76] 5738 7779 7417 7687 7623 7380 7288

Exercise 2

Is there an apparent trend in the number of girls baptized over the years? How would you describe it?

The plot shows that there is an increasing trend in the number of girls baptized over the years from 4683 baptized girls in the year 1629 to 7288 baptized girls in the year 1710, however in the year 1640, there was a sudden drop in the number of baptized girls, with the number rising again between the years 1660 and 1664.

ggplot(data = arbuthnot, aes(x = year, y = girls)) +
  geom_line()

Exercise 3

Now, generate a plot of the proportion of boys born over time. What do you see?

This plot shows a fluctuating proportion of boys born over time, with the highest ratio of boys born in 1663, and the lowest ratio of boys born in 1703. There is no noticeable upward or downward trend of boys being born.

arbuthnot <- arbuthnot %>%
  mutate(total = boys + girls)
arbuthnot <- arbuthnot %>%
  mutate(boy_ratio = boys / total)
ggplot(data = arbuthnot, aes(x = year, y = boy_ratio)) +
  geom_line()

Exercise 4

What years are included in this data set? 1940 to 2002

What are the dimensions of the data frame? 63 x 3

What are the variable (column) names? year, boys, girls

present
## # A tibble: 63 × 3
##     year    boys   girls
##    <dbl>   <dbl>   <dbl>
##  1  1940 1211684 1148715
##  2  1941 1289734 1223693
##  3  1942 1444365 1364631
##  4  1943 1508959 1427901
##  5  1944 1435301 1359499
##  6  1945 1404587 1330869
##  7  1946 1691220 1597452
##  8  1947 1899876 1800064
##  9  1948 1813852 1721216
## 10  1949 1826352 1733177
## # ℹ 53 more rows

Exercise 5

How do these counts compare to Arbuthnot’s? Are they of a similar magnitude?

The present count for girls and boys show similar upward trends compared with Arbuthnot’s at first, and they both show the similar sudden drop in birth rates for a few years followed by a rise again, however, in the present day counts, the birth rates for girls and boys do not exceed the births from before the sudden drop, whereas in Arbuthnot’s, the birth rates for girls and boys continue their upward trend as if the drop had little effect on their increase of births over the years.

The present birth rates are also on a much larger magnitude compared to Arbuthnot’s counts in the 1600s, with present birth rates reaching up to 4.2 million, whereas Arbuthnot’s birth rates only reached to the 16 thousands.

ggplot(data = present, aes(x = year, y = girls)) +
     geom_line()

ggplot(data = present, aes(x = year, y = boys)) +
     geom_line()

Exercise 6

Make a plot that displays the proportion of boys born over time. What do you see? Does Arbuthnot’s observation about boys being born in greater proportion than girls hold up in the U.S.?

There is a downward trend in the proportion of boys being born in the United States, however, the ratio of boys born over time stays above 0.51, which suggests that Arbuthnot’s observation about boys being born in greater proportion than girls does hold up in the present day U.S., despite its downward trend.

present <- present %>%
  mutate(total = boys + girls)
present <- present %>%
  mutate(boy_ratio = boys / total)
ggplot(data = present, aes(x = year, y = boy_ratio)) +
  geom_line()

Exercise 7

We saw the most total number of births in the U.S. in the year 1961 with a total of 4268326 births.

present <- present %>%
  mutate(total = boys + girls)
present %>%
  arrange(desc(total))
## # A tibble: 63 × 5
##     year    boys   girls   total boy_ratio
##    <dbl>   <dbl>   <dbl>   <dbl>     <dbl>
##  1  1961 2186274 2082052 4268326     0.512
##  2  1960 2179708 2078142 4257850     0.512
##  3  1957 2179960 2074824 4254784     0.512
##  4  1959 2173638 2071158 4244796     0.512
##  5  1958 2152546 2051266 4203812     0.512
##  6  1962 2132466 2034896 4167362     0.512
##  7  1956 2133588 2029502 4163090     0.513
##  8  1990 2129495 2028717 4158212     0.512
##  9  1991 2101518 2009389 4110907     0.511
## 10  1963 2101632 1996388 4098020     0.513
## # ℹ 53 more rows
LS0tCnRpdGxlOiAiTGFiIDE6IEludHJvIHRvIFIiCmF1dGhvcjogIkNhcm9saW5hIEFjZXJvIgpkYXRlOiAiYHIgU3lzLkRhdGUoKWAiCm91dHB1dDogb3BlbmludHJvOjpsYWJfcmVwb3J0Ci0tLQoKYGBge3IgbG9hZC1wYWNrYWdlcywgbWVzc2FnZT1GQUxTRX0KbGlicmFyeSh0aWR5dmVyc2UpCmxpYnJhcnkob3BlbmludHJvKQpgYGAKCiMjIyBFeGVyY2lzZSAxCgpXaGF0IGNvbW1hbmQgd291bGQgeW91IHVzZSB0byBleHRyYWN0IGp1c3QgdGhlIGNvdW50cyBvZiBnaXJscyBiYXB0aXplZD8gCgpgYGB7ciB2aWV3LWdpcmxzLWNvdW50c30KYXJidXRobm90JGdpcmxzCmBgYAoKCiMjIyBFeGVyY2lzZSAyCgpJcyB0aGVyZSBhbiBhcHBhcmVudCB0cmVuZCBpbiB0aGUgbnVtYmVyIG9mIGdpcmxzIGJhcHRpemVkIG92ZXIgdGhlIHllYXJzPyBIb3cgd291bGQgeW91IGRlc2NyaWJlIGl0PyAKClRoZSBwbG90IHNob3dzIHRoYXQgdGhlcmUgaXMgYW4gaW5jcmVhc2luZyB0cmVuZCBpbiB0aGUgbnVtYmVyIG9mIGdpcmxzIGJhcHRpemVkIG92ZXIgdGhlIHllYXJzIGZyb20gNDY4MyBiYXB0aXplZCBnaXJscyBpbiB0aGUgeWVhciAxNjI5IHRvIDcyODggYmFwdGl6ZWQgZ2lybHMgaW4gdGhlIHllYXIgMTcxMCwgaG93ZXZlciBpbiB0aGUgeWVhciAxNjQwLCB0aGVyZSB3YXMgYSBzdWRkZW4gZHJvcCBpbiB0aGUgbnVtYmVyIG9mIGJhcHRpemVkIGdpcmxzLCB3aXRoIHRoZSBudW1iZXIgcmlzaW5nIGFnYWluIGJldHdlZW4gdGhlIHllYXJzIDE2NjAgYW5kIDE2NjQuCgpgYGB7ciB0cmVuZC1naXJsc30KZ2dwbG90KGRhdGEgPSBhcmJ1dGhub3QsIGFlcyh4ID0geWVhciwgeSA9IGdpcmxzKSkgKwogIGdlb21fbGluZSgpCmBgYAoKCiMjIyBFeGVyY2lzZSAzCgpOb3csIGdlbmVyYXRlIGEgcGxvdCBvZiB0aGUgcHJvcG9ydGlvbiBvZiBib3lzIGJvcm4gb3ZlciB0aW1lLiBXaGF0IGRvIHlvdSBzZWU/CgpUaGlzIHBsb3Qgc2hvd3MgYSBmbHVjdHVhdGluZyBwcm9wb3J0aW9uIG9mIGJveXMgYm9ybiBvdmVyIHRpbWUsIHdpdGggdGhlIGhpZ2hlc3QgcmF0aW8gb2YgYm95cyBib3JuIGluIDE2NjMsIGFuZCB0aGUgbG93ZXN0IHJhdGlvIG9mIGJveXMgYm9ybiBpbiAxNzAzLiBUaGVyZSBpcyBubyBub3RpY2VhYmxlIHVwd2FyZCBvciBkb3dud2FyZCB0cmVuZCBvZiBib3lzIGJlaW5nIGJvcm4uCgpgYGB7ciBwbG90LXByb3AtYm95cy1hcmJ1dGhub3R9CmFyYnV0aG5vdCA8LSBhcmJ1dGhub3QgJT4lCiAgbXV0YXRlKHRvdGFsID0gYm95cyArIGdpcmxzKQphcmJ1dGhub3QgPC0gYXJidXRobm90ICU+JQogIG11dGF0ZShib3lfcmF0aW8gPSBib3lzIC8gdG90YWwpCmdncGxvdChkYXRhID0gYXJidXRobm90LCBhZXMoeCA9IHllYXIsIHkgPSBib3lfcmF0aW8pKSArCiAgZ2VvbV9saW5lKCkKYGBgCgoKIyMjIEV4ZXJjaXNlIDQKCldoYXQgeWVhcnMgYXJlIGluY2x1ZGVkIGluIHRoaXMgZGF0YSBzZXQ/IAoxOTQwIHRvIDIwMDIKCldoYXQgYXJlIHRoZSBkaW1lbnNpb25zIG9mIHRoZSBkYXRhIGZyYW1lPyAKNjMgeCAzCgpXaGF0IGFyZSB0aGUgdmFyaWFibGUgKGNvbHVtbikgbmFtZXM/CnllYXIsIGJveXMsIGdpcmxzCgoKYGBge3IgZGltLXByZXNlbnR9CnByZXNlbnQKYGBgCgoKIyMjIEV4ZXJjaXNlIDUKCkhvdyBkbyB0aGVzZSBjb3VudHMgY29tcGFyZSB0byBBcmJ1dGhub3TigJlzPyBBcmUgdGhleSBvZiBhIHNpbWlsYXIgbWFnbml0dWRlPwoKVGhlIHByZXNlbnQgY291bnQgZm9yIGdpcmxzIGFuZCBib3lzIHNob3cgc2ltaWxhciB1cHdhcmQgdHJlbmRzIGNvbXBhcmVkIHdpdGggQXJidXRobm90J3MgYXQgZmlyc3QsIGFuZCB0aGV5IGJvdGggc2hvdyB0aGUgc2ltaWxhciBzdWRkZW4gZHJvcCBpbiBiaXJ0aCByYXRlcyBmb3IgYSBmZXcgeWVhcnMgZm9sbG93ZWQgYnkgYSByaXNlIGFnYWluLCBob3dldmVyLCBpbiB0aGUgcHJlc2VudCBkYXkgY291bnRzLCB0aGUgYmlydGggcmF0ZXMgZm9yIGdpcmxzIGFuZCBib3lzIGRvIG5vdCBleGNlZWQgdGhlIGJpcnRocyBmcm9tIGJlZm9yZSB0aGUgc3VkZGVuIGRyb3AsIHdoZXJlYXMgaW4gQXJidXRobm904oCZcywgdGhlIGJpcnRoIHJhdGVzIGZvciBnaXJscyBhbmQgYm95cyBjb250aW51ZSB0aGVpciB1cHdhcmQgdHJlbmQgYXMgaWYgdGhlIGRyb3AgaGFkIGxpdHRsZSBlZmZlY3Qgb24gdGhlaXIgaW5jcmVhc2Ugb2YgYmlydGhzIG92ZXIgdGhlIHllYXJzLgoKVGhlIHByZXNlbnQgYmlydGggcmF0ZXMgYXJlIGFsc28gb24gYSBtdWNoIGxhcmdlciBtYWduaXR1ZGUgY29tcGFyZWQgdG8gQXJidXRobm904oCZcyBjb3VudHMgaW4gdGhlIDE2MDBzLCB3aXRoIHByZXNlbnQgYmlydGggcmF0ZXMgcmVhY2hpbmcgdXAgdG8gNC4yIG1pbGxpb24sIHdoZXJlYXMgQXJidXRobm904oCZcyBiaXJ0aCAgcmF0ZXMgb25seSByZWFjaGVkIHRvIHRoZSAxNiB0aG91c2FuZHMuCgpgYGB7ciBjb3VudC1jb21wYXJlfQpnZ3Bsb3QoZGF0YSA9IHByZXNlbnQsIGFlcyh4ID0geWVhciwgeSA9IGdpcmxzKSkgKwogICAgIGdlb21fbGluZSgpCmdncGxvdChkYXRhID0gcHJlc2VudCwgYWVzKHggPSB5ZWFyLCB5ID0gYm95cykpICsKICAgICBnZW9tX2xpbmUoKQpgYGAKCgojIyMgRXhlcmNpc2UgNgoKTWFrZSBhIHBsb3QgdGhhdCBkaXNwbGF5cyB0aGUgcHJvcG9ydGlvbiBvZiBib3lzIGJvcm4gb3ZlciB0aW1lLiBXaGF0IGRvIHlvdSBzZWU/IERvZXMgQXJidXRobm904oCZcyBvYnNlcnZhdGlvbiBhYm91dCBib3lzIGJlaW5nIGJvcm4gaW4gZ3JlYXRlciBwcm9wb3J0aW9uIHRoYW4gZ2lybHMgaG9sZCB1cCBpbiB0aGUgVS5TLj8gCgpUaGVyZSBpcyBhIGRvd253YXJkIHRyZW5kIGluIHRoZSBwcm9wb3J0aW9uIG9mIGJveXMgYmVpbmcgYm9ybiBpbiB0aGUgVW5pdGVkIFN0YXRlcywgaG93ZXZlciwgdGhlIHJhdGlvIG9mIGJveXMgYm9ybiBvdmVyIHRpbWUgc3RheXMgYWJvdmUgMC41MSwgd2hpY2ggc3VnZ2VzdHMgdGhhdCBBcmJ1dGhub3TigJlzIG9ic2VydmF0aW9uIGFib3V0IGJveXMgYmVpbmcgYm9ybiBpbiBncmVhdGVyIHByb3BvcnRpb24gdGhhbiBnaXJscyBkb2VzIGhvbGQgdXAgaW4gdGhlIHByZXNlbnQgZGF5IFUuUy4sIGRlc3BpdGUgaXRzIGRvd253YXJkIHRyZW5kLgoKCmBgYHtyIHBsb3QtcHJvcC1ib3lzLXByZXNlbnR9CnByZXNlbnQgPC0gcHJlc2VudCAlPiUKICBtdXRhdGUodG90YWwgPSBib3lzICsgZ2lybHMpCnByZXNlbnQgPC0gcHJlc2VudCAlPiUKICBtdXRhdGUoYm95X3JhdGlvID0gYm95cyAvIHRvdGFsKQpnZ3Bsb3QoZGF0YSA9IHByZXNlbnQsIGFlcyh4ID0geWVhciwgeSA9IGJveV9yYXRpbykpICsKICBnZW9tX2xpbmUoKQoKYGBgCgoKIyMjIEV4ZXJjaXNlIDcKCldlIHNhdyB0aGUgbW9zdCB0b3RhbCBudW1iZXIgb2YgYmlydGhzIGluIHRoZSBVLlMuIGluIHRoZSB5ZWFyIDE5NjEgd2l0aCBhIHRvdGFsIG9mIDQyNjgzMjYgYmlydGhzLgpgYGB7ciBmaW5kLW1heC10b3RhbH0KcHJlc2VudCA8LSBwcmVzZW50ICU+JQogIG11dGF0ZSh0b3RhbCA9IGJveXMgKyBnaXJscykKcHJlc2VudCAlPiUKICBhcnJhbmdlKGRlc2ModG90YWwpKQpgYGAKCg==