library(tidyverse)
library(reshape)

Introduction

For this assignment I used the dataset behind the FiveThirtyEight article “How Baby Boomers get high” (https://fivethirtyeight.com/features/how-baby-boomers-get-high/). The article analyzes drug use (including alcohol) across age groups and focuses on baby boomers, defined there as people aged 50-64. Its general finding was that boomers use drugs at lower rates than younger generations, but at higher rates than their parents did.

Sub-setting of data

Just for fun, in this exercise I will compare my age group (50-64) with my daughter’s age group (21 years old). Let’s see what I find.

Data Manipulation Steps

Load the data directly from the FiveThirtyEight GitHub repository. In this case we will use the drug-use-by-age dataset.

drug_use <- read_csv("https://raw.githubusercontent.com/fivethirtyeight/data/master/drug-use-by-age/drug-use-by-age.csv")

## Rows: 17 Columns: 28

## -- Column specification --------------------------------------------------------
## Delimiter: ","
## chr  (7): age, cocaine-frequency, crack-frequency, heroin-frequency, inhalan...
## dbl (21): n, alcohol-use, alcohol-frequency, marijuana-use, marijuana-freque...

## 
## i Use `spec()` to retrieve the full column specification for this data.
## i Specify the column types or set `show_col_types = FALSE` to quiet this message.
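The `chr (7)` line in the output above shows that several frequency columns were read as text. One likely cause is a non-numeric placeholder for missing values. The sketch below shows how `read_csv()` can be told about such a placeholder and given explicit column types, which also silences the column-specification message. The inline CSV uses made-up toy rows, and the `"-"` marker is an assumption, not something confirmed in this report.

```r
library(readr)

# Toy stand-in for the real file: made-up numbers, and "-" as an ASSUMED
# missing-value marker in a frequency column.
csv_text <- "age,n,alcohol-use,crack-use,crack-frequency
21,100,83.2,0.5,5.0
50-64,100,67.2,0.4,62.0
65+,100,49.3,0.0,-
"

sample_use <- read_csv(
  I(csv_text),                      # I() marks the string as literal data, not a path
  na = c("", "NA", "-"),            # treat "-" as missing so the column can parse as numeric
  col_types = cols(
    age = col_character(),          # age groups such as "50-64" must stay text
    .default = col_double()         # every other column is read as a number
  )
)

sample_use
```

With explicit `col_types`, the frequency columns would stay numeric if they were needed later, with `NA` in place of the placeholder.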

Now we will remove the columns we don’t need. We will focus on the usage percentages, so the frequency columns are dropped.

drug_use <- drug_use %>% select(-contains("frequency"))

Now we will keep only the two age groups of interest: mine (50-64) and my daughter’s (21).

drug_use <- drug_use %>% filter(age=="50-64" | age=="21")
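When more than one or two groups are kept, the same filter may read more easily with `%in%`. This is a minimal sketch on a toy tibble; the `ages` and `keep` names and the placeholder values are illustrative only.

```r
library(dplyr)

# Toy data: age labels in the style of the dataset, with placeholder values.
ages <- tibble(
  age = c("20", "21", "22-23", "50-64", "65+"),
  `alcohol-use` = c(1, 2, 3, 4, 5)
)

keep <- c("21", "50-64")            # the age groups to compare

subset_ages <- ages %>%
  filter(age %in% keep)             # equivalent to age == "21" | age == "50-64"

subset_ages
```

Adding a third group then only requires extending `keep`.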

Let’s remove one more column we don’t need (n).

drug_use <- drug_use %>% select(-c(n))

Let’s transpose for ease of viewing and save the result to another data frame.

drug_use2 <- as_tibble(cbind(drug = names(drug_use), t(drug_use)))

## Warning: The `x` argument of `as_tibble.matrix()` must have unique column names if `.name_repair` is omitted as of tibble 2.0.0.
## Using compatibility `.name_repair`.
## This warning is displayed once every 8 hours.
## Call `lifecycle::last_lifecycle_warnings()` to see where this warning was generated.
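The warning appears because `cbind()` builds a matrix whose transposed columns have no names, and because the matrix includes the text `age` column, every value becomes character. An alternative that may avoid both issues is to reshape with `pivot_longer()` followed by `pivot_wider()`. This is a sketch on a small tibble using values from the results table in this report; the `wide` and `transposed` names are illustrative.

```r
library(dplyr)
library(tidyr)

# Two age groups as rows, drugs as columns (same shape as drug_use at this step).
wide <- tibble(
  age = c("21", "50-64"),
  `alcohol-use` = c(83.2, 67.2),
  `marijuana-use` = c(33, 7.3)
)

transposed <- wide %>%
  # one row per (age, drug) pair
  pivot_longer(-age, names_to = "drug", values_to = "pct") %>%
  # one column per age group; the values stay numeric throughout
  pivot_wider(names_from = age, values_from = pct, names_prefix = "Group_")

transposed
```

The resulting columns are named after the age groups, so there is no `age` row to drop and no numeric conversion to redo.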

Remove first row (age)

drug_use2 <-  filter(drug_use2,drug != 'age')

Convert the value columns to numeric (the transpose turned them into character)

drug_use2$V2 <- as.numeric(drug_use2$V2)
drug_use2$V3 <- as.numeric(drug_use2$V3)

Let’s sort by the 21-year-old group’s usage and give the columns descriptive names

drug_use2 <- arrange(drug_use2,desc(V2)) %>%
  dplyr::rename(Group_21_YO = V2, Group_50_64_YO =V3)
drug_use2

## # A tibble: 13 x 3
##    drug              Group_21_YO Group_50_64_YO
##    <chr>                   <dbl>          <dbl>
##  1 alcohol-use              83.2           67.2
##  2 marijuana-use            33              7.3
##  3 pain-releiver-use         9              2.5
##  4 hallucinogen-use          6.3            0.3
##  5 cocaine-use               4.8            0.9
##  6 stimulant-use             4.1            0.3
##  7 tranquilizer-use          3.9            1.4
##  8 inhalant-use              1.4            0.2
##  9 oxycontin-use             1.3            0.4
## 10 heroin-use                0.6            0.1
## 11 meth-use                  0.6            0.2
## 12 crack-use                 0.5            0.4
## 13 sedative-use              0.3            0.2

Graph them, but first convert to long format from wide format

data_long <- gather(drug_use2, age_group, drug_use_pct, Group_21_YO:Group_50_64_YO, factor_key=TRUE)

ggplot(data_long, # the data that I am using
       aes(x = drug, # 'aesthetic' includes x
           y = drug_use_pct, fill=age_group)) + # and y
  geom_bar(position="dodge", stat = "identity") + # use ACTUAL y for bar height
  coord_flip()
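`gather()` still works, but tidyr now documents it as superseded by `pivot_longer()`, and `geom_col()` is a shorthand for `geom_bar(stat = "identity")`. The sketch below redoes the reshape and plot in that style on three rows taken from the table above. The `usage` and `usage_long` names and the bar ordering via `reorder()` are illustrative choices, not part of the original analysis.

```r
library(dplyr)
library(tidyr)
library(ggplot2)

# Three rows from the results table, in the same wide layout as drug_use2.
usage <- tibble(
  drug = c("alcohol-use", "marijuana-use", "cocaine-use"),
  Group_21_YO = c(83.2, 33, 4.8),
  Group_50_64_YO = c(67.2, 7.3, 0.9)
)

# Wide to long: one row per (drug, age group) pair.
usage_long <- usage %>%
  pivot_longer(
    c(Group_21_YO, Group_50_64_YO),
    names_to = "age_group",
    values_to = "drug_use_pct"
  )

# Dodged horizontal bars, with drugs ordered by their mean usage.
p <- ggplot(
  usage_long,
  aes(x = reorder(drug, drug_use_pct), y = drug_use_pct, fill = age_group)
) +
  geom_col(position = "dodge") +    # bar height taken directly from drug_use_pct
  coord_flip() +
  labs(x = "drug", y = "drug_use_pct (%)")

p
```

Ordering the bars by value can make the gap between the two groups easier to scan than alphabetical order.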

Conclusions

From the article

I would recommend repeating the study, since it has been about 10 years since it came out. One of the findings was the trend of baby boomers consuming more drugs than their parents. After 10 years we could repeat it and see whether the new cohort of 50-64-year-olds has continued the increasing trend.

From my comparison to my daughter’s age group

It was evident that my age group uses drugs far less than 21-year-olds. In fact, apart from alcohol and marijuana, my age group shows little use of other drugs compared with 21-year-olds. The article mentioned that use among younger people is more about getting high, whereas older people who use drugs tend to do so to cope with stress and other underlying issues.
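One way to put numbers on this comparison is to compute the gap between the two groups, both in percentage points and as a ratio. The sketch below uses four rows copied from the results table; the `diff_pts` and `ratio` column names are hypothetical additions.

```r
library(dplyr)

# Four rows copied from the results table (drug names as spelled in the dataset).
usage <- tibble(
  drug = c("alcohol-use", "marijuana-use", "pain-releiver-use", "hallucinogen-use"),
  Group_21_YO = c(83.2, 33, 9, 6.3),
  Group_50_64_YO = c(67.2, 7.3, 2.5, 0.3)
)

gaps <- usage %>%
  mutate(
    diff_pts = Group_21_YO - Group_50_64_YO,   # absolute gap in percentage points
    ratio = Group_21_YO / Group_50_64_YO       # how many times higher the 21-year-old rate is
  ) %>%
  arrange(desc(ratio))                          # largest relative gap first

gaps
```

The two measures can rank drugs differently: alcohol has the largest gap in percentage points among these rows, while hallucinogens have the largest ratio.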

…


DATA 607 Assignment 1

Juan Falck

2022-02-05