IS 607 - Project 2:
The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. Your task is to:
(1) Choose any three of the “wide” datasets identified in the Week 6 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For each of the three chosen datasets:
- Create a .CSV file (or optionally, a MySQL database!) that includes all of the information included in the dataset. You’re encouraged to use a “wide” structure similar to how the information appears in the discussion item, so that you can practice tidying and transformations as described below.
- Read the information from your .CSV file into R, and use tidyr and dplyr as needed to tidy and transform your data. [Most of your grade will be based on this step!]
- Perform the analysis requested in the discussion item.
- Your code should be in an R Markdown file, posted to rpubs.com, and should include narrative descriptions of your data cleanup work, analysis, and conclusions.
(2) Please include in your homework submission, for each of the three chosen datasets:
- The URL to the .Rmd file in your GitHub repository, and
- The URL for your rpubs.com web page.
I have taken the dataset drug-use-by-age and prepared my .csv file.
library(knitr)
library(stringr)
library(tidyr)
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
library(ggplot2)
drug_usage <- read.csv("https://raw.githubusercontent.com/Riteshlohiya/Data607-Week6/master/drug_usage.csv", sep=",")
head(drug_usage)
## age n alcohol.use alcohol.frequency marijuana.use marijuana.frequency
## 1 12 2798 3.9 3 1.1 4
## 2 13 2757 8.5 6 3.4 15
## 3 14 2792 18.1 5 8.7 24
## 4 15 2956 29.2 6 14.5 25
## 5 16 3058 40.1 10 22.5 30
## 6 17 3038 49.3 13 28.0 36
## cocaine.use cocaine.frequency crack.use crack.frequency heroin.use
## 1 0.1 5 0.0 - 0.1
## 2 0.1 1 0.0 3 0.0
## 3 0.1 5.5 0.0 - 0.1
## 4 0.5 4 0.1 9.5 0.2
## 5 1.0 7 0.0 1 0.1
## 6 2.0 5 0.1 21 0.1
## heroin.frequency hallucinogen.use hallucinogen.frequency inhalant.use
## 1 35.5 0.2 52 1.6
## 2 - 0.6 6 2.5
## 3 2 1.6 3 2.6
## 4 1 2.1 4 2.5
## 5 66.5 3.4 3 3.0
## 6 64 4.8 3 2.0
## inhalant.frequency pain.releiver.use pain.releiver.frequency
## 1 19 2.0 36
## 2 12 2.4 14
## 3 5 3.9 12
## 4 5.5 5.5 10
## 5 3 6.2 7
## 6 4 8.5 9
## oxycontin.use oxycontin.frequency tranquilizer.use
## 1 0.1 24.5 0.2
## 2 0.1 41 0.3
## 3 0.4 4.5 0.9
## 4 0.8 3 2.0
## 5 1.1 4 2.4
## 6 1.4 6 3.5
## tranquilizer.frequency stimulant.use stimulant.frequency meth.use
## 1 52.0 0.2 2.0 0.0
## 2 25.5 0.3 4.0 0.1
## 3 5.0 0.8 12.0 0.1
## 4 4.5 1.5 6.0 0.3
## 5 11.0 1.8 9.5 0.3
## 6 7.0 2.8 9.0 0.6
## meth.frequency sedative.use sedative.frequency
## 1 - 0.2 13.0
## 2 5 0.1 19.0
## 3 24 0.2 16.5
## 4 10.5 0.4 30.0
## 5 36 0.2 3.0
## 6 48 0.5 6.5
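Several frequency columns contain "-" where no value was recorded, which is why they print left-aligned like text rather than as numbers. As a minimal sketch of one alternative (not what this report does), read.csv() can be told to treat a lone dash as missing at load time. The inline CSV text and the name sample_usage are illustrative assumptions; the values are copied from the first three rows shown above.

```r
# Toy CSV text mimicking three columns of the real file; "-" marks a
# missing frequency. The values come from the first three rows printed above.
csv_text <- "age,crack.use,crack.frequency
12,0.0,-
13,0.0,3
14,0.0,-"

# na.strings = "-" makes read.csv() parse a lone dash as NA, so the
# frequency column comes in as a number instead of character/factor.
sample_usage <- read.csv(text = csv_text, na.strings = "-")
str(sample_usage)
```

With this option the later numeric conversion would not be needed, at the cost of the dashes no longer being visible in the printed tables.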
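Rename the pain.releiver columns to "pain releiver" (the spelling comes from the source data) so that every column name contains exactly one period. This lets separate() split each name cleanly on the period later: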
drug_usage <- drug_usage %>%
rename("pain releiver.use" = pain.releiver.use,
"pain releiver.frequency" = pain.releiver.frequency)
drug_usage
## age n alcohol.use alcohol.frequency marijuana.use
## 1 12 2798 3.9 3 1.1
## 2 13 2757 8.5 6 3.4
## 3 14 2792 18.1 5 8.7
## 4 15 2956 29.2 6 14.5
## 5 16 3058 40.1 10 22.5
## 6 17 3038 49.3 13 28.0
## 7 18 2469 58.7 24 33.7
## 8 19 2223 64.6 36 33.4
## 9 20 2271 69.7 48 34.0
## 10 21 2354 83.2 52 33.0
## 11 22-23 4707 84.2 52 28.4
## 12 24-25 4591 83.1 52 24.9
## 13 26-29 2628 80.7 52 20.8
## 14 30-34 2864 77.5 52 16.4
## 15 35-49 7391 75.0 52 10.4
## 16 50-64 3923 67.2 52 7.3
## 17 65+ 2448 49.3 52 1.2
## marijuana.frequency cocaine.use cocaine.frequency crack.use
## 1 4 0.1 5 0.0
## 2 15 0.1 1 0.0
## 3 24 0.1 5.5 0.0
## 4 25 0.5 4 0.1
## 5 30 1.0 7 0.0
## 6 36 2.0 5 0.1
## 7 52 3.2 5 0.4
## 8 60 4.1 5.5 0.5
## 9 60 4.9 8 0.6
## 10 52 4.8 5 0.5
## 11 52 4.5 5 0.5
## 12 60 4.0 6 0.5
## 13 52 3.2 5 0.4
## 14 72 2.1 8 0.5
## 15 48 1.5 15 0.5
## 16 52 0.9 36 0.4
## 17 36 0.0 - 0.0
## crack.frequency heroin.use heroin.frequency hallucinogen.use
## 1 - 0.1 35.5 0.2
## 2 3 0.0 - 0.6
## 3 - 0.1 2 1.6
## 4 9.5 0.2 1 2.1
## 5 1 0.1 66.5 3.4
## 6 21 0.1 64 4.8
## 7 10 0.4 46 7.0
## 8 2 0.5 180 8.6
## 9 5 0.9 45 7.4
## 10 17 0.6 30 6.3
## 11 5 1.1 57.5 5.2
## 12 6 0.7 88 4.5
## 13 6 0.6 50 3.2
## 14 15 0.4 66 1.8
## 15 48 0.1 280 0.6
## 16 62 0.1 41 0.3
## 17 - 0.0 120 0.1
## hallucinogen.frequency inhalant.use inhalant.frequency
## 1 52 1.6 19
## 2 6 2.5 12
## 3 3 2.6 5
## 4 4 2.5 5.5
## 5 3 3.0 3
## 6 3 2.0 4
## 7 4 1.8 4
## 8 3 1.4 3
## 9 2 1.5 4
## 10 4 1.4 2
## 11 3 1.0 4
## 12 2 0.8 2
## 13 3 0.6 4
## 14 2 0.4 3.5
## 15 3 0.3 10
## 16 44 0.2 13.5
## 17 2 0.0 -
## pain releiver.use pain releiver.frequency oxycontin.use
## 1 2.0 36 0.1
## 2 2.4 14 0.1
## 3 3.9 12 0.4
## 4 5.5 10 0.8
## 5 6.2 7 1.1
## 6 8.5 9 1.4
## 7 9.2 12 1.7
## 8 9.4 12 1.5
## 9 10.0 10 1.7
## 10 9.0 15 1.3
## 11 10.0 15 1.7
## 12 9.0 15 1.3
## 13 8.3 13 1.2
## 14 5.9 22 0.9
## 15 4.2 12 0.3
## 16 2.5 12 0.4
## 17 0.6 24 0.0
## oxycontin.frequency tranquilizer.use tranquilizer.frequency
## 1 24.5 0.2 52.0
## 2 41 0.3 25.5
## 3 4.5 0.9 5.0
## 4 3 2.0 4.5
## 5 4 2.4 11.0
## 6 6 3.5 7.0
## 7 7 4.9 12.0
## 8 7.5 4.2 4.5
## 9 12 5.4 10.0
## 10 13.5 3.9 7.0
## 11 17.5 4.4 12.0
## 12 20 4.3 10.0
## 13 13.5 4.2 10.0
## 14 46 3.6 8.0
## 15 12 1.9 6.0
## 16 5 1.4 10.0
## 17 - 0.2 5.0
## stimulant.use stimulant.frequency meth.use meth.frequency sedative.use
## 1 0.2 2.0 0.0 - 0.2
## 2 0.3 4.0 0.1 5 0.1
## 3 0.8 12.0 0.1 24 0.2
## 4 1.5 6.0 0.3 10.5 0.4
## 5 1.8 9.5 0.3 36 0.2
## 6 2.8 9.0 0.6 48 0.5
## 7 3.0 8.0 0.5 12 0.4
## 8 3.3 6.0 0.4 105 0.3
## 9 4.0 12.0 0.9 12 0.5
## 10 4.1 10.0 0.6 2 0.3
## 11 3.6 10.0 0.6 46 0.2
## 12 2.6 10.0 0.7 21 0.2
## 13 2.3 7.0 0.6 30 0.4
## 14 1.4 12.0 0.4 54 0.4
## 15 0.6 24.0 0.2 104 0.3
## 16 0.3 24.0 0.2 30 0.2
## 17 0.0 364.0 0.0 - 0.0
## sedative.frequency
## 1 13.0
## 2 19.0
## 3 16.5
## 4 30.0
## 5 3.0
## 6 6.5
## 7 10.0
## 8 6.0
## 9 4.0
## 10 9.0
## 11 52.0
## 12 17.5
## 13 4.0
## 14 10.0
## 15 10.0
## 16 104.0
## 17 15.0
The data is in wide format: every substance has its own use and frequency column. Bring it to long format using gather():
drug_usage1 <- drug_usage %>%
  gather(key = "Group", value = "value", -age, -n) %>%
  arrange(age)
## Warning: attributes are not identical across measure variables;
## they will be dropped
drug_usage1
## age n Group value
## 1 12 2798 alcohol.use 3.9
## 2 12 2798 alcohol.frequency 3
## 3 12 2798 marijuana.use 1.1
## 4 12 2798 marijuana.frequency 4
## 5 12 2798 cocaine.use 0.1
## 6 12 2798 cocaine.frequency 5
## 7 12 2798 crack.use 0
## 8 12 2798 crack.frequency -
## 9 12 2798 heroin.use 0.1
## 10 12 2798 heroin.frequency 35.5
## 11 12 2798 hallucinogen.use 0.2
## 12 12 2798 hallucinogen.frequency 52
## 13 12 2798 inhalant.use 1.6
## 14 12 2798 inhalant.frequency 19
## 15 12 2798 pain releiver.use 2
## 16 12 2798 pain releiver.frequency 36
## 17 12 2798 oxycontin.use 0.1
## 18 12 2798 oxycontin.frequency 24.5
## 19 12 2798 tranquilizer.use 0.2
## 20 12 2798 tranquilizer.frequency 52
## 21 12 2798 stimulant.use 0.2
## 22 12 2798 stimulant.frequency 2
## 23 12 2798 meth.use 0
## 24 12 2798 meth.frequency -
## 25 12 2798 sedative.use 0.2
## 26 12 2798 sedative.frequency 13
## 27 13 2757 alcohol.use 8.5
## 28 13 2757 alcohol.frequency 6
## 29 13 2757 marijuana.use 3.4
## 30 13 2757 marijuana.frequency 15
## 31 13 2757 cocaine.use 0.1
## 32 13 2757 cocaine.frequency 1
## 33 13 2757 crack.use 0
## 34 13 2757 crack.frequency 3
## 35 13 2757 heroin.use 0
## 36 13 2757 heroin.frequency -
## 37 13 2757 hallucinogen.use 0.6
## 38 13 2757 hallucinogen.frequency 6
## 39 13 2757 inhalant.use 2.5
## 40 13 2757 inhalant.frequency 12
## 41 13 2757 pain releiver.use 2.4
## 42 13 2757 pain releiver.frequency 14
## 43 13 2757 oxycontin.use 0.1
## 44 13 2757 oxycontin.frequency 41
## 45 13 2757 tranquilizer.use 0.3
## 46 13 2757 tranquilizer.frequency 25.5
## 47 13 2757 stimulant.use 0.3
## 48 13 2757 stimulant.frequency 4
## 49 13 2757 meth.use 0.1
## 50 13 2757 meth.frequency 5
## 51 13 2757 sedative.use 0.1
## 52 13 2757 sedative.frequency 19
## 53 14 2792 alcohol.use 18.1
## 54 14 2792 alcohol.frequency 5
## 55 14 2792 marijuana.use 8.7
## 56 14 2792 marijuana.frequency 24
## 57 14 2792 cocaine.use 0.1
## 58 14 2792 cocaine.frequency 5.5
## 59 14 2792 crack.use 0
## 60 14 2792 crack.frequency -
## 61 14 2792 heroin.use 0.1
## 62 14 2792 heroin.frequency 2
## 63 14 2792 hallucinogen.use 1.6
## 64 14 2792 hallucinogen.frequency 3
## 65 14 2792 inhalant.use 2.6
## 66 14 2792 inhalant.frequency 5
## 67 14 2792 pain releiver.use 3.9
## 68 14 2792 pain releiver.frequency 12
## 69 14 2792 oxycontin.use 0.4
## 70 14 2792 oxycontin.frequency 4.5
## 71 14 2792 tranquilizer.use 0.9
## 72 14 2792 tranquilizer.frequency 5
## 73 14 2792 stimulant.use 0.8
## 74 14 2792 stimulant.frequency 12
## 75 14 2792 meth.use 0.1
## 76 14 2792 meth.frequency 24
## 77 14 2792 sedative.use 0.2
## 78 14 2792 sedative.frequency 16.5
## 79 15 2956 alcohol.use 29.2
## 80 15 2956 alcohol.frequency 6
## 81 15 2956 marijuana.use 14.5
## 82 15 2956 marijuana.frequency 25
## 83 15 2956 cocaine.use 0.5
## 84 15 2956 cocaine.frequency 4
## 85 15 2956 crack.use 0.1
## 86 15 2956 crack.frequency 9.5
## 87 15 2956 heroin.use 0.2
## 88 15 2956 heroin.frequency 1
## 89 15 2956 hallucinogen.use 2.1
## 90 15 2956 hallucinogen.frequency 4
## 91 15 2956 inhalant.use 2.5
## 92 15 2956 inhalant.frequency 5.5
## 93 15 2956 pain releiver.use 5.5
## 94 15 2956 pain releiver.frequency 10
## 95 15 2956 oxycontin.use 0.8
## 96 15 2956 oxycontin.frequency 3
## 97 15 2956 tranquilizer.use 2
## 98 15 2956 tranquilizer.frequency 4.5
## 99 15 2956 stimulant.use 1.5
## 100 15 2956 stimulant.frequency 6
## 101 15 2956 meth.use 0.3
## 102 15 2956 meth.frequency 10.5
## 103 15 2956 sedative.use 0.4
## 104 15 2956 sedative.frequency 30
## 105 16 3058 alcohol.use 40.1
## 106 16 3058 alcohol.frequency 10
## 107 16 3058 marijuana.use 22.5
## 108 16 3058 marijuana.frequency 30
## 109 16 3058 cocaine.use 1
## 110 16 3058 cocaine.frequency 7
## 111 16 3058 crack.use 0
## 112 16 3058 crack.frequency 1
## 113 16 3058 heroin.use 0.1
## 114 16 3058 heroin.frequency 66.5
## 115 16 3058 hallucinogen.use 3.4
## 116 16 3058 hallucinogen.frequency 3
## 117 16 3058 inhalant.use 3
## 118 16 3058 inhalant.frequency 3
## 119 16 3058 pain releiver.use 6.2
## 120 16 3058 pain releiver.frequency 7
## 121 16 3058 oxycontin.use 1.1
## 122 16 3058 oxycontin.frequency 4
## 123 16 3058 tranquilizer.use 2.4
## 124 16 3058 tranquilizer.frequency 11
## 125 16 3058 stimulant.use 1.8
## 126 16 3058 stimulant.frequency 9.5
## 127 16 3058 meth.use 0.3
## 128 16 3058 meth.frequency 36
## 129 16 3058 sedative.use 0.2
## 130 16 3058 sedative.frequency 3
## 131 17 3038 alcohol.use 49.3
## 132 17 3038 alcohol.frequency 13
## 133 17 3038 marijuana.use 28
## 134 17 3038 marijuana.frequency 36
## 135 17 3038 cocaine.use 2
## 136 17 3038 cocaine.frequency 5
## 137 17 3038 crack.use 0.1
## 138 17 3038 crack.frequency 21
## 139 17 3038 heroin.use 0.1
## 140 17 3038 heroin.frequency 64
## 141 17 3038 hallucinogen.use 4.8
## 142 17 3038 hallucinogen.frequency 3
## 143 17 3038 inhalant.use 2
## 144 17 3038 inhalant.frequency 4
## 145 17 3038 pain releiver.use 8.5
## 146 17 3038 pain releiver.frequency 9
## 147 17 3038 oxycontin.use 1.4
## 148 17 3038 oxycontin.frequency 6
## 149 17 3038 tranquilizer.use 3.5
## 150 17 3038 tranquilizer.frequency 7
## 151 17 3038 stimulant.use 2.8
## 152 17 3038 stimulant.frequency 9
## 153 17 3038 meth.use 0.6
## 154 17 3038 meth.frequency 48
## 155 17 3038 sedative.use 0.5
## 156 17 3038 sedative.frequency 6.5
## 157 18 2469 alcohol.use 58.7
## 158 18 2469 alcohol.frequency 24
## 159 18 2469 marijuana.use 33.7
## 160 18 2469 marijuana.frequency 52
## 161 18 2469 cocaine.use 3.2
## 162 18 2469 cocaine.frequency 5
## 163 18 2469 crack.use 0.4
## 164 18 2469 crack.frequency 10
## 165 18 2469 heroin.use 0.4
## 166 18 2469 heroin.frequency 46
## 167 18 2469 hallucinogen.use 7
## 168 18 2469 hallucinogen.frequency 4
## 169 18 2469 inhalant.use 1.8
## 170 18 2469 inhalant.frequency 4
## 171 18 2469 pain releiver.use 9.2
## 172 18 2469 pain releiver.frequency 12
## 173 18 2469 oxycontin.use 1.7
## 174 18 2469 oxycontin.frequency 7
## 175 18 2469 tranquilizer.use 4.9
## 176 18 2469 tranquilizer.frequency 12
## 177 18 2469 stimulant.use 3
## 178 18 2469 stimulant.frequency 8
## 179 18 2469 meth.use 0.5
## 180 18 2469 meth.frequency 12
## 181 18 2469 sedative.use 0.4
## 182 18 2469 sedative.frequency 10
## 183 19 2223 alcohol.use 64.6
## 184 19 2223 alcohol.frequency 36
## 185 19 2223 marijuana.use 33.4
## 186 19 2223 marijuana.frequency 60
## 187 19 2223 cocaine.use 4.1
## 188 19 2223 cocaine.frequency 5.5
## 189 19 2223 crack.use 0.5
## 190 19 2223 crack.frequency 2
## 191 19 2223 heroin.use 0.5
## 192 19 2223 heroin.frequency 180
## 193 19 2223 hallucinogen.use 8.6
## 194 19 2223 hallucinogen.frequency 3
## 195 19 2223 inhalant.use 1.4
## 196 19 2223 inhalant.frequency 3
## 197 19 2223 pain releiver.use 9.4
## 198 19 2223 pain releiver.frequency 12
## 199 19 2223 oxycontin.use 1.5
## 200 19 2223 oxycontin.frequency 7.5
## 201 19 2223 tranquilizer.use 4.2
## 202 19 2223 tranquilizer.frequency 4.5
## 203 19 2223 stimulant.use 3.3
## 204 19 2223 stimulant.frequency 6
## 205 19 2223 meth.use 0.4
## 206 19 2223 meth.frequency 105
## 207 19 2223 sedative.use 0.3
## 208 19 2223 sedative.frequency 6
## 209 20 2271 alcohol.use 69.7
## 210 20 2271 alcohol.frequency 48
## 211 20 2271 marijuana.use 34
## 212 20 2271 marijuana.frequency 60
## 213 20 2271 cocaine.use 4.9
## 214 20 2271 cocaine.frequency 8
## 215 20 2271 crack.use 0.6
## 216 20 2271 crack.frequency 5
## 217 20 2271 heroin.use 0.9
## 218 20 2271 heroin.frequency 45
## 219 20 2271 hallucinogen.use 7.4
## 220 20 2271 hallucinogen.frequency 2
## 221 20 2271 inhalant.use 1.5
## 222 20 2271 inhalant.frequency 4
## 223 20 2271 pain releiver.use 10
## 224 20 2271 pain releiver.frequency 10
## 225 20 2271 oxycontin.use 1.7
## 226 20 2271 oxycontin.frequency 12
## 227 20 2271 tranquilizer.use 5.4
## 228 20 2271 tranquilizer.frequency 10
## 229 20 2271 stimulant.use 4
## 230 20 2271 stimulant.frequency 12
## 231 20 2271 meth.use 0.9
## 232 20 2271 meth.frequency 12
## 233 20 2271 sedative.use 0.5
## 234 20 2271 sedative.frequency 4
## 235 21 2354 alcohol.use 83.2
## 236 21 2354 alcohol.frequency 52
## 237 21 2354 marijuana.use 33
## 238 21 2354 marijuana.frequency 52
## 239 21 2354 cocaine.use 4.8
## 240 21 2354 cocaine.frequency 5
## 241 21 2354 crack.use 0.5
## 242 21 2354 crack.frequency 17
## 243 21 2354 heroin.use 0.6
## 244 21 2354 heroin.frequency 30
## 245 21 2354 hallucinogen.use 6.3
## 246 21 2354 hallucinogen.frequency 4
## 247 21 2354 inhalant.use 1.4
## 248 21 2354 inhalant.frequency 2
## 249 21 2354 pain releiver.use 9
## 250 21 2354 pain releiver.frequency 15
## 251 21 2354 oxycontin.use 1.3
## 252 21 2354 oxycontin.frequency 13.5
## 253 21 2354 tranquilizer.use 3.9
## 254 21 2354 tranquilizer.frequency 7
## 255 21 2354 stimulant.use 4.1
## 256 21 2354 stimulant.frequency 10
## 257 21 2354 meth.use 0.6
## 258 21 2354 meth.frequency 2
## 259 21 2354 sedative.use 0.3
## 260 21 2354 sedative.frequency 9
## 261 22-23 4707 alcohol.use 84.2
## 262 22-23 4707 alcohol.frequency 52
## 263 22-23 4707 marijuana.use 28.4
## 264 22-23 4707 marijuana.frequency 52
## 265 22-23 4707 cocaine.use 4.5
## 266 22-23 4707 cocaine.frequency 5
## 267 22-23 4707 crack.use 0.5
## 268 22-23 4707 crack.frequency 5
## 269 22-23 4707 heroin.use 1.1
## 270 22-23 4707 heroin.frequency 57.5
## 271 22-23 4707 hallucinogen.use 5.2
## 272 22-23 4707 hallucinogen.frequency 3
## 273 22-23 4707 inhalant.use 1
## 274 22-23 4707 inhalant.frequency 4
## 275 22-23 4707 pain releiver.use 10
## 276 22-23 4707 pain releiver.frequency 15
## 277 22-23 4707 oxycontin.use 1.7
## 278 22-23 4707 oxycontin.frequency 17.5
## 279 22-23 4707 tranquilizer.use 4.4
## 280 22-23 4707 tranquilizer.frequency 12
## 281 22-23 4707 stimulant.use 3.6
## 282 22-23 4707 stimulant.frequency 10
## 283 22-23 4707 meth.use 0.6
## 284 22-23 4707 meth.frequency 46
## 285 22-23 4707 sedative.use 0.2
## 286 22-23 4707 sedative.frequency 52
## 287 24-25 4591 alcohol.use 83.1
## 288 24-25 4591 alcohol.frequency 52
## 289 24-25 4591 marijuana.use 24.9
## 290 24-25 4591 marijuana.frequency 60
## 291 24-25 4591 cocaine.use 4
## 292 24-25 4591 cocaine.frequency 6
## 293 24-25 4591 crack.use 0.5
## 294 24-25 4591 crack.frequency 6
## 295 24-25 4591 heroin.use 0.7
## 296 24-25 4591 heroin.frequency 88
## 297 24-25 4591 hallucinogen.use 4.5
## 298 24-25 4591 hallucinogen.frequency 2
## 299 24-25 4591 inhalant.use 0.8
## 300 24-25 4591 inhalant.frequency 2
## 301 24-25 4591 pain releiver.use 9
## 302 24-25 4591 pain releiver.frequency 15
## 303 24-25 4591 oxycontin.use 1.3
## 304 24-25 4591 oxycontin.frequency 20
## 305 24-25 4591 tranquilizer.use 4.3
## 306 24-25 4591 tranquilizer.frequency 10
## 307 24-25 4591 stimulant.use 2.6
## 308 24-25 4591 stimulant.frequency 10
## 309 24-25 4591 meth.use 0.7
## 310 24-25 4591 meth.frequency 21
## 311 24-25 4591 sedative.use 0.2
## 312 24-25 4591 sedative.frequency 17.5
## 313 26-29 2628 alcohol.use 80.7
## 314 26-29 2628 alcohol.frequency 52
## 315 26-29 2628 marijuana.use 20.8
## 316 26-29 2628 marijuana.frequency 52
## 317 26-29 2628 cocaine.use 3.2
## 318 26-29 2628 cocaine.frequency 5
## 319 26-29 2628 crack.use 0.4
## 320 26-29 2628 crack.frequency 6
## 321 26-29 2628 heroin.use 0.6
## 322 26-29 2628 heroin.frequency 50
## 323 26-29 2628 hallucinogen.use 3.2
## 324 26-29 2628 hallucinogen.frequency 3
## 325 26-29 2628 inhalant.use 0.6
## 326 26-29 2628 inhalant.frequency 4
## 327 26-29 2628 pain releiver.use 8.3
## 328 26-29 2628 pain releiver.frequency 13
## 329 26-29 2628 oxycontin.use 1.2
## 330 26-29 2628 oxycontin.frequency 13.5
## 331 26-29 2628 tranquilizer.use 4.2
## 332 26-29 2628 tranquilizer.frequency 10
## 333 26-29 2628 stimulant.use 2.3
## 334 26-29 2628 stimulant.frequency 7
## 335 26-29 2628 meth.use 0.6
## 336 26-29 2628 meth.frequency 30
## 337 26-29 2628 sedative.use 0.4
## 338 26-29 2628 sedative.frequency 4
## 339 30-34 2864 alcohol.use 77.5
## 340 30-34 2864 alcohol.frequency 52
## 341 30-34 2864 marijuana.use 16.4
## 342 30-34 2864 marijuana.frequency 72
## 343 30-34 2864 cocaine.use 2.1
## 344 30-34 2864 cocaine.frequency 8
## 345 30-34 2864 crack.use 0.5
## 346 30-34 2864 crack.frequency 15
## 347 30-34 2864 heroin.use 0.4
## 348 30-34 2864 heroin.frequency 66
## 349 30-34 2864 hallucinogen.use 1.8
## 350 30-34 2864 hallucinogen.frequency 2
## 351 30-34 2864 inhalant.use 0.4
## 352 30-34 2864 inhalant.frequency 3.5
## 353 30-34 2864 pain releiver.use 5.9
## 354 30-34 2864 pain releiver.frequency 22
## 355 30-34 2864 oxycontin.use 0.9
## 356 30-34 2864 oxycontin.frequency 46
## 357 30-34 2864 tranquilizer.use 3.6
## 358 30-34 2864 tranquilizer.frequency 8
## 359 30-34 2864 stimulant.use 1.4
## 360 30-34 2864 stimulant.frequency 12
## 361 30-34 2864 meth.use 0.4
## 362 30-34 2864 meth.frequency 54
## 363 30-34 2864 sedative.use 0.4
## 364 30-34 2864 sedative.frequency 10
## 365 35-49 7391 alcohol.use 75
## 366 35-49 7391 alcohol.frequency 52
## 367 35-49 7391 marijuana.use 10.4
## 368 35-49 7391 marijuana.frequency 48
## 369 35-49 7391 cocaine.use 1.5
## 370 35-49 7391 cocaine.frequency 15
## 371 35-49 7391 crack.use 0.5
## 372 35-49 7391 crack.frequency 48
## 373 35-49 7391 heroin.use 0.1
## 374 35-49 7391 heroin.frequency 280
## 375 35-49 7391 hallucinogen.use 0.6
## 376 35-49 7391 hallucinogen.frequency 3
## 377 35-49 7391 inhalant.use 0.3
## 378 35-49 7391 inhalant.frequency 10
## 379 35-49 7391 pain releiver.use 4.2
## 380 35-49 7391 pain releiver.frequency 12
## 381 35-49 7391 oxycontin.use 0.3
## 382 35-49 7391 oxycontin.frequency 12
## 383 35-49 7391 tranquilizer.use 1.9
## 384 35-49 7391 tranquilizer.frequency 6
## 385 35-49 7391 stimulant.use 0.6
## 386 35-49 7391 stimulant.frequency 24
## 387 35-49 7391 meth.use 0.2
## 388 35-49 7391 meth.frequency 104
## 389 35-49 7391 sedative.use 0.3
## 390 35-49 7391 sedative.frequency 10
## 391 50-64 3923 alcohol.use 67.2
## 392 50-64 3923 alcohol.frequency 52
## 393 50-64 3923 marijuana.use 7.3
## 394 50-64 3923 marijuana.frequency 52
## 395 50-64 3923 cocaine.use 0.9
## 396 50-64 3923 cocaine.frequency 36
## 397 50-64 3923 crack.use 0.4
## 398 50-64 3923 crack.frequency 62
## 399 50-64 3923 heroin.use 0.1
## 400 50-64 3923 heroin.frequency 41
## 401 50-64 3923 hallucinogen.use 0.3
## 402 50-64 3923 hallucinogen.frequency 44
## 403 50-64 3923 inhalant.use 0.2
## 404 50-64 3923 inhalant.frequency 13.5
## 405 50-64 3923 pain releiver.use 2.5
## 406 50-64 3923 pain releiver.frequency 12
## 407 50-64 3923 oxycontin.use 0.4
## 408 50-64 3923 oxycontin.frequency 5
## 409 50-64 3923 tranquilizer.use 1.4
## 410 50-64 3923 tranquilizer.frequency 10
## 411 50-64 3923 stimulant.use 0.3
## 412 50-64 3923 stimulant.frequency 24
## 413 50-64 3923 meth.use 0.2
## 414 50-64 3923 meth.frequency 30
## 415 50-64 3923 sedative.use 0.2
## 416 50-64 3923 sedative.frequency 104
## 417 65+ 2448 alcohol.use 49.3
## 418 65+ 2448 alcohol.frequency 52
## 419 65+ 2448 marijuana.use 1.2
## 420 65+ 2448 marijuana.frequency 36
## 421 65+ 2448 cocaine.use 0
## 422 65+ 2448 cocaine.frequency -
## 423 65+ 2448 crack.use 0
## 424 65+ 2448 crack.frequency -
## 425 65+ 2448 heroin.use 0
## 426 65+ 2448 heroin.frequency 120
## 427 65+ 2448 hallucinogen.use 0.1
## 428 65+ 2448 hallucinogen.frequency 2
## 429 65+ 2448 inhalant.use 0
## 430 65+ 2448 inhalant.frequency -
## 431 65+ 2448 pain releiver.use 0.6
## 432 65+ 2448 pain releiver.frequency 24
## 433 65+ 2448 oxycontin.use 0
## 434 65+ 2448 oxycontin.frequency -
## 435 65+ 2448 tranquilizer.use 0.2
## 436 65+ 2448 tranquilizer.frequency 5
## 437 65+ 2448 stimulant.use 0
## 438 65+ 2448 stimulant.frequency 364
## 439 65+ 2448 meth.use 0
## 440 65+ 2448 meth.frequency -
## 441 65+ 2448 sedative.use 0
## 442 65+ 2448 sedative.frequency 15
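In current tidyr releases the gather() step can also be expressed with pivot_longer(). The sketch below is an assumption about how that might look, run on a two-row toy data frame (toy and toy_long are hypothetical names; the values are taken from the table above). Because the measure columns mix numbers and text such as "-", the values are converted to character while reshaping; this relies on the values_transform argument, which needs tidyr 1.1.0 or later.

```r
library(tidyr)

# Two-row toy version of the wide table: one numeric measure column and
# one text column that contains the "-" placeholder.
toy <- data.frame(
  age = c("12", "13"),
  n = c(2798, 2757),
  alcohol.use = c(3.9, 8.5),
  crack.frequency = c("-", "3"),
  stringsAsFactors = FALSE
)

# Reshape every column except age and n into Group/value pairs.
# values_transform converts each measure column to character first,
# because numeric and character columns cannot be stacked directly.
toy_long <- pivot_longer(
  toy,
  cols = -c(age, n),
  names_to = "Group",
  values_to = "value",
  values_transform = list(value = as.character)
)
toy_long
```

The explicit conversion plays the role of the "attributes are not identical" warning from gather(): both signal that columns of different types are being stacked into one.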
Now split each Group value into the substance and the measure (use or frequency) using separate():
drug_usage2 <- drug_usage1 %>%
separate(Group, into = c("Substance", "class"), sep = "\\." )
head(drug_usage2,10)
## age n Substance class value
## 1 12 2798 alcohol use 3.9
## 2 12 2798 alcohol frequency 3
## 3 12 2798 marijuana use 1.1
## 4 12 2798 marijuana frequency 4
## 5 12 2798 cocaine use 0.1
## 6 12 2798 cocaine frequency 5
## 7 12 2798 crack use 0
## 8 12 2798 crack frequency -
## 9 12 2798 heroin use 0.1
## 10 12 2798 heroin frequency 35.5
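The split on "." only yields two pieces because the pain.releiver columns were renamed earlier. As a hedged alternative sketch, tidyr::extract() with a regular expression that captures everything before the last period handles names with more than one period, so the rename would not be required. The toy data frame and the names toy_groups and toy_split are assumptions for illustration.

```r
library(tidyr)

# Toy long-format rows, including names that contain two periods.
toy_groups <- data.frame(
  Group = c("alcohol.use", "pain.releiver.use", "pain.releiver.frequency"),
  value = c("3.9", "2", "36"),
  stringsAsFactors = FALSE
)

# "^(.*)\\.([^.]+)$": the first group greedily takes everything up to the
# last period, the second group takes the final piece (use or frequency).
toy_split <- tidyr::extract(
  toy_groups,
  Group,
  into = c("Substance", "class"),
  regex = "^(.*)\\.([^.]+)$"
)
toy_split
```

Here the substance keeps its original spelling with a period ("pain.releiver") rather than the space introduced by the rename.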
Spread the use and frequency measures into their own columns using spread(), giving one row per age group and substance:
drug_usage3 <- drug_usage2 %>%
spread(class, value)
drug_usage3
## age n Substance frequency use
## 1 12 2798 alcohol 3 3.9
## 2 12 2798 cocaine 5 0.1
## 3 12 2798 crack - 0
## 4 12 2798 hallucinogen 52 0.2
## 5 12 2798 heroin 35.5 0.1
## 6 12 2798 inhalant 19 1.6
## 7 12 2798 marijuana 4 1.1
## 8 12 2798 meth - 0
## 9 12 2798 oxycontin 24.5 0.1
## 10 12 2798 pain releiver 36 2
## 11 12 2798 sedative 13 0.2
## 12 12 2798 stimulant 2 0.2
## 13 12 2798 tranquilizer 52 0.2
## 14 13 2757 alcohol 6 8.5
## 15 13 2757 cocaine 1 0.1
## 16 13 2757 crack 3 0
## 17 13 2757 hallucinogen 6 0.6
## 18 13 2757 heroin - 0
## 19 13 2757 inhalant 12 2.5
## 20 13 2757 marijuana 15 3.4
## 21 13 2757 meth 5 0.1
## 22 13 2757 oxycontin 41 0.1
## 23 13 2757 pain releiver 14 2.4
## 24 13 2757 sedative 19 0.1
## 25 13 2757 stimulant 4 0.3
## 26 13 2757 tranquilizer 25.5 0.3
## 27 14 2792 alcohol 5 18.1
## 28 14 2792 cocaine 5.5 0.1
## 29 14 2792 crack - 0
## 30 14 2792 hallucinogen 3 1.6
## 31 14 2792 heroin 2 0.1
## 32 14 2792 inhalant 5 2.6
## 33 14 2792 marijuana 24 8.7
## 34 14 2792 meth 24 0.1
## 35 14 2792 oxycontin 4.5 0.4
## 36 14 2792 pain releiver 12 3.9
## 37 14 2792 sedative 16.5 0.2
## 38 14 2792 stimulant 12 0.8
## 39 14 2792 tranquilizer 5 0.9
## 40 15 2956 alcohol 6 29.2
## 41 15 2956 cocaine 4 0.5
## 42 15 2956 crack 9.5 0.1
## 43 15 2956 hallucinogen 4 2.1
## 44 15 2956 heroin 1 0.2
## 45 15 2956 inhalant 5.5 2.5
## 46 15 2956 marijuana 25 14.5
## 47 15 2956 meth 10.5 0.3
## 48 15 2956 oxycontin 3 0.8
## 49 15 2956 pain releiver 10 5.5
## 50 15 2956 sedative 30 0.4
## 51 15 2956 stimulant 6 1.5
## 52 15 2956 tranquilizer 4.5 2
## 53 16 3058 alcohol 10 40.1
## 54 16 3058 cocaine 7 1
## 55 16 3058 crack 1 0
## 56 16 3058 hallucinogen 3 3.4
## 57 16 3058 heroin 66.5 0.1
## 58 16 3058 inhalant 3 3
## 59 16 3058 marijuana 30 22.5
## 60 16 3058 meth 36 0.3
## 61 16 3058 oxycontin 4 1.1
## 62 16 3058 pain releiver 7 6.2
## 63 16 3058 sedative 3 0.2
## 64 16 3058 stimulant 9.5 1.8
## 65 16 3058 tranquilizer 11 2.4
## 66 17 3038 alcohol 13 49.3
## 67 17 3038 cocaine 5 2
## 68 17 3038 crack 21 0.1
## 69 17 3038 hallucinogen 3 4.8
## 70 17 3038 heroin 64 0.1
## 71 17 3038 inhalant 4 2
## 72 17 3038 marijuana 36 28
## 73 17 3038 meth 48 0.6
## 74 17 3038 oxycontin 6 1.4
## 75 17 3038 pain releiver 9 8.5
## 76 17 3038 sedative 6.5 0.5
## 77 17 3038 stimulant 9 2.8
## 78 17 3038 tranquilizer 7 3.5
## 79 18 2469 alcohol 24 58.7
## 80 18 2469 cocaine 5 3.2
## 81 18 2469 crack 10 0.4
## 82 18 2469 hallucinogen 4 7
## 83 18 2469 heroin 46 0.4
## 84 18 2469 inhalant 4 1.8
## 85 18 2469 marijuana 52 33.7
## 86 18 2469 meth 12 0.5
## 87 18 2469 oxycontin 7 1.7
## 88 18 2469 pain releiver 12 9.2
## 89 18 2469 sedative 10 0.4
## 90 18 2469 stimulant 8 3
## 91 18 2469 tranquilizer 12 4.9
## 92 19 2223 alcohol 36 64.6
## 93 19 2223 cocaine 5.5 4.1
## 94 19 2223 crack 2 0.5
## 95 19 2223 hallucinogen 3 8.6
## 96 19 2223 heroin 180 0.5
## 97 19 2223 inhalant 3 1.4
## 98 19 2223 marijuana 60 33.4
## 99 19 2223 meth 105 0.4
## 100 19 2223 oxycontin 7.5 1.5
## 101 19 2223 pain releiver 12 9.4
## 102 19 2223 sedative 6 0.3
## 103 19 2223 stimulant 6 3.3
## 104 19 2223 tranquilizer 4.5 4.2
## 105 20 2271 alcohol 48 69.7
## 106 20 2271 cocaine 8 4.9
## 107 20 2271 crack 5 0.6
## 108 20 2271 hallucinogen 2 7.4
## 109 20 2271 heroin 45 0.9
## 110 20 2271 inhalant 4 1.5
## 111 20 2271 marijuana 60 34
## 112 20 2271 meth 12 0.9
## 113 20 2271 oxycontin 12 1.7
## 114 20 2271 pain releiver 10 10
## 115 20 2271 sedative 4 0.5
## 116 20 2271 stimulant 12 4
## 117 20 2271 tranquilizer 10 5.4
## 118 21 2354 alcohol 52 83.2
## 119 21 2354 cocaine 5 4.8
## 120 21 2354 crack 17 0.5
## 121 21 2354 hallucinogen 4 6.3
## 122 21 2354 heroin 30 0.6
## 123 21 2354 inhalant 2 1.4
## 124 21 2354 marijuana 52 33
## 125 21 2354 meth 2 0.6
## 126 21 2354 oxycontin 13.5 1.3
## 127 21 2354 pain releiver 15 9
## 128 21 2354 sedative 9 0.3
## 129 21 2354 stimulant 10 4.1
## 130 21 2354 tranquilizer 7 3.9
## 131 22-23 4707 alcohol 52 84.2
## 132 22-23 4707 cocaine 5 4.5
## 133 22-23 4707 crack 5 0.5
## 134 22-23 4707 hallucinogen 3 5.2
## 135 22-23 4707 heroin 57.5 1.1
## 136 22-23 4707 inhalant 4 1
## 137 22-23 4707 marijuana 52 28.4
## 138 22-23 4707 meth 46 0.6
## 139 22-23 4707 oxycontin 17.5 1.7
## 140 22-23 4707 pain releiver 15 10
## 141 22-23 4707 sedative 52 0.2
## 142 22-23 4707 stimulant 10 3.6
## 143 22-23 4707 tranquilizer 12 4.4
## 144 24-25 4591 alcohol 52 83.1
## 145 24-25 4591 cocaine 6 4
## 146 24-25 4591 crack 6 0.5
## 147 24-25 4591 hallucinogen 2 4.5
## 148 24-25 4591 heroin 88 0.7
## 149 24-25 4591 inhalant 2 0.8
## 150 24-25 4591 marijuana 60 24.9
## 151 24-25 4591 meth 21 0.7
## 152 24-25 4591 oxycontin 20 1.3
## 153 24-25 4591 pain releiver 15 9
## 154 24-25 4591 sedative 17.5 0.2
## 155 24-25 4591 stimulant 10 2.6
## 156 24-25 4591 tranquilizer 10 4.3
## 157 26-29 2628 alcohol 52 80.7
## 158 26-29 2628 cocaine 5 3.2
## 159 26-29 2628 crack 6 0.4
## 160 26-29 2628 hallucinogen 3 3.2
## 161 26-29 2628 heroin 50 0.6
## 162 26-29 2628 inhalant 4 0.6
## 163 26-29 2628 marijuana 52 20.8
## 164 26-29 2628 meth 30 0.6
## 165 26-29 2628 oxycontin 13.5 1.2
## 166 26-29 2628 pain releiver 13 8.3
## 167 26-29 2628 sedative 4 0.4
## 168 26-29 2628 stimulant 7 2.3
## 169 26-29 2628 tranquilizer 10 4.2
## 170 30-34 2864 alcohol 52 77.5
## 171 30-34 2864 cocaine 8 2.1
## 172 30-34 2864 crack 15 0.5
## 173 30-34 2864 hallucinogen 2 1.8
## 174 30-34 2864 heroin 66 0.4
## 175 30-34 2864 inhalant 3.5 0.4
## 176 30-34 2864 marijuana 72 16.4
## 177 30-34 2864 meth 54 0.4
## 178 30-34 2864 oxycontin 46 0.9
## 179 30-34 2864 pain releiver 22 5.9
## 180 30-34 2864 sedative 10 0.4
## 181 30-34 2864 stimulant 12 1.4
## 182 30-34 2864 tranquilizer 8 3.6
## 183 35-49 7391 alcohol 52 75
## 184 35-49 7391 cocaine 15 1.5
## 185 35-49 7391 crack 48 0.5
## 186 35-49 7391 hallucinogen 3 0.6
## 187 35-49 7391 heroin 280 0.1
## 188 35-49 7391 inhalant 10 0.3
## 189 35-49 7391 marijuana 48 10.4
## 190 35-49 7391 meth 104 0.2
## 191 35-49 7391 oxycontin 12 0.3
## 192 35-49 7391 pain releiver 12 4.2
## 193 35-49 7391 sedative 10 0.3
## 194 35-49 7391 stimulant 24 0.6
## 195 35-49 7391 tranquilizer 6 1.9
## 196 50-64 3923 alcohol 52 67.2
## 197 50-64 3923 cocaine 36 0.9
## 198 50-64 3923 crack 62 0.4
## 199 50-64 3923 hallucinogen 44 0.3
## 200 50-64 3923 heroin 41 0.1
## 201 50-64 3923 inhalant 13.5 0.2
## 202 50-64 3923 marijuana 52 7.3
## 203 50-64 3923 meth 30 0.2
## 204 50-64 3923 oxycontin 5 0.4
## 205 50-64 3923 pain releiver 12 2.5
## 206 50-64 3923 sedative 104 0.2
## 207 50-64 3923 stimulant 24 0.3
## 208 50-64 3923 tranquilizer 10 1.4
## 209 65+ 2448 alcohol 52 49.3
## 210 65+ 2448 cocaine - 0
## 211 65+ 2448 crack - 0
## 212 65+ 2448 hallucinogen 2 0.1
## 213 65+ 2448 heroin 120 0
## 214 65+ 2448 inhalant - 0
## 215 65+ 2448 marijuana 36 1.2
## 216 65+ 2448 meth - 0
## 217 65+ 2448 oxycontin - 0
## 218 65+ 2448 pain releiver 24 0.6
## 219 65+ 2448 sedative 15 0
## 220 65+ 2448 stimulant 364 0
## 221 65+ 2448 tranquilizer 5 0.2
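For comparison, the spread() step corresponds to pivot_wider() in newer tidyr versions. The following is a small sketch on toy rows (toy_measures and toy_wide are hypothetical names, with values copied from the age 12 rows above), not a change to the code used in this report.

```r
library(tidyr)

# Toy long-format rows: two substances, each with a use and a frequency row.
toy_measures <- data.frame(
  age = c("12", "12", "12", "12"),
  Substance = c("alcohol", "alcohol", "crack", "crack"),
  class = c("use", "frequency", "use", "frequency"),
  value = c("3.9", "3", "0", "-"),
  stringsAsFactors = FALSE
)

# names_from supplies the new column names and values_from fills them,
# producing one row per age/Substance combination.
toy_wide <- pivot_wider(toy_measures, names_from = class, values_from = value)
toy_wide
```

The values are still text at this point, so the "-" placeholder survives the reshape and has to be handled in the numeric conversion that follows.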
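Use mutate() to convert use and frequency to numeric. The "-" placeholders cannot be parsed and become NA, which is why the coercion warnings are suppressed: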
drug_usage3 <- suppressWarnings(drug_usage3 %>%
mutate(use = as.numeric(use),
frequency = as.numeric(frequency)))
head(drug_usage3)
## age n Substance frequency use
## 1 12 2798 alcohol 3.0 3.9
## 2 12 2798 cocaine 5.0 0.1
## 3 12 2798 crack NA 0.0
## 4 12 2798 hallucinogen 52.0 0.2
## 5 12 2798 heroin 35.5 0.1
## 6 12 2798 inhalant 19.0 1.6
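Suppressing warnings hides every coercion problem, not only the expected dashes. One possible refinement, sketched here on toy data under the assumption that "-" is the only placeholder, is to turn the dashes into NA explicitly with dplyr::na_if() before converting. Any other unparseable value would then still raise a warning. The names toy_chr and toy_num are illustrative.

```r
library(dplyr)

# Toy rows with text values, as they look after the reshaping steps.
toy_chr <- data.frame(
  Substance = c("alcohol", "crack"),
  frequency = c("3", "-"),
  use = c("3.9", "0"),
  stringsAsFactors = FALSE
)

# na_if() replaces the "-" placeholder with NA, so as.numeric() has
# nothing left that it cannot parse and emits no warning.
toy_num <- toy_chr %>%
  mutate(
    frequency = as.numeric(na_if(frequency, "-")),
    use = as.numeric(na_if(use, "-"))
  )
toy_num
```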
Keep only the alcohol rows for the analysis:
alcohol <- drug_usage3 %>%
filter(Substance=="alcohol") %>%
select(-Substance)
alcohol
## age n frequency use
## 1 12 2798 3 3.9
## 2 13 2757 6 8.5
## 3 14 2792 5 18.1
## 4 15 2956 6 29.2
## 5 16 3058 10 40.1
## 6 17 3038 13 49.3
## 7 18 2469 24 58.7
## 8 19 2223 36 64.6
## 9 20 2271 48 69.7
## 10 21 2354 52 83.2
## 11 22-23 4707 52 84.2
## 12 24-25 4591 52 83.1
## 13 26-29 2628 52 80.7
## 14 30-34 2864 52 77.5
## 15 35-49 7391 52 75.0
## 16 50-64 3923 52 67.2
## 17 65+ 2448 52 49.3
Plotting age against alcohol usage:
Age versus use:
ggplot(alcohol) + geom_point(aes(age, y = use), color = "red") + labs(x = "Age", y = "Usage")
Age versus frequency:
ggplot(alcohol) + geom_point(aes(age, y = frequency), color = "blue") + labs(x = "Age", y = "Frequency")
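Because age holds labels such as "22-23" and "65+", ggplot2 treats it as a categorical axis. The sketch below shows one way to pin the order of the age groups and connect the points with a line; it is an illustrative variation on the plots above, using a hypothetical alcohol_toy data frame with four rows copied from the alcohol table.

```r
library(ggplot2)

# A subset of the alcohol rows shown above.
alcohol_toy <- data.frame(
  age = c("12", "21", "22-23", "65+"),
  use = c(3.9, 83.2, 84.2, 49.3),
  frequency = c(3, 52, 52, 52),
  stringsAsFactors = FALSE
)

# Fix the level order to the order of the rows, so the x axis does not
# depend on alphabetical sorting of the age labels.
alcohol_toy$age <- factor(alcohol_toy$age, levels = alcohol_toy$age)

# group = 1 tells geom_line() to join all points into a single line even
# though the x axis is categorical.
p <- ggplot(alcohol_toy, aes(x = age, y = use, group = 1)) +
  geom_line(color = "red") +
  geom_point(color = "red") +
  labs(x = "Age", y = "Usage")
```

Printing p (or calling print(p)) renders the chart; swapping y = use for y = frequency gives the corresponding frequency plot.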