## Warning: package 'dplyr' was built under R version 3.4.2
## Warning: package 'tidyr' was built under R version 3.4.2
## Warning: package 'Rcpp' was built under R version 3.4.2
Figure 1: Figure 1A shows the relative mean access count among the different tools; Figure 1B shows the relative purchase count of each tool.
I created a two part figure in order to show amount of engagement relative to the number of tools sold. We can see that despite being the lowest selling tool, FACS test has the highest mean access count. The rest of the tools have mean access counts that are so low that I created tables in order to investigate the numbers more precisely.
## Email Tool Access.Count
## 1 bonettiraphael@gmail.com FACS Test 10063
## 2 cm.box@icloud.com FACS Test 2448
## 3 studiog.messina@tiscali.it FACS Test 1449
## 4 535265294@qq.com FACS Test 936
## 5 osamura.m@gmail.com FACS Test 726
Figure 2: This figure shows the top 5 access counts for the FACS test.
## Email Tool Access.Count
## 1 caochoaq@gmail.com METT 87
## 2 johnpdoody2@yahoo.co.uk METT 44
## 3 yagana.yakubu@asburyseminary.edu METT 42
## 4 ediazl@up.edu.mx METT 40
## 5 paul.christi.kim@asburyseminary.edu METT 29
Figure 3: This figure shows the top 5 access counts for the METT test.
## Email Tool Access.Count
## 1 arthur.sher@icloud.com SETT 67
## 2 adavis44@utk.edu SETT 53
## 3 rationalshawn@gmail.com SETT 23
## 4 danielfarrer10@gmail.com SETT 22
## 5 mywong@hsmc.edu.hk SETT 22
Figure 4: This figure shows the top 5 access counts for the SETT test.
## Email Tool Access.Count
## 1 Khaled.Salih@icloud.com RETT: Family 7
## 2 j-lilja@hotmail.com RETT: Family 6
## 3 teunzijlmans@gmail.com RETT: Family 4
## 4 riccraig@protonmail.com RETT: Family 4
## 5 deivis_fsilva@yahoo.com.br RETT: Family 4
Figure 5: This figure shows the top 5 access counts for the RETT: Family test.
## Email Tool Access.Count
## 1 move_it321@yahoo.com RETT: Law Enforcement 10
## 2 clamabu@gmail.com RETT: Law Enforcement 9
## 3 ancient.eb@gmail.com RETT: Law Enforcement 9
## 4 rouradnik@gmail.com RETT: Law Enforcement 7
## 5 epotter@pottercounseling.com RETT: Law Enforcement 6
Figure 6: This figure shows the top 5 access counts for the RETT: Law Enforcement test.
## Email Tool Access.Count
## 1 zhiyan.zhang@shms-mail.ch RETT: Workplace 7
## 2 erdobber@hotmail.com RETT: Workplace 6
## 3 bethel@gmail.com RETT: Workplace 5
## 4 elisa.lazier@libero.it RETT: Workplace 5
## 5 therbkelly@hotmail.com RETT: Workplace 4
Figure 7: This figure shows the top 5 access counts for the RETT: Workplace test.
From these six tables, we can see that FACS had exponentially higher engagement than the rest of the tools. METT and SETT are at least in double digit access count numbers which is good considering that they are the highest selling tools. The rest of the tools (the RETT sets) are all pretty much in single digit access counts with the exception of one user for RETT: Law Enforcement who accessed the tool 10 times.
## X2017.01.01...2017.12.31 Units Dollars
## 1 Total 2278 114351.80
## 2 summer 213 10562.20
## 3 truth 184 9269.60
## 4 newyear 161 8227.60
## 5 fall4savings 128 6276.20
## 6 holiday 119 5516.20
## 7 savemore 116 5364.80
## 8 spooky 113 5352.80
## 9 universal 102 5154.40
## 10 winter 97 4630.60
## 11 fall 93 4601.00
## 12 expression 84 4091.00
## 13 spring 78 3677.80
## 14 lietome 69 3408.20
## 15 science 59 2965.00
## 16 eyes 56 2886.60
## 17 crush 48 2272.40
## 18 blackfriday 47 2390.60
## 19 learn 46 2106.80
## 20 washington 46 2154.80
## 21 emotional 45 2081.00
## 22 holidays 43 2151.40
## 23 appreciation 39 2138.00
## 24 usa1776 37 1880.00
## 25 fool 36 1968.60
## 26 ty25rtfb 35 1746.50
## 27 light 32 1547.60
## 28 clover 32 1637.60
## 29 mask 32 1405.60
## 30 december 25 1139.00
## 31 cyber 21 1197.80
## 32 relief 9 404.20
## 33 m316jjltd 7 280.60
## 34 upgrade_2017 5 747.00
## 35 welcomeback 4 143.75
## 36 hadnagy20 4 1011.40
## 37 sv&23!9 4 254.25
## 38 asbury20 2 489.80
## 39 anger25 2 99.50
## 40 pk!30 1 19.80
## 41 #facs_10_! 1 700.00
## 42 rain 1 59.80
## 43 bl-hadnagy12 1 100.00
## 44 highbaugh20 1 240.00
Figure 8: This figure shows all 44 coupon codes and their relative performance in decreasing order in both units and dollars.
We can see here that almost all of the highest performing coupon codes were holiday related with the exception of code “truth.”
Figure 9: This figure shows the number of sales we had by the billing country.
I think it is very impressive the range of countries that are represented in this figure because in order to even be represented, someone had to purchase it from there. We can see here that our highest engagement by far is the United States. However, there is considerable engagement in other countries but it is hard to discern which countries they are due to the scale of the chart. Therefore, I created a table in order to report explicitly the name of the top 20 countries.
## # A tibble: 20 x 2
## billing_country n
## <chr> <int>
## 1 US 1197
## 2 AU 165
## 3 DE 158
## 4 CA 128
## 5 GB 108
## 6 IT 77
## 7 NL 56
## 8 CH 53
## 9 BR 50
## 10 FR 49
## 11 JP 43
## 12 AT 39
## 13 RU 37
## 14 MX 36
## 15 PL 31
## 16 SE 28
## 17 ES 26
## 18 SG 22
## 19 IL 19
## 20 DK 18
Figure 10: This figure shows the top 20 countries in terms of sales.
One thing to note is that this figure is not representative of overall engagement but solely sales. It is logical to assume that they would be relatively correlative and therefore made deductions and extrapolations from this graph but it’s worth keeping this in mind. For billing country identification that might be unfamiliar (http://www.nationsonline.org/oneworld/country_code_list.html): AU = Australia; DE = Germany; CA = Canada; GB = Great Britain; IT = Italy; NL = Netherlands; CH = Switzerland; BR = Brazil; FR = France; JP = Japan; AT = Austria; RU = Russian Federation; MX = Mexico; PL = Poland; SE = Sweden; ES = Spain; SG = Singapore; IL = Israel; DK = Denmark
## # A tibble: 2,626 x 2
## # Groups: billing_company [584]
## ID billing_company
## <int> <chr>
## 1 38688
## 2 46043
## 3 43979
## 4 46408
## 5 45368
## 6 41506
## 7 46498
## 8 47375
## 9 37956
## 10 43401 Ping An Technology
## # ... with 2,616 more rows
Figure 11: This figure shows the billing company for sales.
As you can see here, there are only some companies reported - not every sale has a company reported. Looking at the billing company is only a circuitous way to try and find out what industry the client is in but it is highly inefficient to look up each company reported - particularly on this scale and especially considering the return we get which may or may not be reliable. A much better way to answer the question of how the clients are using our product and which clients are using our product (in terms of industry targeting) is through directly asking the client via a drop down menu or a list of options to select from. This way we would have completley representative data and it would be accurate. A potential way to posit this question would be “What are you using this product for?”
## Warning: Removed 1 rows containing missing values (geom_col).
Figure 12: This figure shows the number of sales by age.
I considered ommitting this graph but decided to keep it just to bring attention to why this figure looks so strange. The data we have in terms of age is completely unreliable. There was a considerable number of reported ages between 100-200 years old which is incredibly dubious. Also, when trying to clean up the data, I went through the Excel spreadsheet filtering with ‘10’ in order to more efficiently find those that are listed as 100 or older but then realized the number of 10-year-olds that apparently bought this tool. I also find this questionable. I don’t know if age is something we want to remotely pursue but the data we have right now for it is unreliable and therefore unusable.