Arrests for Marijuana Possession in Toronto

picture of canada and weed

Project Background and Description:

This project uses data on police treatment of individuals arrested in Toronto
for simple possession of small quantities of marijuana.
The data are part of a larger data set featured in a series of articles in the Toronto Star newspaper.


Full information at:

check the info out here

Data at:

check the data out here


Let’s get started!

Data Selection

To better understand the background of adults who continuously use Marijuana, I selected all the arrested individuals with more than one previous police records related to Marijuana. 2002 is excluded due to number of variables are low.

Questions

Although the data are part of a larger data set ,it could still provide some insights of the following:

  • What factor drive people to use Marijuana? Age? Employment status?
  • Which age group depends on Marijuana the most?
  • the trend of using Marijuana during 1997 to 2001

Therefore, a further analysis can be performed in certain area base on the result we found on this project.

Data wrangling

The code for data wrangling listed below:


urlfile <-'https://raw.githubusercontent.com/jayleecunysps/AssignmentforSPS/main/Arrests.csv' 
library(dplyr) 
## 
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union
Arrests<-read.csv(url(urlfile)) 
Arrests <- data.frame(Arrests)

Adultarrests <- subset(Arrests,age>17&checks>0&year<2002)

colnames(Adultarrests) <- c("released","race","arrest_year","arrest_age","sex","employed","citizen","policedatabase_in_records")

Adultarrests$employed <- as.character(Adultarrests$employed)
Adultarrests$released <- as.character(Adultarrests$released)
Adultarrests$employed [Adultarrests$employed == "Yes"] <- "Employed"
Adultarrests$employed [Adultarrests$employed == "No"] <- "Unemployed"
Adultarrests$released [Adultarrests$released == "Yes"] <- "Released"
Adultarrests$released [Adultarrests$released == "No"] <- "Unreleased"
Adultarrests$employed <- as.factor(Adultarrests$employed)
Adultarrests$released <- as.factor(Adultarrests$released) 

Arrests Table

The following paged table contains the full data set of target population.

library(rmarkdown)
paged_table(Adultarrests)

Arrests Summary

The following summary helps us to understanding the sample size better.

We can see mean and median of arrest age and number of records in police database is consistent during 1997 to 2001.

From the summary we can tell

  • Marijuana users does not shift to young group during 1997 to 2001.
  • Frequency of repeated arrests is slightly dropping.
  • Do not have great impact on arrest age
summary(Adultarrests)
##        released        race            arrest_year     arrest_age   
##  Released  :2082   Length:2644        Min.   :1997   Min.   :18.00  
##  Unreleased: 562   Class :character   1st Qu.:1998   1st Qu.:20.00  
##                    Mode  :character   Median :1999   Median :23.00  
##                                       Mean   :1999   Mean   :26.46  
##                                       3rd Qu.:2001   3rd Qu.:32.00  
##                                       Max.   :2001   Max.   :66.00  
##      sex                  employed      citizen         
##  Length:2644        Employed  :1872   Length:2644       
##  Class :character   Unemployed: 772   Class :character  
##  Mode  :character                     Mode  :character  
##                                                         
##                                                         
##                                                         
##  policedatabase_in_records
##  Min.   :1.00             
##  1st Qu.:2.00             
##  Median :3.00             
##  Mean   :2.57             
##  3rd Qu.:3.00             
##  Max.   :6.00
aggregate(cbind(arrest_age,policedatabase_in_records) ~ arrest_year,Adultarrests,mean)
##   arrest_year arrest_age policedatabase_in_records
## 1        1997   26.50923                  2.767528
## 2        1998   26.10129                  2.601293
## 3        1999   26.55424                  2.620339
## 4        2000   26.31659                  2.540335
## 5        2001   26.75529                  2.450151
aggregate(cbind(arrest_age,policedatabase_in_records) ~ arrest_year,Adultarrests,median)
##   arrest_year arrest_age policedatabase_in_records
## 1        1997         23                       3.0
## 2        1998         23                       3.0
## 3        1999         24                       3.0
## 4        2000         23                       3.0
## 5        2001         23                       2.5

Conclusion

  • Current treatment of simple possession of small quantities of marijuana may lack deterrent power.
  • Labor market situation do not play an important factor on Marijuana using, 71% of them are employed.
  • Marijuana using is increasing from 1997 to 2001 while average arrested age is about the same.
  • Younger group is using Marijuana more often than elderly.
  • Government should promote the drug facts and the risk of using Marijuana in middle school.

Suggestion for further analysis

  • For multiple arrested population, we can try to understand what level of treatment has enough deterrent power to stop them from carrying and taking it again.
  • For the employed group, we can try to collect the data of job area and stress level to see if there is a direct relation in between
  • For younger group, we can try to collect the data of the timing and reason (like peer pressure?) of first time using to see what is the main cause to write a solution plan.