Goal

I am seeking to understand the rates of criminal complaints by borough and then compare this with factors such as unemployment and income levels to determine if there is a correlation amongst them.

Motivation for performing the analysis

My sister just graduated with a master’s degree in education and is about to list the school of preference by borough. I would like to conduct this analysis to help her determine which borough is safest for her to be teaching.

Data Details

Data for this analysis is made up of three components.

Methods

  1. Write code to grab criminal data from the API, unemployment statistics and income & poverty estimates from Excel and store them in SQL

  2. Use the supervised learning approach (Support Vector Machine,Gradient Boosting) to identify which borough has the most criminal activity- rank the boroughs. I will,however,likely look into unsupervised methods as part of the process as well.

  3. Split the data in train & test,and tune a model to detect rate of criminal activity.

  4. Download images for all pages identified and visually confirm that I got what I wanted.

Source Data