My DATA 698 Project will be based on NYC Crime. I was wondering how violent is the city, if some neighborhoods or type of persons are most affected, and if I could identify some patterns for shooting over time as well compare to other crimes. My primary goal is to take the tools and resources learned in the DATA SCIENCE MS courses such as time series, trend, seasonality, cyclical analysis as well as graphical visualizations to portray my findings. The Data sets found contain very useful information to perform such as analysis such as date of occurrence, type of offense, borough, age, race and gender. My hypothesis consists of identifying or try to pin point the specific zones/times more likely to have crime and analyze this deeply to come up with solutions correlating to the techniques used by other Crime Analysis experts in the literature chosen. The literature chosen such as “The crime numbers game: Management by manipulation” and “Understanding new York’s crime drop.” may also be useful to identify any possible fallacy or errors in the data.
A trend exists when there is a long-term increase or decrease in the data. It does not have to be linear. Sometimes we will refer to a trend as “changing direction,” when it might go from an increasing trend to a decreasing trend. A seasonal pattern occurs when a time series is affected by seasonal factors such as the time of the year or the day of the week. Seasonality is always of a fixed and known frequency. A cycle occurs when the data exhibit rises and falls that are not of a fixed frequency.
Exploratory Data Analysis refers to the critical process of performing initial investigations on data so as to discover patterns,to spot anomalies,to test hypothesis and to check assumptions with the help of summary statistics and graphical representations.
It is a good practice to understand the data first and try to gather as many insights from it. EDA is all about making sense of data in hand,before getting them dirty with it.
In addition, after exploring the data as mentioned above we could possibly be able to identify what day of the week certain crimes may be committed, is there a specific time of day when the crimes occurr and which boroughs may be the safest.
The initial exploration for this Project began by me creating a shiny app[dashboard] in R to visualize the Open Source NYC Shooting Data. I created selection options for users to be able to view the data by year(2006-2020), select the type of incident[murder or non-murder shootings] and by borough. In addition to these options further information such as Total Number of Incidents, Percent Change vs. Previous Year and a Heatmap are provided.
This is viewable here: https://johnmazon90.shinyapps.io/nyc_shooting_app_jmazon/
NYC SHOOTING SHINY APP - BY JMAZON
NYC OPEN DATA NYPD Complaint Data Historic
This dataset includes all valid felony, misdemeanor, and violation crimes reported to the New York City Police Department (NYPD) from 2006 to the end of last year (2019). For additional details, please see the attached data dictionary in the ‘About’ section.
List of every shooting incident that occurred in NYC going back to 2006 through the end of the previous calendar year. This is a breakdown of every shooting incident that occurred in NYC going back to 2006 through the end of the previous calendar year. This data is manually extracted every quarter and reviewed by the Office of Management Analysis and Planning before being posted on the NYPD website. Each record represents a shooting incident in NYC and includes information about the event, the location and time of occurrence. In addition, information related to suspect and victim demographics is also included. This data can be used by the public to explore the nature of shooting/criminal activity. Please refer to the attached data footnotes for additional information about this dataset.
The NYPD maintains statistical data which is used as a management tool in reducing crime, improving procedures and training, and providing transparency to the public and government oversight agencies. In 1994, the department implemented CompStat, which through management, statistics, and accountability, successfully drove down crime to record levels not seen since the 1950s.
The department provides up-to-date crime-related statistics in the seven major crime categories on the citywide, borough, and precinct levels, as well as historical crime data. The public can also access this data through the department’s CompStat 2.0 portal.
Tags: shooting, crime, law enforcement, public safety, nypd
Block, C. R., Gould, W., & David Coldren, J. (1984). Ojp.Gov. https://bjs.ojp.gov/content/pub/pdf/ics.pdf
Eterno, J. A., & Silverman, E. B. (2012). The crime numbers game: Management by manipulation. CRC Press.
Monkkonen, E. H. (2001). Murder in New York City. University of California Press.
NYPD Shooting Incident Data (Historic). (n.d.). Opendatanetwork.Com. Retrieved February 21, 2022, from https://www.opendatanetwork.com/dataset/data.cityofnewyork.us/833y-fsy8
NYPD shooting incident data (year to date). (n.d.). NYC Open Data. Retrieved February 21, 2022, from https://data.cityofnewyork.us/Public-Safety/NYPD-Shooting-Incident-Data-Year-To-Date-/5ucz-vwe8/data
Police Department. (2016). NYPD Complaint Data Historic [Data set].
Rosenfeld, R., Terry, K., & Chauhan, P. (Eds.). (2020). Understanding new York’s crime drop. Routledge.