The objective of this analysis was to explore the shooting incidents
that took place throughout New York City from 2006 to 2022. I focused
primarily on 2006 to 2019, as I felt that policy changes, an election,
and the Covid-19 pandemic would exert too extreme an influence on the
later years. I examined both lethal and non-lethal shootings, along with
where and when they occurred, to see what patterns and trends might
emerge.
For this analysis, I used visualizations such as bar charts, line
charts, heatmaps, and maps.
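Several of the chunks in this report keep the `filter(year <= 2019)` step commented out, so as a minimal sketch, restricting the data to the 2006 to 2019 window described above could look like the following. The `toy` data frame and its values are made up for illustration; only the column names `year` and `statistical_murder_flag` come from this report, and storing the flag as a logical is an assumption.

```r
library(dplyr)

# Toy stand-in for the cleaned shooting data (hypothetical values)
toy <- tibble(
  incident_key = 1:6,
  year = c(2006, 2012, 2019, 2020, 2021, 2022),
  statistical_murder_flag = c(FALSE, TRUE, FALSE, FALSE, TRUE, FALSE)
)

# Keep only incidents inside the 2006 to 2019 study window
study_window <- toy %>%
  filter(year >= 2006, year <= 2019)

# Count incidents per year and murder flag within that window
yearly_counts <- study_window %>%
  count(year, statistical_murder_flag, name = "Total")

yearly_counts
```

Applying the filter once, before any grouping, keeps every later table and plot on the same set of years.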
library(tidyverse)
library(DT)
library(lubridate)
library(ggthemes)
library(readxl)
sum(duplicated(nycfiredcopy4B)) # no duplicates
sum(is.na(nycfiredcopy4B)) # 78 NA values in total
summary(is.na(nycfiredcopy4B)) # every variable except incident_key contains NA values
nycfiredcopy4B <- na.omit(nycfiredcopy4B) # drop rows with any NA
sum(duplicated(nycfiredcopy4B)) # still no duplicates
sum(is.na(nycfiredcopy4B)) # no NA values remain
dataview <- nycfiredcopy4C %>%
relocate(incident_key,.after = 'occur_date') %>%
select(-c(latitude,longitude,year,day,month,dayofweek))
datatable(dataview)
nycfiredcopy4D <- nycfiredcopy4D %>%
relocate(month,.before = 'day')
summary(nycfiredcopy4D)
Focusing only on shootings not flagged as murder, Brooklyn has the most
shootings. Far more shootings are not flagged as murder than are flagged
as murder (8,422 not flagged as murder). From the summary we can see
that 2008 saw the most shootings, July is the month with the most
shootings, and the most shootings occur on Saturdays, Sundays, Mondays,
and Fridays.
dailyshooting <- nycfiredcopy4D %>%
group_by(day, statistical_murder_flag) %>%
summarize(Total = n())
datatable(dailyshooting)
ggplot(dailyshooting, aes(day, Total)) +
  geom_line() +
  theme_classic() +
  facet_wrap(~statistical_murder_flag, scales = 'free') +
  theme(axis.text.x = element_text(angle = 90))
There seems to be a small upward trend in shootings during the first 20
days of the month, followed by a steady decline over the last 10.
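One rough way to check the rise-then-fall pattern described above is to fit a straight line to each part of the month and compare the slopes. The sketch below uses made-up daily totals, not the real counts, and assumes `day` is stored as a number from 1 to 31; the split at day 20 simply mirrors the observation in the text.

```r
# Hypothetical daily totals for days 1 to 31 (illustrative, not the real counts)
daily <- data.frame(
  day   = 1:31,
  Total = c(seq(40, 59), seq(58, 38, by = -2))
)

# Fit a straight line to each part of the month and keep the slope
slope_early <- coef(lm(Total ~ day, data = subset(daily, day <= 20)))[["day"]]
slope_late  <- coef(lm(Total ~ day, data = subset(daily, day > 20)))[["day"]]

# A positive early slope and a negative late slope would support the pattern
c(early = slope_early, late = slope_late)
```

When reading the late-month slope, keep in mind that days 29 to 31 do not occur in every month, which by itself lowers their totals.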
daily_monthly_shooting<- nycfiredcopy4D %>%
group_by(month,day,statistical_murder_flag) %>%
summarize(Total = n())
datatable(daily_monthly_shooting)
ggplot(daily_monthly_shooting, aes(day, Total, fill = month)) + # x is the day of the month, matching the axis label
  geom_col() +
  ggtitle('Shootings by Day and Month') +
  theme_dark() +
  scale_fill_manual(values = colors) +
  xlab('day') +
  theme(axis.text.x = element_text(angle = 90), legend.position = 'none')
#month to year trend
shooting_month_year<- nycfiredcopy4D %>%
#filter(year<=2019) %>%
group_by(year,month,statistical_murder_flag) %>%
summarize(Total = n())
datatable(shooting_month_year)
monthyear1 <- shooting_month_year %>%
ggplot(aes(year,Total,fill=month))+theme_classic()+
geom_col(position = 'dodge')+
ggtitle('Shootings per months and year')+
scale_fill_manual(values=colors)+theme(legend.position = 'none')
monthyear1
Non-lethal shootings were at their highest in both July 2007 and August
2008, and at their lowest in February 2018.
shootings_month_weekday<- nycfiredcopy4D %>%
# filter(year<=2019) %>%
group_by(month,dayofweek,statistical_murder_flag) %>%
summarize(Total = n())
datatable(shootings_month_weekday)
ggplot(shootings_month_weekday,aes(reorder(month,Total),Total,fill=dayofweek))+theme_classic()+
geom_col(position = 'dodge', linewidth=8)+ggtitle('Shootings by days of week and month')+facet_wrap(~statistical_murder_flag,nrow=2,scales = 'free')+
scale_fill_manual(values=colors)+xlab('month')
Non-lethal shootings were at their peak on Sundays in July and at their
lowest on Thursdays in February.
nycfiredcopy4D %>%
select(boro,statistical_murder_flag) %>%
group_by(boro,statistical_murder_flag) %>%
summarize(count=n()) %>%
ggplot(aes(reorder(boro,count),count))+theme_classic()+
geom_bar(stat='identity',fill='orange')+geom_text(aes(label=count))+ggtitle('Shooting per borough')+theme(axis.text.x = element_text(angle = 90))+xlab('borough')
nycfiredcopy4D %>%
select(boro,statistical_murder_flag,month) %>%
group_by(boro,month,statistical_murder_flag) %>%
summarize(count=n()) %>%
ggplot(aes(reorder(month,count),count,fill=boro))+theme_classic()+geom_col(position = 'dodge')+
ggtitle('Monthly Shooting per borough ')+theme(axis.text.x = element_text(angle = 90))+coord_flip()+theme(legend.position = 'bottom')+xlab('month')
For non-lethal shootings, July is the month with the most shootings and
February the month with the fewest; in both months Brooklyn accounts for
the largest share. Throughout all the months Brooklyn had by far the most
shootings, with the exception of January, when Brooklyn saw only one more
shooting than the Bronx.
nycfiredcopy4D %>%
select(boro,statistical_murder_flag,dayofweek) %>%
group_by(boro,dayofweek,statistical_murder_flag) %>%
summarize(count=n()) %>%
ggplot(aes(dayofweek,count,fill=boro))+theme_classic()+geom_col(position = 'dodge')+
ggtitle('Daily Shooting by weekday and borough ')+theme(axis.text.x = element_text(angle = 90))+coord_flip()
For non-lethal shootings, Brooklyn saw the highest counts on both
Sundays and Saturdays.
nycfiredcopy4D %>%
select(month,statistical_murder_flag,year,boro) %>%
group_by(year,statistical_murder_flag,boro) %>%
summarize(count=n()) %>%
ggplot(aes(year,count,fill=boro))+theme_classic()+geom_col(position = 'dodge')+
ggtitle('Shooting per year and borough ')+theme(axis.text.x = element_text(angle = 90))+coord_flip()
Of the five boroughs, Brooklyn and the Bronx saw the most non-lethal
shootings every year.
nycfiredcopy4D %>%
#filter(year <= 2019) %>%
group_by(dayofweek,month,statistical_murder_flag) %>%
summarize(count=n()) %>%
ggplot(aes(dayofweek,month,fill=count))+
geom_tile(color='white',linewidth = .5,linetype = 1)+theme_dark()+ggtitle('Heat Map of Shootings by Day and Month')+
geom_text(aes(label = count),col='yellow',size=3.2)+facet_wrap(~statistical_murder_flag,nrow=2,scales = 'free')
From 2006 to 2019, the highest counts of non-lethal shootings clearly
occur on Sundays in June, July, and August (and in some cases May), with
July and August exceeding 200. On Saturdays in May through August,
counts were under 200 but above 150, whereas on all other days of the
week counts tend to be below 150 and even below 100.
nycfiredcopy4D %>%
# filter(year<=2019) %>%
group_by(day,month,statistical_murder_flag) %>%
summarize(Total=n()) %>%
ggplot(aes(day,month,fill=Total))+
geom_tile(color='white',linewidth = .5,linetype = 1)+theme_dark()+ggtitle('Heat Map of non-lethal Shootings per Day and Month')+
geom_text(aes(label = Total),col='yellow',size=2.8)
January 1 has the highest count of non-lethal shootings, at 51.
Altogether, most shootings fall within the first 20 days of the month.
nycfiredcopy4D %>%
#filter(year<=2019) %>%
group_by(year,month,statistical_murder_flag) %>%
summarize(Total=n()) %>%
ggplot(aes(month,year,fill=Total))+
geom_tile(color='white',linewidth = .5,linetype = 1)+theme_dark()+ggtitle('Heat Map of non lethal Shootings per Month and Year')+
geom_text(aes(label = Total),col='yellow',size=2.8)
July 2007 and August 2008 had the most non-lethal shootings. The late
spring to summer months of 2006, 2007, and 2008 saw the greatest number
of non-lethal shootings.
#trend
nycfiredcopy4T <- nycfiredcopy4D
nycfiredcopy4T <- nycfiredcopy4T %>%
select(-c(dayofweek, month, day))
nycfiredcopy4T %>%
# filter(year <=2019) %>%
select(year,statistical_murder_flag) %>%
group_by(year,statistical_murder_flag) %>%
summarise(total=n()) %>%
ggplot(aes(year,total,col=statistical_murder_flag))+
geom_line(linewidth=1.1)+geom_point(size=2.2,col='black')+theme_classic()+ggtitle('Overall annual trend of non-lethal Shootings') +
geom_text(aes(label = total),col='black',size=2.8,vjust=2)+geom_smooth(se=F,lty=2,col='black',linewidth=1)+theme(legend.position = 'none')
In 2008 the number of non-lethal shootings reached its peak of 875,
then gradually dropped to 392 in 2019. This marked downward trend
suggests some substantial change to life in NYC, whether economic or
crime related.
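To put the size of that decline in perspective, the relative change can be worked out directly from the two figures quoted in this report (875 in 2008 and 392 in 2019). The variable names below are only illustrative.

```r
# Counts quoted in the report
peak_2008 <- 875
low_2019  <- 392

# Relative change from the 2008 peak, in percent
pct_change <- (low_2019 - peak_2008) / peak_2008 * 100

round(pct_change, 1)  # -55.2
```

In other words, the 2019 count is a little under half of the 2008 peak.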
nycfiredcopy4T %>%
filter(year <=2019) %>%
select(year,boro,statistical_murder_flag) %>%
group_by(year,boro,statistical_murder_flag) %>%
summarise(total=n()) %>%
ggplot(aes(year,total,col=statistical_murder_flag))+
geom_line()+geom_point()+theme_dark()+ggtitle('annual trend of Shootings and Borough')+
facet_wrap(~boro,nrow=2,scales = 'free_x')+theme(legend.position = 'none')
Brooklyn and the Bronx had the highest peaks of shootings in the
mid-2000s, and since then they have shown the greatest decline in
overall shootings, whereas the other boroughs remained relatively
stable.
nycfiredcopy4D %>%
filter(year <=2019) %>%
select(month,statistical_murder_flag) %>%
group_by(month,statistical_murder_flag) %>%
summarise(total=n()) %>%
ggplot(aes(month,total,fill=statistical_murder_flag))+theme_classic()+
geom_col()+ggtitle('Monthly trend of Shootings ')+geom_smooth(se=F,lty=2,col='black',linewidth=1)+
geom_text(aes(label = total),col='black',size=3.4)+theme(legend.position = 'none')
The plot shows that going into the autumn, shootings decline and hit
their lowest point in February. From there we see a sharp rise,
especially in non-lethal shootings, up to the peak months of July and
August, after which the decline begins again. July is the month with the
highest total count of non-lethal shootings.
nycfiredcopy4D %>%
filter(year <=2019) %>%
select(dayofweek,statistical_murder_flag) %>%
group_by(dayofweek,statistical_murder_flag) %>%
summarise(total=n()) %>%
ggplot(aes(dayofweek,total,fill=statistical_murder_flag))+theme_classic()+
#geom_line(linewidth=.95)+geom_point()
geom_col()+ggtitle('Seven Day trend of Shootings')+geom_smooth(se=F,lty=2,col='black',linewidth=1)+
geom_text(aes(label = total),col='black',size=3.2)+theme(legend.position = 'none')
The highest numbers of non-lethal shootings occur on Sundays and
Saturdays, while the lowest total occurs on Thursdays.
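As a small sketch of how the busiest and quietest weekdays can be read off a summary table rather than from the plot alone, the example below uses hypothetical totals. It assumes `dayofweek` holds full English day names; the explicit factor levels are there so that tables and plots keep calendar order instead of alphabetical order.

```r
# Hypothetical weekday totals (illustrative values only)
weekday_totals <- data.frame(
  dayofweek = c("Monday", "Tuesday", "Wednesday", "Thursday",
                "Friday", "Saturday", "Sunday"),
  total = c(120, 100, 95, 90, 125, 180, 200)
)

# Give the weekday explicit factor levels in calendar order
weekday_totals$dayofweek <- factor(
  weekday_totals$dayofweek,
  levels = weekday_totals$dayofweek
)

# Pick out the weekdays with the largest and smallest totals
busiest  <- as.character(weekday_totals$dayofweek[which.max(weekday_totals$total)])
quietest <- as.character(weekday_totals$dayofweek[which.min(weekday_totals$total)])

c(busiest = busiest, quietest = quietest)
```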
nycfiredcopy4T %>%
filter(year<= 2019) %>%
ggplot(aes(longitude,latitude))+
geom_point(size=1,color='purple')+
#scale_x_continuous(limits=c(min_long,max_long))+
# scale_y_continuous(limits=c(min_lat,max_lat))+
theme_map()+ggtitle('Map of shootings not resulting in murder throughout New York')
Findings
- Shootings tend to increase in the spring and summer months and
  decrease in the autumn and winter months.
- July and August had the highest counts of shootings, followed by June
  and May.
- Brooklyn had the most overall shootings, followed by the Bronx.
- Brooklyn had the most shootings per month, followed by the Bronx.
- Brooklyn had the most shootings per weekday, followed by the Bronx.
- Brooklyn had the most shootings per year, followed by the Bronx,
  except for the year 2018.
- Sundays and Saturdays were the days of the week that accounted for the
  most shootings.
- Most of the high-volume days fell within the first 20 days of the
  month.
- July 2007 and August 2008 shared the greatest number of shootings at
  110, while May through August of 2006, 2007, and 2008, as well as May
  2009, had counts slightly above or below 100.
- From 2006 to 2019, shootings throughout NYC showed a strong downward
  trend, dropping from 875 in 2008 to 392 in 2019; this trend was most
  noticeable in Brooklyn and the Bronx.
Insights:
Despite the year-over-year decline in non-lethal shootings, the city
could still reform policies to provide summertime recreational
activities, shelters for the homeless, drug rehabilitation programs,
therapy, and jobs for people at risk of becoming perpetrators, which
could help reduce the number of shootings seen throughout the summer
months.
