R Markdown

This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.

When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:

library(readxl)
library(ggplot2)
ccrb_data<- read_excel("ccrb_datatransparencyinitiative.xlsx",sheet = "Complaints_Allegations")

Visualization 1

Distribution of incidents over different areas in NYC Based on below graph we have the following observations :- 1) The number of incidents have reduced over the year in each borough 2) The highest number of incidents occured in the year 2009 3) Brooklyn Borough has the highest number of incidents. 4) Staten island has the lowest number of incidents reported.

Visualization 2 Distribution of Complaint FileMode vs Complaint File place Based on below graph we have the following observations :- 1) Most of the complaints are filed in CCRB or IAB 2) Some In person and phone complaints are filed n the precint as well. 3) Mayors office mostly receives complaints via email

Visualization 3

Graph to analyze the year with highest frequqency of incidents. We can see that the year 2006 & 2007 has the highest number of incidents.

Visualization 4:

The below chart shows the following observations :- Brooklyn has the highest encounter frequency and the highest nuber of arrests

Visualization 5: The below graph is a summary of complaints distributed over allegations each close year.

Visualization 6:

The most important observation here is that most of the cases were closed with a year of receiving complaint. There are a only 3 exceptions where more time was taken to close the case.

Visualization 7:

The below pie chart indicates that most of the allegations(alomost 50%) were of type ‘Abuse of Authority’.

Visualization 8

Observations :-

Most of the full investigations were conducted for ’Abuse of authority’and majority resulted in No Arrest or Summons.

Visualization 9

Observations :- Close Year vs Frequency of cases closed with the fill of filing year.

Visualization 10

From the below graph we can conclude that majority of incidents have occured on street/highway.

Visualization 11

Summary :-

Exploratory data analysis helps us to explore and understand the data properly. We can check the variables and their relationships.EDA is for visualizingwhat the data can tell us beyond the formal modeling or hypothesis testing task.

  1. The number of incidents have reduced over the year in each borough
  2. The highest number of incidents occured in the year 2009
  3. Brooklyn Borough has the highest number of incidents.
  4. Staten island has the lowest number of incidents reported.
  5. Most of the complaints are filed in CCRB or IAB
  6. Some In person and phone complaints are filed n the precint as well.
  7. Mayors office mostly receives complaints via email
  8. We can see that the year 2006 & 2007 has the highest number of incidents.
  9. Most of the allegations(alomost 50%) were of type ‘Abuse of Authority’. 10)Most of the full investigations were conducted for ’Abuse of authority’and majority resulted in No Arrest or Summons. 11)Majority of incidents have occured on street/highway.