Analysis of Racial and Gender Bias in SLCPD's Use of Force and Street Checks, 2014-2017

Aria Cederlof, Kepuali Otuafi, Addison Scanlon, and Kenan Ince
November 21, 2021

Context

Nationally, racial bias in police use of force exploded into the national conversation in 2014 with the killings of Trayvon Martin, Michael Brown, and Eric Garner, all unarmed black men killed by police.
One year later, the Washington Post found that unarmed Black men were seven times more likely than whites to die by police gunfire.
On May 23, 2020, the Salt Lake City Police Department (SLCPD) shot and killed a Latino man, Bernardo Palacios Carbajal, by shooting him from behind while Palacios Carbajal was fleeing police. This shooting, coming two days before the murder of George Floyd, sparked anti-police brutality protests in the Salt Lake City area.

Context

Our study is the first independent study of SLCPD's use of racial profiling since \( 2000 \) (Deseret News, 2000). That study found no evidence of racial bias in ticketing.
SLCPD provides aggregate use of force data by race on their website (SLC Police Use of Force, 2019), but the data may not be cleaned and is limited to the last \( 24 \) months.
Moreover, this data is not compared to the SLC population to test for statistically significant evidence of bias.
SLCPD shared internal spreadsheets with us in order to determine whether there is evidence of bias in their use of force.

Methodology: The Skeptic's View

Statistical tests start by assuming there is no bias (the skeptical viewpoint)
SLC's population is \( 3.3\% \) Black, so we would expect the residents against which SLCPD used force to be \( 3.3\% \) Black as well (if no bias)
Pearson's \( \chi^{2} \) test quantifies how unlikely observed use-of-force demographics are given SLC's demographics (the expected use-of-force demographics under the skeptic's view).

Quantifying Chance with Pearson's Chi-Square Test

Expected proportions: American Community Survey SLC estimate for \( 2014 \), \( 2015 \) Selected Population Tables for \( 2015 \)-\( 2017 \)
- Conducted by Census Bureau
- Best representation of SLC's growing population
Observed proportions: Use of Force and Street Checks data
- Collected by the SLCPD, encompasses all of SLC
- Analyzed primarily by race and gender

Cleaning Force Data (2014)

We provide information about our analysis for \( 2014 \). We share provide the results, but not the methodology, of our analysis for \( 2015 \)-\( 2017 \).
We sought out and removed exact duplicates in the dataset (e.g., a subject charged with multiple crimes).
We removed entries that said “no force” or “race unknown”.
We then plotted the race distributions in our observed use of force data, which will later be compared to the American Community Survey.

plot of chunk unnamed-chunk-10

Comparing to the ACS values

“Asian” population in use of force dataset includes Pacific Islanders, but the ACS considers both groups separately
- we added together \( \text{AA}+\text{PI} \) in the ACS data in order to follow the police department's data.
ACS allows reporting more than one race, but SLCPD only has \( 4 \) monoracial categories
- unclear how to handle this
- we assigned multiracial folks who were white and a member of a racialized group to the corresponding racialized group
- unclear how to match multiracial folks of \( 2+ \) racialized groups to SLCPD dataset (we removed them from the dataset)

plot of chunk unnamed-chunk-13

Hypotheses

The null hypothesis for our dataset is that the proportion of races in our 2014 Use of Force dataset is equal to the proportion of races in the 2014 ACS. In symbols: \[ H_{0}: p_{\text{A}} = 0.079, p_{\text{B}} = 0.033, p_{\text{I}} = 0.019, p_{\text{W}} = 0.74 \]
The alternative hypothesis states that the proportion of races in our Use of Force dataset is not equal to the proportion of races in the 2014 ACS. This could mean that if any of our expected proportions are wrong, then we reject the null in favor of our alternative hypothesis.

Model Assumptions

We used a \( \chi^{2} \) model with \( 4-1=3 \) degrees of freedom. The three conditions that must be checked are:

Independence: SLCPD's use of force against one individual is (usually) not correlated with their use of force against another individual. (Or is it..?)
Sample size / distribution: the expected counts (the number of individuals of each of the four races in the ACS) is greater than \( 5 \) (by far).
Degrees of freedom: we have more than two degrees of freedom.

Chi-square Test Results (Use of Force 2014)

# A tibble: 1 × 4
  statistic p.value parameter method                                  
      <dbl>   <dbl>     <dbl> <chr>                                   
1     2237.       0         3 Chi-squared test for given probabilities

The value of our 2014 \( \chi^{2} \) is 2237.4214612 with a \( p \)-value of less than \( 0.0001 \). We reject the null hypothesis of no difference in racial demographics. There is statistically significant evidence that the distribution of races against whom SLCPD used force is different from the distribution of races in Salt Lake City.

Residuals

The residuals show the difference between our observed and expected counts. Negative numbers show underrepresentation, while positive numbers represent overrepresentation.
The farther the number is from 0, the more over/underrepresented a group is in our data. Residuals show a relative distance between the observed and expected values.
African Americans and Indigenous folks (coded as American Indians/Alaska Natives in the dataset) were overrepresented compared to their proportion in the population (thirty-four and twenty-seven more instances than expected). Asians were fairly underrepresented, with sixteen fewer instances than expected.

Table of Residuals (Use of Force 2014)

race_2014
Asian or Pacific Islander                     Black                Indigenous 
               -16.033295                 34.054482                 27.916891 
                    White 
                -6.426069

Graphing Residuals (Use of Force 2014)

plot of chunk unnamed-chunk-15

Chi-Square Test Results (Use of Force 2015-2017)

# A tibble: 1 × 4
  statistic p.value parameter method                                  
      <dbl>   <dbl>     <dbl> <chr>                                   
1     3196.       0         3 Chi-squared test for given probabilities

The value of our \( \chi^{2} \) is 3196.2268752. The \( p \)-value is less than \( 0.001 \). We reject the null hypothesis of no discrimination.

Graphing Residuals (Use of Force 2015-2017)

plot of chunk unnamed-chunk-18

Heatmap: Use of Force 2014-2017

A geographic heatmap of the number of use-of-force incidents in various neighborhoods of Salt Lake City

Potential Sources of Error

We may have made a Type I error, rejecting the null hypothesis when the null hypothesis was in fact true. This would mean that there really was no difference in the proportion of races, and that the observed overrepresentation of Black and Indigenous people of color in the use of force dataset was due to chance.

Analysis of Street Checks by Race: Context

“Typically a 'street check' or 'police stop' is a practice where police stop a person in public, question them, and record their personal information in a police database. A vehicle check stop is not a street check.” (British Columbia Civil Liberties Association, 2018).
A freedom of information request in \( 2017 \) showed that Edmonton, Alberta, police use street checks disproportionately often against Indigenous and Black people (CBC News, 2017).
Street check data is not made public by SLCPD, but the Department shared their data with us in a collaboration to better understand evidence of potential bias.
As before, we will analyze the street check data from \( 2014 \) separately from the data from \( 2015 \)-\( 2017 \).

Chi-Square Test Results (Street Checks 2014)

The value of our \( \chi^{2} \) is 46.7187277. The \( p \)-value is 3.9889362 × 10^-10.

The p-value calculated is exceptionally small, meaning that we reject the null in favor of the alternative hypothesis. This means that there is significant evidence that the proportions of races in our data is not equal to the model proportions in the American Community Survey.

Graphing Residuals (Street Checks 2014)

plot of chunk unnamed-chunk-20

Chi-Square Test Results (Street Checks 2015-2017)

The value of our \( \chi^{2} \) is 857.9825016. The \( p \)-value is 1.1498775 × 10^-185.

Graphing Residuals (Street Checks 2015-2017)

plot of chunk unnamed-chunk-22

Persons Linked to Cases (2014-2017)

Includes: all persons linked to cases entered into SLCPD’s Records Management System, including Fire Department Investigators and Airport Police Officers
Includes date, time, NCIC category, NCIC Subcategory, IBR code, Subject role, Subject sex, Subject race, City council district, Police zone, Police beat
439,833 individuals

PLC: Exploratory Data Analysis

plot of chunk unnamed-chunk-23

PLC: Heatmap

A geographic heatmap of the number of Persons Linked to Cases in various neighborhoods of Salt Lake City

Conclusions

There is statistically significant evidence of discrimination against Black and Indigenous people in SLCPD use of force (residuals \( +46.7 \) and \( +24.4 \), respectively, for 2015-2017), while Asians and Pacific Islanders are underrepresented in SLCPD use of force (residual \( -19.3 \))
There is statistically significant evidence of discrimination against Black people in SLCPD street checks (residual \( +24.5 \) for 2015-2017), while Asians and Pacific Islanders are underrepresented in street checks (residual \( -5.2 \) for 2015-2017).
This study should be viewed as preliminary and highlights the need for more research into SLCPD's use of force and street checks.

Limitations

Limited time frame
attempt to simplify multiracial people (assigning them to their racialized identity to match SLCPD records)
large proportion of individuals with “unknown” race in SLCPD datasets
not enough data to analyze ethnicity (e.g. Latin@/e folks)
no analysis on age

Limitations

no control for/analysis of crime rates by race or gender
no control for/analysis of locations of various races in SLC
need for more SLCPD context
- police demographics on particular beats could influence force used or if someone was checked
- no analysis of police presence in a given location

SLCPD Reaction

The Salt Lake City Chief of Police, Mike Brown, reacted to a scheduled presentation of this information by stating that “we only use force when we need to”.

Thank you!

Any questions?