This document compiles a few different plots meant to “kick the tires” on the risk-predictor. The intention is for us to decide on what model to use for the first draft. The document is outlines as follows:
Prediction Density Plots
Prediction Density Plots by Demographic Groups
ROC Curves
Model ECDF
Transition counts between old & new models
A note on definition of terms.
Old Charge Types refers to predictions from the model currently being used with the faulty charge-type categories
New Charge Types refers to the predictions from the model with the charge-type categories capturing more violent and property crime
Descriptions refers to the predictions from the model trained on the raw description data
Main Takeaway We seem to generate a “wider” distribution with the new charge types and descriptions. This makes sense as the optimal model has more trees & more categories than before.
Main Takeaway By and large the same trend as in figure 1 holds between genders and races. Notable is the difference for females and white-defendants in the Description-model predictions.
Main Takeaway This is the AUC on the Re-Arrest-Outcome the model is trained on. We can see that giving the raw charge descriptions increases the AUC marketable and statistically significantly. There is also 1%-point increase in AUC in the new-charge-type vs. old-charge-type models.
Looking at the AUC on the Detain-Outcome in the final regression we see that the old-charge-type predictions reach 0.624 AUC, the new-charge-type predictions reach 0.658, and the raw-description-model predictions reach 0.696.
Main Takeaway The ECDF shows the same results as the histograms in Figure 1. The description-model is able to cover a wider support than both other models, with the new-charge-type model doing marginally better than the current version. This plot is intended to give a ‘bin-width-invariant’ version of figure 1.
Here I am plotting the number of people who were in one of 4 quartiles of risk under the old-charge-type model, and there they have transitioned to in the new-charge-type and description model. Ideally we would want to see the largest count on the diagonal, as this would indicate our models are assigning most people approximately the same risk-probability.
Main Takeaway As expected (given the distribution above), the description-model gives a more diffuse prediction distribution and moves some people up/down a quartile in risk. The new-charge-type model is much more heavily concentrated on the diagonal, putting most people in the same prediction quartile as the old-charge-type model.