Estimating NY State Renter Occupancy Rates by Tract Using PUMA-Level Models
This is possible because:
We have PUMA and Tract data which represent the same populations
All of the variables which we need in order to predict renter occupancy exist in both the PUMA & tract data.
Use the slider at the center of the map to alternate between actual & predicted values.
The information below describes the model which is built from PUMA data to predict renter occupancy at the tract level.The model is displayed, and the the amount of error in the predictions made from this model is described through visualization & summary.
PUMA microdata is used to model the relationship between rental occupancy and race, age, and income of the head of household.
Tract Data (absent any rental occupancy data) will be then evaluated through this model, and rental occupancy will be estimated for each tract.
Model:
The logistic regression models predict renter occupancy rates by using race, age, and income of head-of-household. Alternate modelling methods (logistic mixed effects, spatial microsimulation) may improve accuracy and minimize error. Different and/or additional predictor variables may also improve predictions.
There is a linear relationship between our predictions & actuals - this is a good sign.
Error is not randomly distributed for all values of renter occupancy.
This model is likely to over-estimate renter occupancy in those tracts which have very low renter occupancy.
This model is likely to under-estimate renter occupancy in those tracts which have very high renter occupancy.