There is no technical documentation for clear reproducibility on the previous population and employment projections. It is difficult to say exactly which methods we used in the past, but it is a combination of using REMI and the Newling model (which focused on population density to predict growth). It is even less clear the past methodology when we consider that we produce different variations of the population/employment projections at a county, municipality, and then TAZ (Traffic analysis zone) level. It is my belief that we produced the county level projections at the county level through the use of REMI (either in-house or outsourced) and then utilized the “Newling model” completely separate from the REMI county level projections for the municipality/TAZ projections.
Based on what I understand, the last projections prior to 2022 were done by Bowen Liu and incorporated REMI (with land development simulation) at the county level and then used the Newling model to separately predict the municipality/TAZ level. To be explicit, if you summed the results of the Newling model produced municipality/TAZ level to the county level, they would not match the REMI level county projections. We also in the past did not try to “disaggregate” county level projections down to the municipality/TAZ level (which is a common method)
This is the most helpful summary of what was done (as stated before, there is no technical documentation for reproducibility for this method): (2019 Methodology)
Given the lack of reproducible documentation, we had to choose logical paths. The following is what was generally done as of July 2022. Each will be described in further detail in later sections.
From REMI, we worked with David Casazza (David.Casazza@remi.com) and Jeff Dykes (jeffrey.dykes@remi.com)
As stated above, county level projections for Lehigh and Northampton counties were taken out of REMI v4.5. We initially tried to replicate previous county level projections with land use development data (e.g. pipeline of the building of warehouses, apartments, etc.).
The previous 2019 REMI model simulation data is available: (2019 Simulation File). You can ask REMI to upload this XML file to your instance of REMI to see the prior simulation.
The reasons why we have not yet incorporated the land use development data into the county level projections via REMI:
Drawbacks of simple, generalized modeling - The methodology described in this (2019 Methodology) to model the impact of building apartments, warehouses, and retail, etc. via construction costs, employee hiring, and sales are dependent on very broad assumptions (e.g. employees per square foot, sales per square foot, internet based statistics). If the modeling of the land use development projects are very generalized and not more realistic, they may over inflate the population and employment forecasts.
Cost of time - If we are to improve the modeling of the land use development projects to make it more realistic, that is extremely time consuming as we may need to reach out to actual businesses and developers to understand more about their operations. We do not currently know whether that time investment will actually improve the accuracy of the forecasts.
Accuracy of REMI forecasts without land development data - Baseline forecasts in REMI are out-of-the-box and require no additional preparation. They are the forecasts without any of the land use development simulation data. During the disaggregation of county level projections into municipality level forecasts (a process described in the next section), we tested the accuracy of the municipality level forecasts against Census municipality level population data and they were satisfactorily accurate. It is difficult to justify the additional work to include and model land use development projects, which may or may not improve the accuracy of forecasts. The process may be more reasonable with more than a single staff member.
In contrast to previous projections, we utilized a disaggregation method of the REMI county level forecasts into each municipality. The general method was endorsed by the University of Tennessee Boyd Center and also used by Bob Diogo, NJTPA (New Jersey Transportation Planning Authority) in their projections.
The University of Tennessee Boyd Center population projections methodology is located: (UT Methodology)
Conversation with Bob Diogo confirming our methodology: (Bob Diogo NJTPA)
We applied the same methodology for the disaggregation of population and employment, these Excel based models are found here:
2022 Municipality Population Model (Population Disaggregation)
2022 Municipality Employment Model (Employment Disaggregation)
The report is made in R Markdown to produce a pdf. It is located here: (2022 Report)
It would make sense to replicate each future report using this as a template as all the graphs and tables just need new data. There is a folder named “Raw Data” in the (2022 Report Folder). You may need to confirm/change the working directories to make sure it is pulling the data from the “Raw Data” folder.