A Quantiative Spatial Model of the Cleveland Metropolitan Region, Version 1.0

Executive Summary

  • The Cleveland metropolitan area, having built the Midline, is expected to have about 90,000 more residents than the status quo scenario (on a multi-decade horizon). About a tenth of these individuals will reside in Fairfax, with the rest being scattered throughout the region.

  • The metropolitan area is expected to gain 42,000 jobs (again, versus baseline), with three-quarters of these positions being generated in the Midline’s vicinity.

  • Residential floorspace prices in Fairfax are expected to rise about 13% more than in the baseline scenario. This likely understates the overall rise in property values, as prices in this area are already rising quickly.

  • The highest gains in job growth and property values will be along the Euclid Avenue corridor, which is already a significant employment center. The highest population growth will be south of this area, in residential sections adjacent to the Midline. Average wages will rise across the region, but especially towards the Opportunity Corridor. Differential demand for housing and commercial real estate means that property value growth will be robust along the Midline, and especially to the north, but the southern tip (near the Blue/Green Lines) may end up being oversupplied.

  • These findings reinforce the impression that the Midline will generate significant spillover effects, and this will be felt most acutely in adjacent neighborhoods, i.e. Fairfax and Central. Given the magnitude of these effects, a carefully calibrated value-capture instrument could help the local community benefit from these changes.

Introduction

A central conceit of urban economics is the uneven distribution of economic activity across space. This is both due to exogenous (i.e. a priori geographical phenomena, such as the location of Cleveland at the outlet of the Cuyahoga River) and endogenous (relating to the interplay of agglomerative forces which arose after the city’s founding) factors. (Redding and Rossi-Hansberg 2017) In the previous century, the general equilibrium impacts of these factors were understood merely theoretically. It was not until economics embraced computing that the promise of quantitative spatial modelling was understood. Using real-world data and a set of parameters, economists and urban planners have begun to use QSMs as a means of considering the impact of hypothetical interventions on the metropolitan fabric.

At its core, a QSM is a set of equations which a computer solves and resolves as conditions change. Moving a resident from one neighborhood to another may improve the household’s utility, but the impacts of this decision change the parameters of life in the location they left. This is evident when one considers the experience of population decline on Cleveland’s East Side over the course of the late 20th century (negative, reinforcing externalities like crime and disinvestment) or New York’s Lower East Side decades earlier (positive, associated with a reduction in overcrowded tenements and overburdened public services). When running a QSM, therefore, it is essential that the parameters used (mostly elasticities) are well calibrated. These values, represented as a slew of Greek letters in the code, control the magnitude of the impacts generated by piecemeal changes, e.g. the hit to utility engendered by an incremental increase in congestion. In line with contemporary empirical methods, various studies have exploited exogenous shocks in order to generate reasonable estimates of such values, most notably (for this purposes of this report), Ahlfeldt et al., 2015, (Ahlfeldt et al. 2015) which uses the unique circumstances surrounding the division and reunification of Berlin over the course of the 20th century in order to estimate reasonable agglomeration/congestion elasticities. That study, and the parameters it generated, are at the core of almost all modern QSMs.

The Midline is a considerable endeavor with radical implications for the Cleveland area. The magnitude of the jobs created, the buildable land brought to market, and the amenities improved will impact the economy not only of the adjacent neighborhoods but also the metropolitan region as a whole. A narrow focus merely on the neighborhoods most impacted by the project could miss general equilibrium effects with crucial policy implications. This model, as I have calibrated it, predicts 90,000 additional residents for the Cleveland metropolitan area over the scenario in which the Midline is never created. These residents’ purchasing power and claims on public infrastructure imply a deeply divergent path for policymakers, urban planners, and the business community.

Methodology

This model, taking existing data into account, assumes that the current spatial distribution of households and workplaces represents a competitive general equilibrium resulting from decades of workers and firms allocating and reallocating throughout the urban area. Utility is thus maximized to the greatest extent. Assuming this, the model then “inverts” the data, producing the distribution of amenities, productivity, and land uses that match the observed economic geography of the region. By modifying these figures according to plans for the Midline, the equilibrium is disturbed. Individuals and firms gain arbitrage profits by changing their spatial position. The model then sees where aggregate utility is especially imbalanced, moves individuals away from areas where they receive low utility, and towards those where they would experience higher benefits. In the open city model, the metropolitan area also grows or shrinks with reference to an “outside option” represented by the utility gained by moving to another metropolitan area. The model performs these calculations until there are no or barely any moves which would enable individuals to arbitrage the system. This is assumed to be a new general equilibrium.

Limitations

QSMs are inherently quite limited. The authors of the QSM which I am building upon (the Cities Spatial Model) name several considerations which limit whole-cloth application of this methodology to real-world cities. They mention(Cole et al. 2024):

  • A lack of a time component to the results. My findings therefore represent an eventual “settling point” once the imbalances generated by the Midline are internalized in the price of land and housing as well as the distribution of workers and households across the metropolitan area. One can assume that it will take multiple decades for this process to be fully realized.

  • Like most basic economic models, this QSM has a representative agent, meaning that all individuals are assumed to have the same basic preferences. In real life, this is obviously a gross simplification. For the purposes of high-level modelling, however, it may make more sense to assume the vast majority of individuals are broadly trying to get the most they can out of a limited salary or wage and will move wherever they assess has the highest economic promise and amenities within the constraints of their budget.

  • This model assumes a single production sector, meaning that all industries combine labor and capital in similar ways. This allows us to generate a homogeneous measure of total factor production (TFP), but it is not terribly realistic. Consider the measures of productivity below as a composite of the various economic activities present in the Cleveland area.

  • This model assumes full employment and freedom of movement. Given Cleveland’s history of prolonged, elevated unemployment, as well as the challenges which segregation and information asymmetries have posed, these assumptions also seem unrealistic, though it should be noted that unemployment in the Cleveland-Elyria MSA is now in line with national averages (U. S. Bureau of Labor Statistics, n.d.) and that many areas which were hitherto off-limits to Black Clevelanders due to racial covenants and other manifestations of systemic racism are now rapidly diversifying. (Keating 1994)

  • Finally, as with many classical economic models, this QSM assumes perfect competition and rationality. This assumption lies at the crux of many critiques of quantitative economics, and is worth taking seriously, but at the level of abstraction this model operates at, competition and rationality isn’t far off mark. The market for housing and urban land in the US is by and large a free market, and self-interest is a chief concern of most actors involved.

Why use a QSM despite these limitations? As I have argued, none of these are fatal for the purposes of this back-of-the-napkin report. Furthermore, it is my view that having some sort of grounding scenario is better than none, even if there are obvious flaws. We may answer basic questions regarding the proposal (e.g. Will there be any appreciation in property values? Will there be any population growth?) while also understanding that all exact figures are meant to be interpreted with a grain of salt.

Data

The IGCities R package requires five spatial datasets covering the entirety of the urban/commuting area in question: the residential population, the workplace population, the area of each spatial unit, average floorspace prices (per square foot or meter), and commuting time between the units. In general, the spatial unit in question should be as granular as possible while also remaining meaningful in terms of sample size. I therefore chose to employ Census tracts as my basic unit of analysis. Blocks would have been even more granular, but would have been less meaningful especially regarding my index of floorspace prices.

1. Residential Population

I derive residential population by census tract from the 2023 American Community Survey (accessed via IPUMS). (Ruggles et al., n.d.) The population of tracts is heterogeneous across the metropolitan region since they are reapportioned periodically to ensure they average about 4,000 residents, though suburban tracts are generally more populous than urban tracts due to continued depopulation and lagged reapportionment.

2. Workplace Population

The Census Bureau tracks workplace population with its LEHD Origin-Destination Employment Statistics (LODES) survey, the most recent of which was published in 2023. (U. S. Census Bureau, n.d.) Workplace population is far more concentrated than residential population with particular clusters in Downtown, University Circle, along Euclid Avenue, and in suburban industrial/business parks.

3. Area

This was calculated easily by downloading a Census tract shapefile, reprojecting to an Ohio-specific projection, and then using QGIS to determine the area. As one would expect, the largest tracts are in the rural fringes of the metropolitan area while the smallest are in dense urban cores.

4. Floorspace Price per Square Foot

I was able to impute price per square foot (PPSF) by using data from Redfin, a real estate listing website. (Redfin, n.d.) For the purpose of this report, I assume that the relative inter-neighborhood price of residential floorspace is similar to that of commercial and industrial buildings. One could question this assumption, but unfortunately there few public sources of going rates for commercial floorspace in the region. Redfin publishes longitudinal data in various geographies, but not Census tracts. I therefore downloaded the most recent price per square foot data (generally from the first quarter of 2025) for Cleveland area neighborhoods and ZIP codes. Redfin defines “neighborhoods” in a unique and unreliable manner, meaning that determining their boundaries required scraping Redfin and searching raw HTML for Google Polylines, a compact system of storing spatial data. ZIP code geographies were easier to attain and I eliminated those which were almost entirely covered by neighborhoods.

Median PPSF by neighborhood
Median PPSF by neighborhood
Median PPSF by ZIP code
Median PPSF by ZIP code

Since neighborhoods are generally contained within ZIP code regions, this presented an opportunity to impute more granular data on floorspace prices using a technique called areal weighted interpolation. This involves leveraging discrepancies in areal statistics between overlapping geographies with known population densities to estimate statistics where the geographies do and do not overlap. The way this works is simple: imagine there is a neighborhood completely enclosed within a ZIP code. The number of households in the neighborhood is 500 versus 1000 in the ZIP code as a whole. Suppose we know that average household income is $750 within the neighborhood but $1000 in the ZIP code. We can estimate the average household income in the “remainder” of the ZIP code by solving for \(x\) within the expression \(pop_n*income_n+(pop_z-pop_n)x=pop_z*income_z\). In this example, the average household income in the remainder is $1250.

Using IPUMS, I was able to access block-level (very granular) counts of housing units, which then functioned as my “population” layer. This allowed me to impute (using the method discussed above) PPSF for the ZIP code areas outside of Redfin-identified neighborhoods. It bears noting that Redfin tracks median rather than mean PPSF, a distinction which means that this model assumes property values in a given neighborhood are normally distributed, something which seems more plausible in certain areas than others.

Median PPSF by neighborhood/ZIP code remainder
Median PPSF by neighborhood/ZIP code remainder

Instead of performing areal weighted interpolation again, this time with Census tracts, I instead chose to convert average PPSF figures into a point layer and used inverse distance weighted (IDW) interpolation to transform these data into a continuous raster. I chose to do this because the interpolated neighborhood/ZIP code data was sufficiently granular, and market-rate property values generally change gradually, barring a significant gap in the urban fabric (such as a controlled-access highway or a river). With this raster layer I then calculated the mean PPSF per tract. Familiar patterns emerge from a visualization of these data, namely low prices along the East Side corresponding to Cleveland’s most impoverished neighborhoods and higher prices in the direction of Shaker Heights, Lakewood, and other wealthier enclaves.

5. Commute Time Matrix

The final necessary ingredient is a matrix of commute times (ostensibly in minutes) between every possible combination of spatial units. Ideally, there are routing algorithms which could perform these sorts of calculations, but given the number of paths this would involve—\(166*166=27,556\)—it will take a lot of computing power to generate a full matrix. I am working on figuring out how to efficiently carry out this task, but in the meantime I use distance as a proxy, assuming an in-town average driving speed of 30 km/h. This is likely to differ a good deal from actual driving times, as the Cleveland area is connected by highways which hinder movement between adjacent neighborhoods even as they facilitate movement between suburbs and the city center, a dynamic which aggravates the region’s spatial inequalities.

Inversion

Having prepared the inputs, I can begin constructing the model. The first step is “inverting” the observed data, that is, using the outputs of the data-generating process (the local economy) to uncover the fundamentals which drive its spatial composition (productivity, amenities, etc.).

# put population 1 for empty tracts
current_data[is.na(current_data)] <- 1
current_data[] <- lapply(current_data, function(col) {
  if (is.numeric(col)) {
    col[col == 0] <- 1
  }
  return(col)
})

# prep data for inversionModel method
N=as.matrix(nrow(current_data))
L_i=as.matrix(current_data$home_pop)
L_j=as.matrix(current_data$work_pop)
K=as.matrix(current_data$X_area)
t_ij=naive_dist_matrix
Q=as.matrix(current_data$X_idw_mean)

# invert the model, converging after 1000 iterations (or sooner)
inverted_values <- inversionModel(N=N,L_i=L_i,L_j=L_j,Q=Q,K=K,t_ij=t_ij)

# extract values from inverted model
a=as.matrix(inverted_values$a)
b=as.matrix(inverted_values$b)
varphi=as.matrix(inverted_values$varphi)
w=as.matrix(inverted_values$w)
u=as.matrix(inverted_values$u)
Q_norm=as.matrix(inverted_values$Q_norm)
ttheta=as.matrix(inverted_values$ttheta)

Total Factor Productivity

The first of these is total factor productivity (TFP). In short, TFP is the ratio of outputs to inputs, or how efficiently a given firm, polity, or place combines labor and capital to create value. Generally this is expressed as the value \(A\) which solves the expression (a Cobb-Douglas production function, to be more precise) \(Y=A*K^\alpha*L^\beta\) where \(Y\) is output, \(K\) is capital input, and \(L\) is labor input. IGCities breaks TFP down into two main components: endogenous TFP and exogenous/baseline TFP. The first of these takes into account all factors which shape productivity, including externalities/spillovers generated by adjacent places, such as the knowledge agglomerating around universities or the enhanced access to labor markets provided by highway access. The latter speaks to the “inherent” productivity specific to the place, including infrastructure, capital stock, and local workforce.

The spatial pattern of \(A\) (taking into account externalities) and \(a\) (baseline) are generally similar. Poorer areas on the East Side and rural areas on the margin of the metropolitan area have lower productivity. That being said, subtracting \(a\) from \(A\) demonstrates that spillovers play a crucial role in the enhancing local output in Downtown and along several corridors radiating from the central business district. Interestingly, the model picks up on the presence of key transportation corridors (I-271, I-77, I-90, Euclid Avenue) without these explicitly being in the input data!

Amenities

Another crucial factor which determines the spatial composition of an urban area is the presence of amenities, that is, anything which creates value which might be crystallized in market value but is not directly related to production. Households may place a premium on living close to a park, good schools, diverse restaurants, and natural beauty, to give a few examples. As with productivity, the amenities may spillover into nearby areas. \(B\) therefore represents total amenities, including endogenous factors. The two most striking factors are the high level of amenities on the West Side versus the East and the prominence of rural tracts. Comparing this with \(b\), however, one realizes that there are actually two distinct dynamics at play. The West Side boasts high baseline amenities, likely due to the presence of the lake, public beaches, and ample parks. Places like Lakewood are nice in and of themselves. In rural areas, however, the amenities are almost entirely spillovers. In other words, the premium is the result of being far from undesirable areas with bad externalities. This dynamic represents the driving force behind historic and continuing flight, especially among White residents, from Cleveland proper.

Density of Development and Floorspace Use

Model inversion also conjectures the density of development (the density of floorspace brought to bear for residential and commercial uses), signified by the letter \(\phi\) and the share of that space under commercial/industrial use (\(\theta\)). The first map below shows the natural log of \(\phi\), demonstrating—as one would expect—more floorspace in urban areas with especially high values in Downtown, Euclid Avenue, and University Circle. These specific areas actually have several times more floorspace than any other tract in the area, which is the reason why I logged the data. Otherwise, the subtle distinctions between urban and rural would be illegible.

The following map shows that commercial development is especially prevalent along the Cuyahoga River, the I-271 corridor, outer Euclid Avenue, and Downtown. Again, it is impressive that a model with simple inputs is able to correctly identify individual industrial/business parks as well as the airport and important logistical hubs.

Wages

Finally, inverting the model generates a vector of wages paid out in each work location. Again, places with large amounts of workers and high productivity are prominent. Those with high amenities do not necessarily pay out the highest wages. These figures are important, however, as this model simulates individual workers’ attempts to maximize utility given their budget. Locations where individual workers can live cheaply while earning high wages and enjoy good amenities will quickly be arbitraged away by higher prices and the disamenities of congestion.

Solving the Model

Having reviewed my inputs, I can run a few simulations. This will involve allowing households across the metropolitan area to move, switch jobs, reassess their budgets, and even leave the region altogether. The result will be a snapshot of the local economy in equilibrium, a situation where no one may move without reducing their overall utility. In the real world, economies are never in equilibrium due to external shocks and “stickiness” within markets, especially for labor and housing. That being said, the equilibrium can be understood as a general trajectory towards which an economy will eventually settle. For the purposes of this task, we will compare the equilibrium produced by the existing fundamentals of the economy, i.e. the “do nothing” scenario, versus the circumstances produced by the execution of the Midline project.

Since IGCities normalizes all inputs (that is, it adjusts each vector’s range to match the other inputs) simply running the model will fail to produce meaningful results. Instead, we will compare the relative values generated by each scenario, allowing us to make statements along the lines of: “the project will result in land value increases of X% versus baseline.”

Baseline

Below, I call IGCities’ solveModel method to generate the baseline scenario. This requires our original inputs as well as values generated by inverting the model. This simulation quickly converges (155 iterations) on an equilibrium because Cleveland, as a real-life city, contains few opportunities for spatial arbitrage. This generates a list of matrices with the new, post-equilibrium population, wage, floorspace data, etc.

model_baseline <- IGCities::solveModel(
            N=N,
            L_i=L_i,
            L_j=L_j,
            K=K,
            t_ij=t_ij,
            a=a,
            b=b,
            varphi=varphi,
            w_eq=w,
            u_eq=u,
            Q_eq=Q_norm,
            ttheta_eq=ttheta
)

Counterfactual (closed)

Having generated the baseline scenario, I will now simulate the impact of the Midline. Ideally, I would have more exact data on how the project would modify the amenities, floorspace, and productive capacity, but for the purpose this rough simulation, I choose to increase \(\phi\), \(a\), \(b\), and \(\theta\) by a standard deviation in all of the tracts which intersect with the site. This approximates the combination of parks, industrial/commercial facilities, and improved infrastructure implied by the Site Fund’s plan.

midline_tracts <- which(current_data$GISJOIN %in% c("G3900350114501","G3900350197200","G3900350198400","G3900350114800"))

a_ <- a
b_ <- b
varphi_ <- varphi
ttheta_ <- ttheta

varphi_[midline_tracts, ] <- varphi[midline_tracts, ] + sd(inverted_values$varphi)
a_[midline_tracts, ] <- a[midline_tracts, ] + sd(inverted_values$a)
b_[midline_tracts, ] <- b[midline_tracts, ] + sd(inverted_values$b)
ttheta_[midline_tracts, ] <- ttheta[midline_tracts, ] + sd(inverted_values$ttheta)

Unmodified, IGCities does not allow migration into and out of a given urban area. This implies the city is a closed system. This may not matter for interventions which fail to significantly alter the attraction of Cleveland versus other cities. For example, the opening of a new park may lead more people to move to a neighborhood, but it is unlikely to cause many people to move to the Cleveland area who would otherwise live in a different state. In equilibrium, metropolitan populations are balanced apart from natural change. The aggregate utility within Cleveland would equal that of its outside options, meaning that individuals (on average, individual experiences may differ) do not gain from moving to Columbus or Detroit. This may actually be a reasonable assumption (at least regarding Cleveland vs. the rest of the world) because the metropolitan area’s population has varied within a band of ~150,000 residents since at least 1960, making it one of the most “stable” metro regions in the country despite dramatic internal shifts, most prominently the rapid development of the suburbs at the expense of Cleveland itself.

model_closed <- IGCities::solveModel(
            N=N,
            L_i=L_i,
            L_j=L_j,
            K=K,
            t_ij=t_ij,
            a=a_,
            b=b_,
            varphi=varphi_,
            w_eq=w,
            u_eq=u,
            Q_eq=Q_norm,
            ttheta_eq=ttheta_
)

Counterfactual (open)

That being said, the scale of the Midline project means that there is a strong chance that it could meaningfully increase Cleveland’s aggregate utility, generating in-migration. This requires modifying IGCities’ solveModel function to increase the metro area’s population until this spread is closed. Since adding 1 individual per iteration will not capture the full extent of the population increase associated with the Midline, I modify the function call to increment population by 200 every step. This prevents the model from precisely reaching a level of aggregate utility which equals outside utility, so the model never properly converges. We can tolerate an error of ±200, so this isn’t a terrible fault. I choose to cut off the model after 300 iterations, since errors fail to shrink after this point.

This modification incorporates two new inputs: U_bar (the outside option utility) and step (the amount of inhabitants to add or subtract in each iteration). Since Cleveland might already be in equilibrium with the outside world (meaning that its utility matches the outside option), my simulation uses the baseline scenario’s aggregate utility as the outside option. Since the Midline’s interventions increase utility, the model will increase population until outside and inside utility is equalized.

model_open <- solveModel_open(N=N,
            L_i=L_i,
            L_j=L_j,
            K=K,
            t_ij=t_ij,
            a=a_,
            b=b_,
            varphi=varphi_,
            w_eq=w,
            u_eq=u,
            Q_eq=Q_norm,
            ttheta_eq=ttheta_,
            maxiter=300,
            step=200,
            U_bar = model_baseline$U)

Results

Population

The open city model projects a metropolitan population 4.11% larger than the baseline scenario. Given a 2024 estimate of 2,171,877 residents (US Census Bureau), we might therefore expect an overall population gain of about 90,000 residents across the entirety of the region.

Population change
Metric \(\Delta\) vs. Baseline
Population change (%) +4.11%
Population change (raw) +89,157
Population change by tract (min.) -357 (Central)
Population change by tract (med.) +108
Population change by tract (mean) +137
Population change by tract (max.) +5,977 (Central/Fairfax)

This gain would be distributed throughout Cleveland, with most neighborhoods only gaining a couple dozen residents. Most of the action is adjacent to the Midline itself, with the area between Opportunity Corridor and Central/Quincy Avenues gaining several thousand residents. Interestingly, some of the tracts near but not directly abutting the Midline lose a couple hundred residents. In the closed city model, a broader pattern emerges which sees suburbs lose population as population rises in Cleveland proper.

Regarding jobs, the open city model sees the creation of 41,800 direct and indirect jobs, the majority of which are concentrated in one area: the northern part of Fairfax and the Euclid Avenue corridor. A smaller (but still significant) cluster emerges along the Opportunity Corridor between the East 79th and Woodhill Green/Blue Line stations. Few new jobs are created elsewhere, and in the closed city scenario Downtown, University Circle, and several industrial parks actually lose jobs due to firms relocating to the Midline.

Worker population change
Metric \(\Delta\) vs. Baseline
Worker population change (%) 4.12%
Worker population change (raw) +41,800
Worker population change by tract (min.) -10 (Warrensville Heights)
Worker population change by tract (med.) +1
Worker population change by tract (mean) +67
Worker population change by tract (max.) +28,502 (Fairfax/Health Tech Corridor)

Wages

In the open city model, wages in the average neighborhood increase by only 0.1% whereas in the closed city model wages increase by 1.3%. This discrepancy is due to the impact of in-migration driving down wages. Such an outcome may not result from outright wage decreases but may function as it does in other high-cost cities—employers let real wages fall by failing to compensate workers for higher prices elsewhere in the economy.

Wage change
Metric \(\Delta\) vs. Baseline
Change in mean wage (open) +0.1%
Change in mean wage (closed) +1.3%
Change in mean wage by tract (open, min.) -0.2% (North Madison)
Change in mean wage by tract (open, med.) 0%
Change in mean wage by tract (open, mean.) 0.1%
Change in mean wage by tract (open, max.) 28.7% (Kinsman/Opportunity Corridor)

In the both the open and the closed city models, wages increase by 7%-28% along the Midline, with the strongest gains in the southern sections of the site. Wages increase less along Euclid Avenue because this is already an area with comparatively high wages (the Cleveland Clinic, among other important employers, are in this area). In the open city model, everywhere beyond about a half-hour drive from the Midline experiences slight wage declines.

Floorspace Prices

Price per square foot, in general, increases rapidly along the northern section of the Midline, moderately along the middle, and actually decreases towards the south. This is likely due to very high demand for living and working space at the north as well as a slight oversupply problem at the south. Otherwise, floorspace gets more expensive in the swath surrounding the Midline, cheaper in a band encompassing the West Side and the outer East Side, and then expensive again in further areas.

Floorspace price change
Metric \(\Delta\) vs. Baseline
Change in price per square foot (open) +0.2%
Change in price per square foot (open) -2.6%
Change in price per square foot (open, min.) -1.4% (Kinsman/Opportunity Corridor)
Change in price per square foot (open, med.) +0.1%
Change in price per square foot (open, mean) +0.2%
Change in price per square foot (open, max.) +33.6% (Fairfax/Health Tech Corridor)

Summary

As the table below demonstrates, the Midline would have a significant impact on the economy of the Cleveland region, though the areas which would see the most dramatic changes would be concentrated in the immediate vicinity of the project, namely, Fairfax and Central. In the open city scenario, many of the benefits generated by new development would be diminished by new entries into the labor market. Wages, floorspace prices, and consumer market access would remain stagnant as competition heats up for jobs and places to live, leading to congestion in the market. That being said, the impact on Fairfax’s Census tracts are more potent, especially in directly impacted areas. The table below also includes a measure of commuter market access (CMA)—that is, a composite measure of access to employment. In both the open and closed city scenarios, workers’ access to jobs increases, which is to be expected, though only modestly.

It is important with these projections to note that these are all relative impacts due specifically to the modifications described above. Cleveland is already undergoing significant change due to new developments, the shifting attraction of other metropolitan regions, and broader changes in the economy as a whole. Therefore, one should think of the values in the table below as changes above the “do nothing” scenario in which the Midline never existed. Price per square foot may therefore (and probably will) increase by more than 13%, but that the Midline’s presence means that prices are 13% higher than what they would be otherwise.

Metric Open (Cleveland area) Open (Fairfax) Closed (Cleveland area) Closed (Fairfax)
Population +90k +9k +0 +8k
Jobs +42k +31k +0 +28k
Wages +0.1% +12% +1.3% +13%
PPSF +0.2% +13% -2.6% +10%
CMA +0.6% +1.9% +1.8% +3.1%

Discussion

The results of this simulation suggest that the Midline could add to the already significant price pressures which have shaped the market for housing and land in Fairfax since the pandemic. This lends credence to the idea that spillovers from the Midline could lead to displacement and that a community-led housing/land trust would make financial sense for local stakeholders as well as outside investors interested in supporting regeneration efforts on the East Side. That being said, it is important to underscore that, for the most part, most major spillovers are concentrated in the immediate neighborhood of the Midline, with only minor effects on areas a middling distance away. This is probably due to the fact that there is still ample land in Fairfax/Central which could be used for infill and densification, largely due to the abandonment of residential and commercial properties. A value capture instrument would do well to include holdings across the East Side—diversification will be important for attracting investors and smoothing returns—but any instrument which is framed explicitly in terms of its connection to the Midline should focus on the areas immediately nearby. It would be a worthwhile exercise to deepen our understanding of the specific subset of plots which are both Midline-adjacent and held by county and local land banks.

Another consideration which arises from the simulation is the need to ensure economic activity is drawn from the northern end of the project area towards the south. Businesses, when considering where to locate, will naturally gravitate towards the Euclid Avenue corridor, with its superior transportation infrastructure and existing agglomeration effects. Prudent planning must be employed in order to seed development in the Kinsman/Opportunity Corridor neighborhood. This may require different financial inducements, anchor firms/institutions, and intelligent urban design which invites recreational and business activity down the Greenway. Intensifying development in this area is also essential as a means of taking advantage of the neighborhood’s ample public transportation links, especially rapid transit.

Future Directions

Moving forwards, I hope to use these results as part of a financial model which will explore the possibility of pooling land bank holdings in Fairfax and creating a trust which raises capital for investment into housing construction/repair while returning most of its proceeds to current residents of the community, thus averting the threat of displacement in the face of steep property value appreciation. I already have access to detailed data on local land banks’ holdings, including appraisals, the condition of individual properties, etc., and applying these results (especially PPSF) to this data could be a first step in estimating a cap rate for the trust. This could also involve imputing appreciation in land values (rather than merely home values) using a regression on the existing relationship between PPSF and land value in the region. Plugging projected PPSF would generate the future value of publicly-held buildable land under a variety of scenarios, especially different zoning regimes.

Citations

Ahlfeldt, Gabriel M., Stephen J. Redding, Daniel M. Sturm, and Nikolaus Wolf. 2015. “The Economics of Density: Evidence from the Berlin Wall.” Econometrica 83 (6): 2127–89. https://doi.org/10.3982/ecta10876.
Cole, Hugh, Victoria Delbridge, Maria Del Mar Gómez, Nick Tsivanidis, and Román David Zárate. 2024. “Cities Spatial Model: A Quantitative Spatial Equilibrium Model to Guide Urban Planning, Policy, and Infrastructure Investment Decisions.”
Keating, Dennis W. 1994. The Suburban Racial Dilemma: Housing and Neighborhoods. Conflicts in Urban and Regional Development. Temple University Press.
Redding, Stephen J., and Esteban Rossi-Hansberg. 2017. “Quantitative Spatial Economics.” Annual Review of Economics 9 (1): 21–58.
Redfin. n.d. “Redfin Monthly Housing Market Data.”
Ruggles, Steven, Sarah Flood, Matthew Sobek, Daniel Backman, Grace Cooper, Julia A. Rivera Drew, Stephanie Richards, Renae Rogers, Jonathan Schroeder, and Kari C. W. Williams. n.d. “IPUMS USA.”
U. S. Bureau of Labor Statistics. n.d. “Unemployment Rate in Cleveland-Elyria, OH (MSA).”
U. S. Census Bureau. n.d. “LEHD Origin-Destination Employment Statistics, 8.3.”