Hot Spots vs Cold Spots

Hot spots show clusters where the model over-predicted value and cold spots show clusters where the model undervalued. NA clusters are areas where the models predictions were accurate or where over-prediction and under-prediction were spatially random.

Our theory is that hop spots are areas that are undervalued by the market and cold spots are over valued by the market.

Assignment of Clusters

Blocks were spatially joined to the to the cluster analysis, taking values the largest collection of cells in a given block. The map below displays the assignment. Cells and Block Groups can be turned off to see the correlation.

Analysis

Block Group level data was taken from the ACS and LODES for 2018.

Housing Value and Income

Median housing value and median household income are summarized below both taken from the ACS.

Cluster Housing_Price_Med Housing_Price_Mean Housing_Price_sd Income_Med Income_Mean Income_SD
Overvalued 291600 349188.4 226405.03 71154 79882.17 42851.91
Undervalued 129300 138913.6 61342.06 43750 46488.57 16305.86
NA 171300 182701.0 71537.41 48361 51897.12 19803.80

The overall distributions are similar except that cold spots have a more significant positive tails. With significantly more blocks having a median housing value of 500,000 dollars and having a median income higher than 150,000.

These differences are statistically significant as shown in the T-Tests Below.

T-Test for Differences of Means, Housing Value

Welch Two Sample t-test: hh_median_price and ll_median_price (continued below)
Test statistic df P value
-10.47 160.2 0.00000000000000000007172 *
Alternative hypothesis mean of x mean of y
two.sided 138914 349188

T-Test for Differences of Means, Income

Welch Two Sample t-test: hh_median_income and ll_median_income (continued below)
Test statistic df P value Alternative hypothesis
-8.595 189.1 0.000000000000003094 * * * two.sided
mean of x mean of y
46489 79882

Housing Density

There appears to be a minor, but significant difference between the clusters in Housing Density. With under valued areas having more housing per acre than over value areas.

Cluster Housing_Units_Per_Dev_Acre
Overvalued 1.704641
Undervalued 1.860646
NA 1.824505
Welch Two Sample t-test: over_housing_density and under_housing_density
Test statistic df P value Alternative hypothesis mean of x mean of y
-1.203 261.5 0.2299 two.sided 1.705 1.861

Developable Acerage

There appears to be no significant difference between develop able acreage in a given block for the clusters.

Cluster Dev_Acre
Overvalued 248543.7
Undervalued 220379.1
NA 243824.2
Welch Two Sample t-test: overvalued and undervalued
Test statistic df P value Alternative hypothesis mean of x mean of y
-0.09986 220.7 0.9205 two.sided 1714 1777

Salary

The main difference is the proportion of low in come jobs in hot spots, which significantly differs from the expected value if these variables were unrelated. See Chi Square Results below:

Pearson’s Chi-squared test: . (continued below)
Test statistic df
1425 2
P value
0.0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000003844 *

Demographics

The distribution of most races is consistent across the clusters, except for white and black. In hot spots, there are significantly less White residents and significantly more black residents. The Chi Square Results back up this assumption.

Pearson’s Chi-squared test: .
Test statistic df P value
90483 6 0 * * *

Jobs

The variation in jobs is also significant, though a clear pattern is hard to determine due to the number of categories. It appears that Food Service jobs are more prevalent in cold spots, as are jobs in the arts. In hot spots professional service jobs seem to be less prevalent, as are other high paying industries such as healthcare, financial services, IT, and management.

The Chi Squared Test indicates these differences are significant from expected values.

Pearson’s Chi-squared test: .
Test statistic df P value
67569 19 0 * * *

Jobs Housing Balance

As expected examining are more urban area changed the relationships in JH balance. Overvalued areas in Orange County have higher JH balance than undervalued areas.

Cluster Jobs_Housing_Balance n
Overvalued 3.129059 144
Undervalued 1.090332 124
NA 3.403056 105
Welch Two Sample t-test: over_jh and under_jh (continued below)
Test statistic df P value Alternative hypothesis mean of x
2.569 157.4 0.01114 * two.sided 3.129
mean of y
1.09