Predicting New Home Sales with Google Trends

Tony Tushar

April 20, 2018

Main Question:

New Home Sales and Recessions

New Home Sales and Recessions

Monthly new home metrics (the number of permits, starts, completions, sales, etc.) are largely considered useful economic indicators. How well does Google Trends search data for new homes correlate to actual data? Can Google Trends data serve as a leading economic indicator sooner than standard monthly reports?

Data

Literature Review

Project Components

Initial Time Series Plotting

Checking for stationarity

## 
##  Augmented Dickey-Fuller Test
## 
## data:  NewSalesFRED.TS
## Dickey-Fuller = -1.4544, Lag order = 5, p-value = 0.8041
## alternative hypothesis: stationary

H0: There is a unit root for the series, non-stationary / Ha: There is no unit root for the series, stationary / With p-value greater than 0.05, we cannot reject the null hypothesis and the series is non-stationary

## 
##  Augmented Dickey-Fuller Test
## 
## data:  NewHomesGOOGLE.TS
## Dickey-Fuller = -2.5231, Lag order = 5, p-value = 0.3579
## alternative hypothesis: stationary

H0: There is a unit root for the series, non-stationary / Ha: There is no unit root for the series, stationary / With p-value greater than 0.05, we cannot reject the null hypothesis and the series is non-stationary

Log transformation

Seasonal differencing

Checking again for stationarity

## 
##  Augmented Dickey-Fuller Test
## 
## data:  NewSalesFRED.TSD
## Dickey-Fuller = -7.4592, Lag order = 5, p-value = 0.01
## alternative hypothesis: stationary

H0: There is a unit root for the series, non-stationary / Ha: There is no unit root for the series, stationary / With a p-value less than 0.05, wereject the null hypothesis in favor of the alternative hypothesis that the series is stationary.

## 
##  Augmented Dickey-Fuller Test
## 
## data:  NewHomesGOOGLE.TSD
## Dickey-Fuller = -6.8616, Lag order = 5, p-value = 0.01
## alternative hypothesis: stationary

H0: There is a unit root for the series, non-stationary / Ha: There is no unit root for the series, stationary / With a p-value less than 0.05, we reject the null hypothesis in favor of the alternative hypothesis that the series is stationary.

Tables

RMSE Table

Cross-Sectional Models

Autoregressive Models