Universidad Privada Boliviana / Universidad del Pacífico (Lima, Perú)
Prof. J. Dávalos (Ph.D.)
Error correction representation of cointegrated series
Cointegration examples
Consumption function
Total consumption = permanent + transitory, both I(1). Given the long run share between permanent income (\(y_p\)) and permanent consumption(\(c_p\)) \(\beta\):
\(C_{total} = \beta y_p + c_{trans}\)
where \(C_{total}\), \(\beta y_p\) are expected I(1) while \(c_{trans}\) is I(0).
Forward vs spot rates
Accurate pricing expectations imply that the futures market is a good predictor of the current (spot) price:
\(f(t+1)_t = s_{t+1} + \varepsilon_{t+1}\)
Today’s (t) forward price for (t+1) is on average equal to the observed (spot) price on t+1. Both prices are well known not to be I(0). Most likely I(1).
If this is true, taking advantage of the forward price will not lead to systematic gains. If it does not hold, speculating may be a profitable strategy.
Under the one price law, the price of a given good or service in two different markets (currencies) should be equal on average. Short-term deviations (\(\varepsilon\)) may lead to arbitrage and price equalization.
On average the foreign price \(p^*\) should match the local price \(p\) given the nominal exchange rate between the foreign and local prices (\(e\)): \(p^*= e \;p\)
Write the previous equation in log and add a non-systematic stationary discrepancy between both terms:
Cointegration: Linked stochastic trends
Consider two series \(\{X_t\}\) and \(\{Y_y\}\) both I(1):
\(X_t = \mu_t + e^{x}_{t}\)
\(Y_t = \mu_t + e^{y}_{t}\)
\(\mu_t = \mu_{t-1} + e^{\mu}_{t}\) a Random walk (unit-root) process
where all error terms \(e^{.}\) are WN. Both process are intrinsically identical in \(\mu_t\) (a I(1) process) except for \(e^{x}\) and \(e^{y}\).
Both process share the same stochastic trend \(\mu_t\)
Exercise 1
Simulate a cointegrated process for \(X_t\) and \(Z_t\) given t=100 (you decide on remaining simulation parameters). Analyse the correlogram of the estimated residuals from an OLS regression between these variables.
Error correction representation
We know how to test for the presence of cointegration using the DF or ADF test.
Residuals were stationary I(0). This guarantees cointegration and the existence of an error correction representation
However cleaner residuals are required to implement statistical inference with standard estimators (OLS, ML)
Thus, after finding cointegrated variables, the econometrician may introduce additional k lags of the many variables to get WN errors. This distributes additional lags to the model.
This is interpreted as a deviation from the long-run residual average (0). A positive deviation in a given period means that X is larger than its forecast, thus leading to a reduction in X during the forthcoming period. It may be interpreted as an empirical economic equilibrium condition that self-corrects itself.
The \(\textrm{ec}_t\) represents the long-run relationship or cointegrating relationship , aka cointegrating vector.
Note however, that this is not unique, as it can be represented for \(Z_t\) instead of \(X_t\).
Alternative generalization
The previous assumes that there is one reference variable that dictates the cointegrating relationship. This may not be true. There might be many, none, etc.
Thus, the ECM could be written using alternative reference variables. We need a more general approach to identify the cointegrating relationships and ECM.
Consider \(\bf X_t\) as a vector of \(n\) I(1) endogenous variables included in the relationship.
Also, let’s write the system as we did with the Dickey Fuller representation in levels.
\(\bf X_t = A \bf X_{t-1} + \bf\varepsilon_t\) (This is quite general, it allows the presence of unit-root or not like in the ADF)
Substracting the first lag just like in the univariate DF test:
This is clever. The \(\pi\) matrix informs whether the levels are informative to the FD
EC terms from \(\pi\)
Whiteboard…
If \(\pi\) does not exist i.e. its rank = 0, then, there are no cointegration relationships and the vector \(\bf X_t\) contains purely stationary variables.
The rank of \(\pi\) dictates whether there is cointegration and the number of cointegration vectors and ECM’s.
Had we need more lags to get WN errors does not change anything:
From the previous condition, show that 3 error correction terms can be obtained from \(\pi\) for n = 3
Johansen test
Consider the general notation (STATA) for a test on the levels (Y):
One may specify several alternative to a test on the rank of \(\pi\)
Example
Cointegration relationship between personal income, investment and consumption for the US.
The most standard configuration of I(1) is to drop the linear deterministic trend in first differences \(\tau = 0\). Including such trend may be informative for series with quadratic trends in levels.
The second most usual option is not to assume the presence of deterministic trend in the cointegrating relationship, thus we need \(\rho = 0\). A priori acknowledge of the variables of interest may point to stochastic trend variables (stock prices, interest rates, etc.). Assuming \(\tau=\rho=0\) is the default by stata.
Choice of lags.
The varsoc command and its Likelihood ratio test, provide good insights. Report your test from alternative lags in the neighborhood of the suggestion given by the test.
Null hyphotesis - maximum rank test.
Each line of the rank test provides a statistic to the test for a maximum number of cointegration vectors. If there is no cointegration, one should not reject the null at the very first line when max_rank=0.
Under cointegration, you are expected to reject the first rank =0, and to move forward to test for rank =1, etc. One stops once the null is not rejected.
If you always reject the null, the test might be misspecified. Check your lags and options.
webuse balance2varsocy i cvecranky i c, lags(3) // it always rejects the null, misspecification problemvecranky i c, lags(4) // As suggested by the LR test, it always rejects the nullvecranky i c, lags(5) // finally stops rejecting the null at r=2vecranky i c, lags(6) // finally stops rejecting the null at r=2
(macro data for VECM/balance study)
Lag-order selection criteria
Sample: 1960q1 thru 1982q4 Number of obs = 92
+---------------------------------------------------------------------------+
| Lag | LL LR df p FPE AIC HQIC SBIC |
|-----+---------------------------------------------------------------------|
| 0 | 494.087 4.6e-09 -10.6758 -10.6426 -10.5936 |
| 1 | 1178.08 1368 9 0.000 2.0e-15 -25.3496 -25.2169 -25.0207 |
| 2 | 1235.62 115.08 9 0.000 6.8e-16 -26.4049 -26.1725 -25.8292* |
| 3 | 1251.51 31.766 9 0.000 5.9e-16* -26.5545* -26.2226* -25.7322 |
| 4 | 1260.27 17.518* 9 0.041 6.0e-16 -26.5493 -26.1178 -25.4802 |
+---------------------------------------------------------------------------+
* optimal lag
Endogenous: y i c
Exogenous: _cons
Johansen tests for cointegration
Trend: Constant Number of obs = 93
Sample: 1959q4 thru 1982q4 Number of lags = 3
--------------------------------------------------------------
Critical
Maximum Trace value
rank Params LL Eigenvalue statistic 5%
0 21 1245.9796 . 41.2274 29.68
1 26 1256.8257 0.20804 19.5353 15.41
2 29 1261.8713 0.10283 9.4441 3.76
3 30 1266.5933 0.09656
--------------------------------------------------------------
Johansen tests for cointegration
Trend: Constant Number of obs = 92
Sample: 1960q1 thru 1982q4 Number of lags = 4
--------------------------------------------------------------
Critical
Maximum Trace value
rank Params LL Eigenvalue statistic 5%
0 30 1236.9203 . 46.6906 29.68
1 35 1250.8896 0.26190 18.7520 15.41
2 38 1256.7437 0.11950 7.0439 3.76
3 39 1260.2657 0.07371
--------------------------------------------------------------
Johansen tests for cointegration
Trend: Constant Number of obs = 91
Sample: 1960q2 thru 1982q4 Number of lags = 5
--------------------------------------------------------------
Critical
Maximum Trace value
rank Params LL Eigenvalue statistic 5%
0 39 1231.1041 . 46.1492 29.68
1 44 1245.3882 0.26943 17.5810 15.41
2 47 1252.5055 0.14480 3.3465* 3.76
3 48 1254.1787 0.03611
--------------------------------------------------------------
* selected rank
Johansen tests for cointegration
Trend: Constant Number of obs = 90
Sample: 1960q3 thru 1982q4 Number of lags = 6
--------------------------------------------------------------
Critical
Maximum Trace value
rank Params LL Eigenvalue statistic 5%
0 48 1222.3232 . 48.3578 29.68
1 53 1238.0479 0.29492 16.9084 15.41
2 56 1244.9553 0.14230 3.0936* 3.76
3 57 1246.5021 0.03379
--------------------------------------------------------------
* selected rank
There is evidence of 2 cointegrating relationships, thus 2 error correction model representation.