Use the data until December 31, 2015 to estimate the parameters of the Fama French 3-factor model.
We load in the csv file. First we filter the data so that we have until (and including) December 31st 2015. Next, we loop through each permno to perform the regression to estimate the parameters of the Fama-French 3 factor model.| Permno | Alpha | beta_M | beta_smb | beta_hml |
|---|---|---|---|---|
| 10026 | 0.0102 | 0.5632 | 0.5944 | 0.4427 |
| 10032 | 0.0052 | 1.7587 | 0.8591 | -0.2558 |
| 10044 | 0.0142 | 0.6237 | 0.1124 | 0.1665 |
| 10104 | 0.0033 | 1.2009 | 0.3213 | -0.7583 |
| 10200 | 0.0285 | 0.9231 | 2.0965 | -2.0745 |
| 10232 | 0.0058 | 0.2678 | 0.4411 | 0.4212 |
| 10252 | 0.0037 | 0.8362 | 0.5392 | 0.9268 |
| 10397 | 0.0063 | 0.7396 | 0.3085 | 0.2911 |
Give a brief interpretation of the values. The above table shows the estimated alpha along with variable coefficients for all variables in the 3 factor model. Alpha can be interpreted as the intercept, i.e. the return in excess of the expected return, conditional on the variables included in the model. The beta values are the regression coefficients for the included variables and shows how much of the expected excess return of asset i that is explained by the market excess return, the small minus big and high minus low factor returns respectively.
Derive the model-implied expected gross excess return and the model-implied variance covariance matrix.
| Permno | 10026 | 10032 | 10044 | 10104 | 10200 | 10232 | 10252 | 10397 |
|---|---|---|---|---|---|---|---|---|
| Mu | 0.0151 | 0.0124 | 0.0171 | 0.0055 | 0.0305 | 0.0093 | 0.011 | 0.0105 |
We use the estimated model for each stock to derive a full series of model-implied expected returns. Then we calculate the mean of the series, \(\hat\mu\) for each asset as an estimate for the model-implied expected gross excess return. To estimate the variance-covariance matrix, we utilize the method developed by Ledoit and Wolf regarding shrinkage transformation on the sample covariance matrix. This choice was made due to initial linear dependency in the return series.
| 10026 | 10032 | 10044 | 10104 | 10200 | 10232 | 10252 | 10397 | |
|---|---|---|---|---|---|---|---|---|
| 10026 | 0.0014 | 0.0029 | 0.0009 | 0.0016 | 0.0023 | 0.0008 | 0.0017 | 0.0013 |
| 10032 | 0.0029 | 0.0085 | 0.0025 | 0.0056 | 0.0085 | 0.0015 | 0.0035 | 0.0032 |
| 10044 | 0.0009 | 0.0025 | 0.0010 | 0.0016 | 0.0016 | 0.0005 | 0.0013 | 0.0011 |
| 10104 | 0.0016 | 0.0056 | 0.0016 | 0.0043 | 0.0066 | 0.0008 | 0.0018 | 0.0019 |
| 10200 | 0.0023 | 0.0085 | 0.0016 | 0.0066 | 0.0157 | 0.0011 | 0.0015 | 0.0023 |
| 10232 | 0.0008 | 0.0015 | 0.0005 | 0.0008 | 0.0011 | 0.0006 | 0.0010 | 0.0007 |
| 10252 | 0.0017 | 0.0035 | 0.0013 | 0.0018 | 0.0015 | 0.0010 | 0.0026 | 0.0017 |
| 10397 | 0.0013 | 0.0032 | 0.0011 | 0.0019 | 0.0023 | 0.0007 | 0.0017 | 0.0015 |
What determines the correlation of the returns? In our model, all predictors (market excess, smb and hml returns) have the same value throughout the series across all permnos. Therefore, the only thing that determines the estimated excess returns, and thus the only thing that makes the estimated returns differ, are the weights assigned to each variable, i.e. the coefficients. Adding to this, the betas themselves are determined using OLS, where the aim is to fit the regression by minimizing the squared errors, i.e. the sum of the square differences between the observed and predicted values. Therefore, the beta value of a given firm depends on the performance of the firm’s stock relative to the performance of all other stocks.
Use your parameter estimates to report the computed implied mean-variance efficient portfolio weight for an investor with risk aversion \(\gamma\) = 4 As visible in table 4, we end up with weights that indicate an extremely aggressive trading strategy. This is most visible by the long position of 1180% in permno 10044 and the short position of 821% in permno 10104. As our weights sum to 1, i.e. 100%, we accept the weights, however, we want to note that we do not necessarily deem it a realistic strategy.| 10026 | 10032 | 10044 | 10104 | 10200 | 10232 | 10252 | 10397 | Sum |
|---|---|---|---|---|---|---|---|---|
| 2.346 | 0.2548 | 11.804 | -8.2091 | 2.5804 | -6.8265 | -0.6031 | -0.3464 | 1 |
What is the intuition behind the parameter vector \(\theta\) and how do Brandt, Santa-Clara and Valkanov propose to estimate \(\theta\)? The authors model the portfolio weights of each asset as a function of the assets characteristics. The coefficients of this function are found by optimizing the investor’s average utility of the portfolio’s return over the sample period. The optimal weight in each stock is found by taking the sum of its market capitalization weight and an optimal deviation from that market cap weight which depends parametrically on the characteristics of the firms. The included characteristics are market cap, book-to-market ratio, lagged one-year returns of each firm. Moreover, investors are assumed to have constant relative risk aversion preferences.
The method implies that the coefficients \(\theta\) are held constant across assets over the entire time period. This implies that the portfolio weights depends solely on the characteristics and not its historic returns. Therefore, two stocks with similar characteristics associated with returns and risk should have similar weights even though their sample returns may differ. It is implicitly assumed that the aforementioned characteristics fully capture all aspects of the joint distribution of returns that are deemed relevant for building an optimal portfolio. This implies that the coefficients that maximizes the conditional expected utility of the investor at a given point in time are the same for all dates. Therefore, they also maximize the unconditional utility of the investors.
In the paper, the coefficients, i.e. the maximum expected utility estimator, are estimated as a method of moments estimator from which the asymptotic covariance matrix can be estimated using the consistent estimator V. This approach allows the authors to test the model for certain problems, e.g. misspecified portfolio policy or with the implementation of different constraints which is allowed as long as the authors take the appropriate measures to estimate V (e.g., by using an autocorrelation-adjusted estimator of V when constraints are imposed).
Discuss the benefit of the parametrizing portfolio weights directly relative to the two-step procedure. What are potential disadvantages of direct weight parametrization? The authors argues that their method is beneficial for different reasons. It is computationally simple, easily modified and extended, produces sensible portfolio weights, and offers robust performance in and out of sample. Moreover, it is argued that parameterization leads to a tremendous reduction in dimensionality, and it escapes issues with imprecise coefficient estimates and overfitting.
A disadvantage of the approach is the fact that characteristics are chosen arbitrarily, why one easily can assume that some information is not accounted for. Another disadvantage is that constraints on the portfolio weights (e.g. no-short selling constraints) implies that more complicated measures have to be implemented, as the weights will not automatically sum to one. Moreover, the critical assumptions regarding the characteristics fully capturing all aspects of the joint return distribution and the fact that coefficients are held constant for the entire period is somewhat unrealistic. As the authors write themselves: “While this is a convenient assumption, there is no obvious economic reason for the relation between firm characteristics and the joint distribution of returns to be time-invariant. In fact, there is substantial evidence that economic variables related to the business cycle forecast aggregate stock and bond returns”.
Report and discuss the values of \(\theta\): Are the signs intuitive?
In Table 4, the visible coefficients are the relative deviations from the benchmark naive portfolio due to characteristics market capitalization, book-to-market ratio and CAPM beta. These values optimize the objective function of maximizing the investors certainty equivalent. The values imply that expected utility for the investor decreases with size and beta, while it increases with bm. In other words, weight is assigned towards smaller companies, companies with lower book-to-market ratios and companies which stock move against the market. This is in line with past literature and findings in (Brandt, Santa Clara & Valkanov, 2009), that focus is to be put on smaller companies and companies with lower book-to-market ratios.
| Size | bm | beta |
|---|---|---|
| -0.647 | 0.568 | -0.6 |
Compute and report the optimal weights \(w^{PP}\)
| 10026 | 10032 | 10044 | 10104 | 10200 | 10232 | 10252 | 10397 |
|---|---|---|---|---|---|---|---|
| 0.166 | 0.095 | 0.155 | -0.16 | -0.021 | 0.413 | 0.146 | 0.207 |
| permno | FF-3 | PP | Naive | Efficient |
|---|---|---|---|---|
| 10026 | 2.3460 | 0.1659 | 0.125 | 0.4331 |
| 10032 | 0.2548 | 0.0951 | 0.125 | 0.0129 |
| 10044 | 11.8040 | 0.1554 | 0.125 | 0.2070 |
| 10104 | -8.2091 | -0.1603 | 0.125 | -0.2142 |
| 10200 | 2.5804 | -0.0215 | 0.125 | 0.1628 |
| 10232 | -6.8265 | 0.4127 | 0.125 | 0.2266 |
| 10252 | -0.6031 | 0.1461 | 0.125 | 0.1205 |
| 10397 | -0.3464 | 0.2065 | 0.125 | 0.0512 |
Report the annualized Sharpe Ratio of the four strategies and discuss the differences. The naive portfolio allocation is undoubtedly the best performing strategy in this backtest. The strategy is preferable both in terms of giving the highest return and the lowest standard deviation (volatility) with values respectively at 12.5% and 26.2%, which gives a Sharpe ratio of 0.47. The second best performing strategy is the portfolio allocation based on efficient weights gaining a Sharpe ratio of 0.21. Further, the strategy of parametric portfolio choices from part 2 and the Fama-French 3-factor model from part 1 delivers respectively a Sharpe ratio of 0.067 and -0.081.
The differences between on one side the efficient portfolio and naive, and on the other side the FF-3 and PP are quite significant. The FF-3 is performing really bad with a negative gross return, which can be supported by the fact that the portfolio weights are quite extreme in some cases. Table 4 shows that the strategy goes long in company ‘10044’ by more than 11.800% and short -8.209%, which will determine some extreme positions. In a small tradable stock universe like this, extreme positions will have high importance, which is clear for the former mentioned position, since this gives a return of -111% and -112% for 10104 (this isn’t reported in the table). This has a huge influence on the performance of the portfolio overall. Exposure towards single stocks can be diversified away, but with a small universe, it’s difficult to implement a strategy like this. The problem of high weights likely lies in the variance-covariance matrix, which is very close to being non-invertiable due to the nature of the model (return estimates being linear dependent).
| portfolio | Return | SD | Sharpe Ratio |
|---|---|---|---|
| Efficient | 0.077 | 0.363 | 0.211 |
| FF-3 | -0.748 | 9.182 | -0.081 |
| Naive | 0.125 | 0.262 | 0.478 |
| PP | 0.022 | 0.329 | 0.067 |
The differences between on one side the efficient portfolio and naive, and on the other side the FF-3 and PP are quite significant. The FF-3 is performing really bad with a negative gross return, which can be supported by the fact that the portfolio weights are quite extreme in some cases. Table 4 shows that the strategy goes long in company ‘10044’ by more than 11.800% and short -8.209%, which will determine some extreme positions. In a small tradable stock universe like this, extreme positions will have high importance, which is clear for the former mentioned position, since this gives a return of -111% and -112% for 10104 (this isn’t reported in the table). This has a huge influence on the performance of the portfolio overall. Exposure towards single stocks can be diversified away, but with a small universe, it’s difficult to implement a strategy like this. The problem of high weights likely lies in the variance-covariance matrix, which is very close to being non-invertiable due to the nature of the model (return estimates being linear dependent).
Outline the potential issues of the back-testing strategy in light of implementation short- falls and estimation uncertainty.
There exists several short-falls in the implementation of back-testing. To provide valid results through back-testing, all outside time-specific affecting factors has to be accounted for. Different time periods might not be suitable for comparisons, as circumstances that affect data points can change over time and results therefore can be affected by outside factors, such as booms and recessions or particular events. A general point is that what happened in the past cannot predict what will happen in the future, even though it can be argued that there lies information about the future in the past.
In the calculation of returns, all values has to be based on weighted averages and cannot just be summed, as otherwise the aggregates would be affected by the length of the period considered (i.e. comparing return on a portfolio spanning over 10 years with one over 5 years). If considering just a few self-selected tickers when performing financial analysis, there is a risk of selection bias, which consists of non-random selection not being able to represent a gven population. Thus a backtest might not be representable for other periods. Further, we calculate returns on the weights estimated in \(t_{end}\), meaning that time specific events (i.e. a very unusual event in the final month of 2015) might have influence on the estimation, and hence on the weights used going forward. Overfitting a model on a specific period might cause the model to not fit other periods.
Discuss your expectations regarding the performance of the four strategies for a high-dimensional asset universe and a realistic assessment of transaction costs.
Generally, performance will increase in line with the asset space. This as portfolios can be better diversified over different assets, while returns will not decrease. Transaction costs will affect performance negatively, however, to different extents for the different portfolios.
We expect the naive portfolio to do well also when considering a high-dimensional asset universe and transaction costs. Transaction costs will be comparatively low, as trades are strictly made when assets enter/exit the universe and when the relative weights between assets change as a result of last period differences in returns. However, comparing this to allocation changes associated with performance-tracking portfolio strategies, e.g. momentum strategies, the size of the re-balancing will be comparably low. The risk of low-performing stocks being a significant part of a portfolio decreases with the size of the asset universe, although this is also true for well-performing stocks. The general risk of the portfolio will decreases with a larger amount of assets as the contributing volatility of the individual asset become relatively smaller. The effort in maintenance is low. The portfolio will follow the idea that there is a higher performance of smaller companies, in comparison to a weighted market portfolio.
For the Fama-French strategy, trades are made in line with how companies perform in relation to the considered Fama-French factors. This can change relatively frequently, as firms’ returns may fluctuate from month to month. Moreover, this can cause returns for the factor returns to vary. In times of financial distress, this will be more prominent for e.g. the smb factor as small-cap stocks tend to perform significantly worse than their large-cap counterparts during recessions why the factor returns will suffer from high volatility ceteris paribus. Given this, a larger asset universe will imply more transactions, which further implies relatively large transaction costs. Therefore, a large asset universe may decrease the performance of this type of portfolio because large transaction costs will kill returns.
When the asset universe increase, the efficient portfolio will move closer towards the market portfolio. Transaction costs will decrease with an increase in the number of assets in the portfolio, as changes in returns will affect both factors of the Sharpe ratio, returns and risk. Therefore, we assess the effect of transaction costs as somewhat moderate, as they will be larger than those of the naive portfolio but smaller than those of the three-factor portfolio. This has been further supported by the performance we saw in the third mandatory assignment of this course.
The portfolio based on parametriced characteristics has its strength in large universes that requires extensive calculations, as it reduces dimensionality.It is also argued that the method escapes some estimation uncertainty, through its different approach. In terms of transaction costs, the model will be re balanced in line with changes in companies’ relation to the characteristics size, bm CAPM beta. It can be argued that weights will change relatively little over time, as companies will stay fairly stable over time, in relation to the different characteristics. Transaction costs can therefore be interpreted as relatively low. With a larger asset space, the risk in returns will decrease.