If any issues, questions or suggestions feel free to reach me out via e-mail wieczynskipawel@gmail.com or Linkedin. You can also visit my Github.
if(!require(pacman)) install.packages('pacman')
pacman::p_load(PolishStock, dplyr, ggplot2)
In that blog post we simulated 10000 portfolios of 4 iShares ETF’s (EEM, IAU, URTH, TIPS) and then selected portfolio with minamal variance and another portfolio with maximal Sharpe ratio. However, variance (or standard deviation) ain’t the only risk measures we use. Thus portfolio optimization problem can be generalized to any other risk measure \(\mathcal{R}\): \[\min_{\omega} \mathcal{R} \left( \omega^T \mu \right)\] with constraints that weights are non-negative (if short sale is forbidden) and sum of weights equals one.
In this post as risk measure we will use Value-at-Risk and Conditional Value-at-Risk (also known as Expected Shortfall). While standard approach with variance minimizes fluctuations of portfolio returns, approach with VaR and cVaR minimizes risk of huge dropdowns of the portfolio value.
Let’s denote portfolio losses \(L = -\omega^T \mu\). Distribution of random variable \(L\) is induced by distribution of asset returns \(\mu\) with density \(p_{\mu}\). Probability that loss \(L\) doesn’t exceed certain threshold \(z\) is \[\mathbb{P} ( L \leq z ) = F_L (z) \text{,}\] where \(F_L\) is cdf of \(L\). Value-at-Risk at confidence level \(\alpha \in (0,1)\) is defined as \[\text{VaR}\ (\alpha;\ \omega) := \inf \lbrace z \in \mathbb{R} \ | \ \mathbb{P} (L > z) \leq 1 - \alpha \rbrace = \inf \lbrace z \in \mathbb{R} \ | \ F_L(z) \geq \alpha \rbrace\] which is just \(1-\alpha\)-quantile of cdf \(F_L\). For instance, once one calculates VaR at \(\alpha = 95\%\), he can say “with 95% probability maximal loss won’t exceed 1000 EUR”. But remember, usually distribution of random variable \(L\) ain’t known, so estimation of this parameter is also subject to a certain margin of error.
But what happens if loss exceeds VaR? What we can say about remaining \(5\%\) of uncertainty? It’s well known fact, that time series of financial asset returns have heavy-tailed distributions, and especially events in the left tail are more frequent than events in the right tail. Conditional value-at-risk (or expected shortfall) is defined as average loss in this \(5\%\) tail, i.e. \[\text{cVaR}\ (\alpha;\ \omega) := \frac{1}{1-\alpha} \int_{\alpha}^1 q_u(F_L) \ du = \frac{1}{1-\alpha} \int_{\alpha}^1 \text{VaR}\ (u;\ \omega) \ du\] assuming random variable \(L\) is integrable, i.e. \(\mathbb{E} \ |L| < \infty\). Moreover, if cdf \(F_L\) is continuous we get more clear and intuitive definition of cVaR: \[\text{cVaR}\ (\alpha;\ \omega) = \mathbb{E} [\ L \ | \ L \geq \text{VaR}\ (\alpha;\ \omega)\ ] = \mathbb{E} [ \ L \ \pmb{1}_{ \lbrace L \geq \text{VaR}_{\alpha} \rbrace } \ ] \text{.}\]
Calculation of VaR boils down to determining the quantile of a generally unknown distribution, therefore there are different, more or less precise methods of calculating this measure. In our simulations we will use historical method which is just: \[ \widehat{\text{VaR}_{\alpha}} = \widehat{q_{\alpha}} \text{,} \] \[ \widehat{\text{ES}_{\alpha}} =
\frac{\sum_{i=1}^n L_i \pmb{1}_{ \lbrace L_i \geq \widehat{\text{VaR}_{\alpha}} \rbrace } }
{\sum_{i=1}^n \pmb{1}_{ \lbrace L_i \geq \widehat{\text{VaR}_{\alpha}} \rbrace } } \text{.} \]
Variance optimization problem is either quatratic programming problem or just system of linear equations, depending if short sale is allowed or not. I have good and bad news for the reader. Good news is that cVaR optimization can be reduced to linear programming problem, regardless of the distribution of asset return \(\mu\). Bad news is that VaR attainable portfolios don’t form convex set, what can be easily verified by vizualizing all the portfolios consisted of 2 assets.
In this post we will simulate \(10000\) portfolios and select portfolios with minimal VaR and cVaR. We also generalize Sharpe ratio to any risk measure, e.g. \[\text{Sharpe} = \frac{\mu_R (\omega) - \mu_F}{\text{VaR} \ (\alpha;\ \omega)} \text{,}\] so we will be able to select risk-reward optimal portfolios.
We use PolishStock package to download data from the website stooq.pl. Daily data for period January 2012 - February 2022 consists of 2345 observations.
tickers <- c('EEM.US', 'IAU.US', 'URTH.US', 'TIP.US')
for (ticker in tickers){
stooq_download(ticker, '20120101', '20220228'
,destination = paste0(getwd(), '/datasets/'))
}
close_prices <- df_prices(paste0(tickers, '.csv')
,source = 'datasets/')
returns <- sapply(select(close_prices, -Date)
,FUN = function(x) diff(log(x), lag = 1))
Although in the PolishStock package there exists a function which does the steps below, for educational purposes we write it explicitly here. First we calculate ETF’s annualized average returns. Assume \(2\%\) risk-free interest rate and significance level \(\alpha = 95\%\).
mean_returns <- (colMeans(returns) + 1)^252 - 1
rf <- 0.02
alpha = 0.95
We initialize matrix where portoflios weights will be stored and data frame where portfolios return, VaR, cVaR and sharpe ratios will be stored.
num_of_portfolios <- 1e4
weights <- matrix(nrow = num_of_portfolios, ncol = ncol(returns))
portfolio_metrics <- data.frame(
Returns = rep(0, num_of_portfolios)
,VaR = rep(0, num_of_portfolios)
,VaR_Sharpe = rep(0, num_of_portfolios)
,cVaR = rep(0, num_of_portfolios)
,cVaR_Sharpe = rep(0, num_of_portfolios)
)
We simulate \(10000\) random portfolios with random weights from Uniform[0,1] distribution (which means that short sale is not allowed in our model). We calculate portfolio return, VaR, cVaR and sharpe ratios as explained above.
set.seed(2137)
for (i in 1:num_of_portfolios) {
random_weights <- runif(ncol(weights))
random_weights <- random_weights / sum(random_weights)
weights[i, ] <- random_weights
portfolio_returns <- returns %*% random_weights
portfolio_metrics$Returns[i] <- random_weights %*% mean_returns
portfolio_metrics$VaR[i] <- -quantile(portfolio_returns, 1 - alpha)
portfolio_metrics$VaR_Sharpe[i] <- (portfolio_metrics$Returns[i] - rf) / portfolio_metrics$VaR[i]
cVaR_indicator <- (portfolio_returns < -portfolio_metrics$VaR[i])
portfolio_metrics$cVaR[i] <- -sum(portfolio_returns * cVaR_indicator) / sum(cVaR_indicator)
portfolio_metrics$cVaR_Sharpe[i] <- (portfolio_metrics$Returns[i] - rf) / portfolio_metrics$cVaR[i]
}
Below we see optimal portfolios:
- Minimal VaR and cVaR portfolios have only about \(3.3\%\) expected annual return and we are \(95\%\) condident that daily loss won’t exceed \(0.46\%\). In worst case scenario expected daily dropdown is about \(0.77\%\). These portfolios mainly consist of treasury inflation-protected bonds.
- Maximal sharpe portfolios have about \(7.3\%\) expected annual return and almost two times higher risk than minimal risk portfolios. These portfolios consist of developed markets stocks and treasury inflation-protected bonds in proportion about 50:50.
In fact, pretty similar results were obtained with variance optimization).
optimal_portfolios <- data.frame(
Min_VaR = c(
round(100*weights[which.min(portfolio_metrics$VaR),], 2)
,round(100*portfolio_metrics[which.min(portfolio_metrics$VaR), ], 2) %>% as.numeric()
)
,Max_VaR_Sharpe = c(
round(100*weights[which.max(portfolio_metrics$VaR_Sharpe),], 2)
,round(100*portfolio_metrics[which.max(portfolio_metrics$VaR_Sharpe), ], 2) %>% as.numeric()
)
,Min_cVaR = c(
round(100*weights[which.min(portfolio_metrics$cVaR),], 2)
,round(100*portfolio_metrics[which.min(portfolio_metrics$cVaR), ], 2) %>% as.numeric()
)
,Max_cVaR_Sharpe = c(
round(100*weights[which.max(portfolio_metrics$cVaR_Sharpe),], 2)
,round(100*portfolio_metrics[which.max(portfolio_metrics$cVaR_Sharpe), ], 2) %>% as.numeric()
)
) %>% t() %>% as.data.frame()
colnames(optimal_portfolios) <- c(colnames(returns), 'Expected Return', 'VaR', 'VaR Sharpe', 'cVaR', 'cVaR Sharpe')
optimal_portfolios
Below we see simulated portfolios on 2-dimensional risk-return planes. Minimal risk portfolios are green triangles and maximal sharpe portfolios are the orange ones. It’s not obvious from the vizualization, but VaR attainable portfolios ain’t convex set.
p1 <- ggplot(portfolio_metrics, aes(x = VaR, y = Returns, color = VaR_Sharpe)) +
geom_point() +
geom_point(aes(x = (optimal_portfolios$VaR[1] / 100)
,y = (optimal_portfolios$`Expected Return`)[1] / 100)
,color = '#2ca02c', size = 5, shape = 17) +
geom_point(aes(x = (optimal_portfolios$VaR[2] / 100)
,y = (optimal_portfolios$`Expected Return`)[2] / 100)
,color = '#ff7f0e', size = 5, shape = 17) +
theme_bw()
p2 <- ggplot(portfolio_metrics, aes(x = cVaR, y = Returns, color = cVaR_Sharpe)) +
geom_point() +
geom_point(aes(x = (optimal_portfolios$cVaR[3] / 100)
,y = (optimal_portfolios$`Expected Return`)[3] / 100)
,color = '#2ca02c', size = 5, shape = 17) +
geom_point(aes(x = (optimal_portfolios$cVaR[4] / 100)
,y = (optimal_portfolios$`Expected Return`)[4] / 100)
,color = '#ff7f0e', size = 5, shape = 17) +
theme_bw()
gridExtra::grid.arrange(p1, p2)

---
title: "Value-at-Risk Portfolio Optimization"
output: html_notebook
---

If any issues, questions or suggestions feel free to reach me out via e-mail <wieczynskipawel@gmail.com> or [Linkedin](https://www.linkedin.com/in/pawel-wieczynski/). You can also visit my [Github](https://github.com/pawel-wieczynski).

```{r libraries, warning=FALSE, message=FALSE}
if(!require(pacman)) install.packages('pacman')
pacman::p_load(PolishStock, dplyr, ggplot2)
```

In [that blog post](https://rpubs.com/pawel-wieczynski/874454) we simulated 10000 portfolios of 4 iShares ETF's (EEM, IAU, URTH, TIPS) and then selected portfolio with minamal variance and another portfolio with maximal Sharpe ratio. However, variance (or standard deviation) ain't the only risk measures we use. Thus portfolio optimization problem can be generalized to any other risk measure $\mathcal{R}$: $$\min_{\omega} \mathcal{R} \left( \omega^T \mu \right)$$ with constraints that weights are non-negative (if short sale is forbidden) and sum of weights equals one.

In this post as risk measure we will use Value-at-Risk and Conditional Value-at-Risk (also known as Expected Shortfall). While standard approach with variance minimizes fluctuations of portfolio returns, approach with VaR and cVaR minimizes risk of huge dropdowns of the portfolio value.

Let's denote portfolio losses $L = -\omega^T \mu$. Distribution of random variable $L$ is induced by distribution of asset returns $\mu$ with density $p_{\mu}$. Probability that loss $L$ doesn't exceed certain threshold $z$ is $$\mathbb{P} ( L \leq z ) = F_L (z) \text{,}$$ where $F_L$ is cdf of $L$. **Value-at-Risk** at confidence level $\alpha \in (0,1)$ is defined as $$\text{VaR}\ (\alpha;\ \omega) := \inf \lbrace z \in \mathbb{R} \ | \ \mathbb{P} (L > z) \leq 1 - \alpha \rbrace = \inf \lbrace z \in \mathbb{R} \ | \ F_L(z) \geq \alpha \rbrace$$ which is just $1-\alpha$-quantile of cdf $F_L$. For instance, once one calculates VaR at $\alpha = 95\%$, he can say *"with 95% probability maximal loss won't exceed 1000 EUR"*. But remember, usually distribution of random variable $L$ ain't known, so estimation of this parameter is also subject to a certain margin of error.

But what happens if loss exceeds VaR? What we can say about remaining $5\%$ of uncertainty? It's well known fact, that time series of financial asset returns have heavy-tailed distributions, and especially events in the left tail are more frequent than events in the right tail. **Conditional value-at-risk** (or expected shortfall) is defined as average loss in this $5\%$ tail, i.e. $$\text{cVaR}\ (\alpha;\ \omega) :=  \frac{1}{1-\alpha} \int_{\alpha}^1 q_u(F_L) \ du = \frac{1}{1-\alpha} \int_{\alpha}^1 \text{VaR}\ (u;\ \omega) \ du$$ assuming random variable $L$ is integrable, i.e. $\mathbb{E} \ |L| < \infty$. Moreover, if cdf $F_L$ is continuous we get more clear and intuitive definition of cVaR: $$\text{cVaR}\ (\alpha;\ \omega) = \mathbb{E} [\ L \ | \ L \geq \text{VaR}\ (\alpha;\ \omega)\ ] = \mathbb{E} [ \ L \ \pmb{1}_{ \lbrace L \geq \text{VaR}_{\alpha} \rbrace } \ ] \text{.}$$

Calculation of VaR boils down to determining the quantile of a generally unknown distribution, therefore there are different, more or less precise methods of calculating this measure. In our simulations we will use **historical method** which is just: $$ \widehat{\text{VaR}_{\alpha}} = \widehat{q_{\alpha}} \text{,} $$
$$ \widehat{\text{ES}_{\alpha}} = 
\frac{\sum_{i=1}^n L_i \pmb{1}_{ \lbrace L_i \geq \widehat{\text{VaR}_{\alpha}} \rbrace } } 
{\sum_{i=1}^n \pmb{1}_{ \lbrace L_i \geq \widehat{\text{VaR}_{\alpha}} \rbrace }  } \text{.} $$

[Variance optimization problem](https://rpubs.com/pawel-wieczynski/874454) is either quatratic programming problem or just system of linear equations, depending if short sale is allowed or not. I have good and bad news for the reader. Good news is that cVaR optimization can be reduced to linear programming problem, regardless of the distribution of asset return $\mu$. Bad news is that VaR attainable portfolios don't form convex set, what can be easily verified by vizualizing all the portfolios consisted of 2 assets.

In this post we will simulate $10000$ portfolios and select portfolios with minimal VaR and cVaR. We also **generalize Sharpe ratio** to any risk measure, e.g. $$\text{Sharpe} = \frac{\mu_R (\omega) - \mu_F}{\text{VaR} \ (\alpha;\ \omega)} \text{,}$$ so we will be able to select risk-reward optimal portfolios.

We use [PolishStock package](https://github.com/pawel-wieczynski/PolishStock) to download data from the website [stooq.pl](https://stooq.pl/). Daily data for period January 2012 - February 2022 consists of 2345 observations.
```{r get_data, message=FALSE, warning=FALSE}
tickers <- c('EEM.US', 'IAU.US', 'URTH.US', 'TIP.US')

for (ticker in tickers){
  stooq_download(ticker, '20120101', '20220228'
                 ,destination = paste0(getwd(), '/datasets/'))
}

close_prices <- df_prices(paste0(tickers, '.csv')
                          ,source = 'datasets/')

returns <- sapply(select(close_prices, -Date)
                  ,FUN = function(x) diff(log(x), lag = 1))

```

Although in the [PolishStock package](https://github.com/pawel-wieczynski/PolishStock) there exists a function which does the steps below, for educational purposes we write it explicitly here. First we calculate ETF's annualized average returns. Assume $2\%$ risk-free interest rate and significance level $\alpha = 95\%$.
```{r mean_and_covariance}
mean_returns <- (colMeans(returns) + 1)^252 - 1
rf <- 0.02
alpha = 0.95
```

We initialize matrix where portoflios weights will be stored and data frame where portfolios return, VaR, cVaR and sharpe ratios will be stored.
```{r initialize_tables}
num_of_portfolios <- 1e4
weights <- matrix(nrow = num_of_portfolios, ncol = ncol(returns))

portfolio_metrics <- data.frame(
  Returns = rep(0, num_of_portfolios)
  ,VaR = rep(0, num_of_portfolios)
  ,VaR_Sharpe = rep(0, num_of_portfolios)
  ,cVaR = rep(0, num_of_portfolios)
  ,cVaR_Sharpe = rep(0, num_of_portfolios)
)
```

We simulate $10000$ random portfolios with random weights from Uniform[0,1] distribution (which means that short sale is not allowed in our model). We calculate portfolio return, VaR, cVaR and sharpe ratios as explained above.
```{r simulation}
set.seed(2137)
for (i in 1:num_of_portfolios) {
  
  random_weights <- runif(ncol(weights))
  random_weights <- random_weights / sum(random_weights)
  weights[i, ] <- random_weights
  
  portfolio_returns <- returns %*% random_weights
  
  portfolio_metrics$Returns[i] <- random_weights %*% mean_returns
  portfolio_metrics$VaR[i] <- -quantile(portfolio_returns, 1 - alpha)
  portfolio_metrics$VaR_Sharpe[i] <- (portfolio_metrics$Returns[i] - rf) / portfolio_metrics$VaR[i]
  cVaR_indicator <- (portfolio_returns < -portfolio_metrics$VaR[i])
  portfolio_metrics$cVaR[i] <- -sum(portfolio_returns * cVaR_indicator) / sum(cVaR_indicator)
  portfolio_metrics$cVaR_Sharpe[i] <- (portfolio_metrics$Returns[i] - rf) / portfolio_metrics$cVaR[i]
  
}
```

Below we see optimal portfolios: \
 - Minimal VaR and cVaR portfolios have only about $3.3\%$ expected annual return and we are $95\%$ condident that daily loss won't exceed $0.46\%$. In worst case scenario expected daily dropdown is about $0.77\%$. These portfolios mainly consist of treasury inflation-protected bonds. \
 - Maximal sharpe portfolios have about $7.3\%$ expected annual return and almost two times higher risk than minimal risk portfolios. These portfolios consist of developed markets stocks and treasury inflation-protected bonds in proportion about 50:50.
 
In fact, pretty similar results were obtained with [variance optimization]((https://rpubs.com/pawel-wieczynski/874454))). 
```{r optimal_portfolios}
optimal_portfolios <- data.frame(
  Min_VaR = c(
    round(100*weights[which.min(portfolio_metrics$VaR),], 2)
    ,round(100*portfolio_metrics[which.min(portfolio_metrics$VaR), ], 2) %>% as.numeric()
  )
  ,Max_VaR_Sharpe = c(
    round(100*weights[which.max(portfolio_metrics$VaR_Sharpe),], 2)
    ,round(100*portfolio_metrics[which.max(portfolio_metrics$VaR_Sharpe), ], 2) %>% as.numeric()
  )
  ,Min_cVaR = c(
    round(100*weights[which.min(portfolio_metrics$cVaR),], 2)
    ,round(100*portfolio_metrics[which.min(portfolio_metrics$cVaR), ], 2) %>% as.numeric()
  )
  ,Max_cVaR_Sharpe = c(
    round(100*weights[which.max(portfolio_metrics$cVaR_Sharpe),], 2)
    ,round(100*portfolio_metrics[which.max(portfolio_metrics$cVaR_Sharpe), ], 2) %>% as.numeric()
  )
) %>% t() %>% as.data.frame()

colnames(optimal_portfolios) <- c(colnames(returns), 'Expected Return', 'VaR', 'VaR Sharpe', 'cVaR', 'cVaR Sharpe')
optimal_portfolios
```

Below we see simulated portfolios on 2-dimensional risk-return planes. Minimal risk portfolios are green triangles and maximal sharpe portfolios are the orange ones. It's not obvious from the vizualization, but VaR attainable portfolios ain't convex set.
```{r vizualization}
p1 <- ggplot(portfolio_metrics, aes(x = VaR, y = Returns, color = VaR_Sharpe)) +
  geom_point() +
  geom_point(aes(x = (optimal_portfolios$VaR[1] / 100)
                 ,y = (optimal_portfolios$`Expected Return`)[1] / 100)
             ,color = '#2ca02c', size = 5, shape = 17) +
  geom_point(aes(x = (optimal_portfolios$VaR[2] / 100)
                 ,y = (optimal_portfolios$`Expected Return`)[2] / 100)
             ,color = '#ff7f0e', size = 5, shape = 17) + 
  theme_bw()

p2 <- ggplot(portfolio_metrics, aes(x = cVaR, y = Returns, color = cVaR_Sharpe)) +
  geom_point() +
  geom_point(aes(x = (optimal_portfolios$cVaR[3] / 100)
                 ,y = (optimal_portfolios$`Expected Return`)[3] / 100)
             ,color = '#2ca02c', size = 5, shape = 17) +
  geom_point(aes(x = (optimal_portfolios$cVaR[4] / 100)
                 ,y = (optimal_portfolios$`Expected Return`)[4] / 100)
             ,color = '#ff7f0e', size = 5, shape = 17) + 
  theme_bw()

gridExtra::grid.arrange(p1, p2)
```