About

In this worksheet we look at different variance, covariance, volatility, and causality calculations. We finish with a short matematical proof (no R required).

Setup

Remember to always set your working directory to the source file location. Go to ‘Session’, scroll down to ‘Set Working Directory’, and click ‘To Source File Location’. Read carefully the below and follow the instructions to complete the tasks and answer any questions. Submit your work to RPubs as detailed in previous notes.

Note

For clarity, tasks/questions to be completed/answered are highlighted in red color (color visible only in preview mode) and numbered according to their particular placement in the task section. Type your answers outside the red color tags!

Quite often you will need to add your own code chunk. Execute sequentially all code chunks, preview, publish, and submit link on Sakai following the naming convention. Make sure to add comments to your code where appropriate. Use own language!

Any sign of plagiarism, will result in dissmissal of work!


Task 1: Variance, Covariance, and Volatility

This task follows the two examples in the book R Example 2.5/p. 58 and R Example 2.6/p. 66

# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("quantmod",quietly = TRUE))
  install.packages("quantmod",dependencies = TRUE, repos = "https://cloud.r-project.org")

##### 1A) Calculate the correlation and covariance matrix of the adjusted daily log returns for four different stocks of your choice. Explain your observations in terms of potential relationships.

# Once you have obtained the adjusted daily log returns for your stocks, omitting the time index, you will need to combine them to create a matrix. Below is an example.  For more details see the Help command in R on cbind, cov, and cor.
# M <- cbind(A,B,C) # create a matrix where each column is an array/vector of numerical values 
# cov(M) # compute the covariance matrix
# cor(M, method="pearson") # compute the correlation matrix based on the Pearson method
require(quantmod)
symbols=c('ILMN','CGIX','NVTA','CELG')
getSymbols(symbols, src="yahoo", from="2016-01-01", to="2018-12-04")
[1] "ILMN" "CGIX" "NVTA" "CELG"
ILMNad = ILMN$ILMN.Adjusted 
CGIXad = CGIX$CGIX.Adjusted
NVTAad = NVTA$NVTA.adjusted
CELGad = CELG$CELG.Adjusted
ilmnRD = periodReturn(ILMN, period = "daily", type = "log")
cgixRD = periodReturn(CGIX, period = "daily", type = "log")
nvtaRD = periodReturn(NVTA, period = "daily", type = "log")
celgRD = periodReturn(CELG, period = "daily", type = "log")
require(graphics)
ilmnR = as.numeric(ilmnRD)
cgixR = as.numeric(cgixRD)
nvtaR = as.numeric(nvtaRD)
celgR = as.numeric(celgRD)
M <- cbind(ilmnR, cgixR, nvtaR, celgR)
cov(M)
             ilmnR        cgixR        nvtaR        celgR
ilmnR 0.0006426052 0.0001424649 0.0002758442 0.0001640151
cgixR 0.0001424649 0.0028512927 0.0001996236 0.0001715704
nvtaR 0.0002758442 0.0001996236 0.0016049141 0.0001625055
celgR 0.0001640151 0.0001715704 0.0001625055 0.0003610570
cor(M, method = "pearson")
          ilmnR      cgixR      nvtaR     celgR
ilmnR 1.0000000 0.10524823 0.27162256 0.3405053
cgixR 0.1052482 1.00000000 0.09331793 0.1690961
nvtaR 0.2716226 0.09331793 1.00000000 0.2134786
celgR 0.3405053 0.16909609 0.21347860 1.0000000

The Covariance of of all these stocks have a positive covariance, probably because all of these stocks are genetic development companies and follow similar trends.

The Correlation of of these stocks are also all positive for the same reason above.

##### 1B) Calculate the three types of volatility for a particular stock of your choice. Consider a time window extending one year back from most recent obtainable closing day price. Order the three estimates from low to high volatility and explain how the ordering makes sense.

# For this task make sure you understand well what the variables n,m represent in the book's referenced example.
require(quantmod)
getSymbols("KHC", src="yahoo", subset= "2016:2017")
[1] "KHC"
khc = KHC["2016-01/2017-01"]; m=length(khc$KHC.Close);
ohlc <-khc[,c("KHC.Open","KHC.High","KHC.Low","KHC.Close")]
vClose <- volatility(ohlc, n= m,calc="close",N=272)
vParkinson <- volatility(ohlc, n= m,calc="parkinson",N=272)
vGK <- volatility(ohlc, n= m,calc="garman",N=272)
vClose[m]; vParkinson[m]; vGK[m];
                [,1]
2017-01-31 0.2067402
                [,1]
2017-01-31 0.1808091
                [,1]
2017-01-31 0.1824004

Task 2: Auto-Correlation and Auto-Regression

Follow the example in the book R Example 3.2/p. 74 and R Example 4.1/p. 115

##### 2A) Calculate the ACF for a stock of your choice. Consider both the log return and squared log return. Interpret your results in terms of possible existence of autocorrelation.

The only point at which Illumina could have significant autocorrelation is about at the 10 lag point.

##### 2B) Plot the exchange rate for USD versus another currency of your choice. Interpret your results in terms of behavior.

require(quantmod)
symbols=c("USD", "JPY")
getSymbols(symbols, src="oanda")
incorrectly specified currency pair USDincorrectly specified currency pair JPY
[1] "USD" "JPY"
getFX("JPY/USD")
[1] "JPYUSD"
plot(JPYUSD)

The JPY/USD exchange rate declined significantly over time, meaning that the yen has become stronger, relative to the dollar, over time.

##### 2C) Test for the possible existence of an underlying AR(1) – Markov process in your exchange rate currency pair. To this end, plot the ACF and the partial ACF (PACF). Interpret your results. Clearly refer to the lags, and their impacts in determining the order.

require(quantmod)
symbols=c("USD", "JPY")
getSymbols(symbols, src="oanda")
incorrectly specified currency pair USDincorrectly specified currency pair JPY
[1] "USD" "JPY"
getFX("JPY/USD")
[1] "JPYUSD"
acf(JPYUSD)

pacf(JPYUSD)

Task 3: Granger Causality Test

To conduct this test the package lmtest will be required, as already done in the code chunk below.

# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("lmtest",quietly = TRUE))
  install.packages("lmtest",dependencies = TRUE, repos = "https://cloud.r-project.org")

##### 3A) Include below the code chunk to solve for 3.5.7 R Lab/p. 106. Write your conclusions. A curious application of Granger causality is the by now classic experiment of Thruman and Fisher (1988) to assess which came first, if the chicken or the egg. The necessary data is in the package lmtest, and it is the data set ChickEgg that contains the yearly production of chicken and eggs in the US from 1930 to 1983. To assess if chicken causes egg, execute grangertest(egg chicken, order=3, data=ChickEgg). Do also the test for egg causing chicken, and write your conclusions.

# More information about the data used in testing for causality can be obtained by typing the name of the data set `ChickEgg` in the R Help menu.
require(lmtest)
grangertest(egg ~ chicken, order=3, data=ChickEgg)
Granger causality test

Model 1: egg ~ Lags(egg, 1:3) + Lags(chicken, 1:3)
Model 2: egg ~ Lags(egg, 1:3)
  Res.Df Df      F Pr(>F)
1     44                 
2     47 -3 0.5916 0.6238
require(lmtest)
grangertest(chicken ~ egg, order=3, data=ChickEgg)
Granger causality test

Model 1: chicken ~ Lags(chicken, 1:3) + Lags(egg, 1:3)
Model 2: chicken ~ Lags(chicken, 1:3)
  Res.Df Df     F   Pr(>F)   
1     44                     
2     47 -3 5.405 0.002966 **
---
Signif. codes:  
0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

The P-Value for Chicken causes egg = 0.6238, which is > 0.05. This means we fail to reject the null hypothesis and we cannot say that chicken causes egg.

The P-value for egg causes Chicken = 0.002966, which is < 0.05. This means we rejct the null hypothesis and we can say that the egg causes chicken.

##### 3B) Briefly describe the data in terms of time range and variables. Similar to the linear autoegressive model described in class, write the mathematical regression model solved in each Granger test, including the proper order. Use naming conventions, and notations more reflective of the data set considered for ChickEgg.

!(imgs/IMG_1864.jpg)
Error: object 'imgs' not found

Task 4: Mathematical Proof

##### 4A) Prove the two results in Eq (2.32)/p. 53. No R-coding is needed here. Clearly show your steps. Hint: Use the definition of \(E(X^n)\) for X-log normally distributed. Observe also that \(Var(X) = E(X^2)-E^2(X)\) for any random variable X.

*http://computationalfinance.lsi.upc.edu

This is an R Markdown Notebook. When you execute code within the notebook, the results appear beneath the code.

Try executing this chunk by clicking the Run button within the chunk or by placing your cursor inside it and pressing Cmd+Shift+Enter.

plot(cars)

Add a new chunk by clicking the Insert Chunk button on the toolbar or by pressing Cmd+Option+I.

When you save the notebook, an HTML file containing the code and output will be saved alongside it (click the Preview button or press Cmd+Shift+K to preview the HTML file).

The preview shows you a rendered HTML copy of the contents of the editor. Consequently, unlike Knit, Preview does not run any R code chunks. Instead, the output of the chunk when it was last run in the editor is displayed.

---
title: "FINC621 Winter 2018-19 Lab Worksheet 03"
author: "Grace Onufer"
date: "December 5th, 2018"
output:
  html_notebook: default
  html_document: default
subtitle: Variance, Covariance, Correlation & Causality (finc621-lab03)
---

### About

In this worksheet we look at different variance, covariance, volatility, and causality calculations. We finish with a short matematical proof (no R required).  

### Setup

Remember to always set your working directory to the source file location. Go to 'Session', scroll down to 'Set Working Directory', and click 'To Source File Location'. Read carefully the below and follow the instructions to complete the tasks and answer any questions.  Submit your work to RPubs as detailed in previous notes. 

### Note

For clarity, tasks/questions to be completed/answered are highlighted in red color (color visible only in preview mode) and numbered according to their particular placement in the task section.  Type your answers outside the red color tags!

Quite often you will need to add your own code chunk. Execute sequentially all code chunks, preview, publish, and submit link on Sakai following the naming convention. Make sure to add comments to your code where appropriate. Use own language!

**Any sign of plagiarism, will result in dissmissal of work!**

--------------

### Task 1: Variance, Covariance, and Volatility

This task follows the two examples in the book `R Example 2.5/p. 58` and `R Example 2.6/p. 66` 

```{r}
# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("quantmod",quietly = TRUE))
  install.packages("quantmod",dependencies = TRUE, repos = "https://cloud.r-project.org")
```


<span style="color:red">
##### 1A) Calculate the correlation and covariance matrix of the adjusted daily log returns for four different stocks of your choice. Explain your observations in terms of potential relationships.
</span>

```{r}
# Once you have obtained the adjusted daily log returns for your stocks, omitting the time index, you will need to combine them to create a matrix. Below is an example.  For more details see the Help command in R on cbind, cov, and cor.
# M <- cbind(A,B,C) # create a matrix where each column is an array/vector of numerical values 
# cov(M) # compute the covariance matrix
# cor(M, method="pearson") # compute the correlation matrix based on the Pearson method
```

```{r}
require(quantmod)
symbols=c('ILMN','CGIX','NVTA','CELG')
getSymbols(symbols, src="yahoo", from="2016-01-01", to="2018-12-04")

ILMNad = ILMN$ILMN.Adjusted 
CGIXad = CGIX$CGIX.Adjusted
NVTAad = NVTA$NVTA.adjusted
CELGad = CELG$CELG.Adjusted

ilmnRD = periodReturn(ILMN, period = "daily", type = "log")
cgixRD = periodReturn(CGIX, period = "daily", type = "log")
nvtaRD = periodReturn(NVTA, period = "daily", type = "log")
celgRD = periodReturn(CELG, period = "daily", type = "log")

require(graphics)
ilmnR = as.numeric(ilmnRD)
cgixR = as.numeric(cgixRD)
nvtaR = as.numeric(nvtaRD)
celgR = as.numeric(celgRD)

M <- cbind(ilmnR, cgixR, nvtaR, celgR)

cov(M)

cor(M, method = "pearson")

```
The Covariance of of all these stocks have a positive covariance, probably because all of these stocks are genetic development companies and follow similar trends. 

The Correlation of of these stocks are also all positive for the same reason above.  


<span style="color:red">
##### 1B) Calculate the three types of volatility for a particular stock of your choice. Consider a time window extending one year back from most recent obtainable closing day price. Order the three estimates from low to high volatility and explain how the ordering makes sense.
</span>

```{r}
# For this task make sure you understand well what the variables n,m represent in the book's referenced example.
```

```{r}
require(quantmod)
getSymbols("KHC", src="yahoo", subset= "2016:2017")

khc = KHC["2016-01/2017-01"]; m=length(khc$KHC.Close);

ohlc <-khc[,c("KHC.Open","KHC.High","KHC.Low","KHC.Close")]

vClose <- volatility(ohlc, n= m,calc="close",N=272)

vParkinson <- volatility(ohlc, n= m,calc="parkinson",N=272)

vGK <- volatility(ohlc, n= m,calc="garman",N=272)

vClose[m]; vParkinson[m]; vGK[m];
```



### Task 2: Auto-Correlation and Auto-Regression

Follow the example in the book  `R Example 3.2/p. 74` and `R Example 4.1/p. 115`

<span style="color:red">
##### 2A) Calculate the ACF for a stock of your choice. Consider both the log return and squared log return. Interpret your results in terms of possible existence of autocorrelation.
</span>

```{r}
require(quantmod)
symbols=c('ILMN','CGIX','NVTA','CELG')
getSymbols(symbols, src="yahoo", from="2016-01-01", to="2018-12-04")

ilmnRD = periodReturn(ILMN, period = "daily", type = "log")

require(graphics)
ilmnR = as.numeric(ilmnRD)

acf(na.omit(ilmnR),main="acf of ILMN",ylim=c(-0.2,0.2))
```
The only point at which Illumina could have significant autocorrelation is about at the 10 lag point.  

<span style="color:red">
##### 2B) Plot the exchange rate for USD versus another currency of your choice. Interpret your results in terms of behavior.
</span>

```{r}
require(quantmod)

symbols=c("USD", "JPY")
getSymbols(symbols, src="oanda")

getFX("JPY/USD")
plot(JPYUSD)
```
The JPY/USD exchange rate declined significantly over time, meaning that the yen has become stronger, relative to the dollar, over time. 

<span style="color:red">
##### 2C) Test for the possible existence of an underlying AR(1) – Markov process in your exchange rate currency pair. To this end, plot the ACF and the partial ACF (PACF). Interpret your results.  Clearly refer to the lags, and their impacts in determining the order.
</span>

```{r}
require(quantmod)

symbols=c("USD", "JPY")
getSymbols(symbols, src="oanda")

getFX("JPY/USD")
acf(JPYUSD)
pacf(JPYUSD)
```



### Task 3: Granger Causality Test

To conduct this test the package `lmtest` will be required, as already done in the code chunk below.

```{r}
# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("lmtest",quietly = TRUE))
  install.packages("lmtest",dependencies = TRUE, repos = "https://cloud.r-project.org")
```

<span style="color:red">
##### 3A) Include below the code chunk to solve for 3.5.7 R Lab/p. 106.  Write your conclusions.
</span>
A curious application of Granger causality is the by now classic experiment of Thruman and Fisher (1988) to assess which came first, if the chicken or the egg. The necessary data is in the package lmtest, and it is the data set ChickEgg that contains the yearly production of chicken and eggs in the US from 1930 to 1983. To assess if chicken causes egg, execute grangertest(egg chicken, order=3, data=ChickEgg). Do also the test for egg causing chicken, and write your conclusions.
```{r}
# More information about the data used in testing for causality can be obtained by typing the name of the data set `ChickEgg` in the R Help menu.
```


```{r}
require(lmtest)
#chick causes egg?
grangertest(egg ~ chicken, order=3, data=ChickEgg)
```

```{r}
require(lmtest)
#egg causes chicken?
grangertest(chicken ~ egg, order=3, data=ChickEgg)
```
The P-Value for Chicken causes egg = 0.6238, which is > 0.05.
This means we fail to reject the null hypothesis and we cannot say that chicken causes egg.

The P-value for egg causes Chicken = 0.002966, which is < 0.05.
This means we rejct the null hypothesis and we can say that the egg causes chicken.

<span style="color:red">
##### 3B) Briefly describe the data in terms of time range and variables. Similar to the linear autoegressive model described in class, write the mathematical regression model solved in each Granger test, including the proper order. Use naming conventions, and notations more reflective of the data set considered for  `ChickEgg`.
</span>
```{r}
!(imgs/IMG_1864.jpg)
```


### Task 4: Mathematical Proof

<span style="color:red">
##### 4A) Prove the two results in Eq (2.32)/p. 53.  No R-coding is needed here.  Clearly show your steps. Hint: Use the definition of $E(X^n)$ for X-log normally distributed.   Observe also that $Var(X) = E(X^2)-E^2(X)$ for any random variable X.
</span>





*[http://computationalfinance.lsi.upc.edu ](http://computationalfinance.lsi.upc.edu)

This is an [R Markdown](http://rmarkdown.rstudio.com) Notebook. When you execute code within the notebook, the results appear beneath the code. 

Try executing this chunk by clicking the *Run* button within the chunk or by placing your cursor inside it and pressing *Cmd+Shift+Enter*. 

```{r}
plot(cars)
```

Add a new chunk by clicking the *Insert Chunk* button on the toolbar or by pressing *Cmd+Option+I*.

When you save the notebook, an HTML file containing the code and output will be saved alongside it (click the *Preview* button or press *Cmd+Shift+K* to preview the HTML file). 

The preview shows you a rendered HTML copy of the contents of the editor. Consequently, unlike *Knit*, *Preview* does not run any R code chunks. Instead, the output of the chunk when it was last run in the editor is displayed.

