About

In this worksheet we look at different variance, covariance, volatility, and causality calculations. We finish with a short matematical proof (no R required).

Setup

Remember to always set your working directory to the source file location. Go to ‘Session’, scroll down to ‘Set Working Directory’, and click ‘To Source File Location’. Read carefully the below and follow the instructions to complete the tasks and answer any questions. Submit your work to RPubs as detailed in previous notes.

Note

For clarity, tasks/questions to be completed/answered are highlighted in red color (color visible only in preview mode) and numbered according to their particular placement in the task section. Type your answers outside the red color tags!

Quite often you will need to add your own code chunk. Execute sequentially all code chunks, preview, publish, and submit link on Sakai following the naming convention. Make sure to add comments to your code where appropriate. Use own language!

Any sign of plagiarism, will result in dissmissal of work!


Task 1: Variance, Covariance, and Volatility

This task follows the two examples in the book R Example 2.5/p. 58 and R Example 2.6/p. 66

# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("quantmod",quietly = TRUE))
  install.packages("quantmod",dependencies = TRUE, repos = "https://cloud.r-project.org")

##### 1A) Calculate the correlation and covariance matrix of the adjusted daily log returns for four different stocks of your choice. Explain your observations in terms of potential relationships.

Here the 4 shares show a positive covariance, which indicates that if one share goes up, so does the other. a covariance lias between -1 and +1, we can decide the strength of the covariance from the fact that how close it is to 1. closer to 1, stronger the covarience. the correlation between these 4 stocks is very high in most cases it is more than 50%. Which means they are very similar and mostly from the same industry.

 getSymbols("ORCL",src="yahoo")
[1] "ORCL"
ORCLRD=dailyReturn(ORCL,periods="daily", type="log")
getSymbols("SAP",src="yahoo")
[1] "SAP"
SAPRD=dailyReturn(SAP,periods="daily", type="log")
getSymbols("ADBE",src="yahoo")
[1] "ADBE"
ADBERD=dailyReturn(ADBE,periods="daily", type="log")
getSymbols("IBM",src="yahoo")
[1] "IBM"
IBMRD=dailyReturn(IBM,periods="daily", type="log")
M<- cbind(ORCLRD,SAPRD,ADBERD,IBMRD)
colnames(M)<- c("ORCLRD","SAPRD","ADBERD","IBMRD")
colnames(M)
[1] "ORCLRD" "SAPRD"  "ADBERD" "IBMRD" 
cov(M)
             ORCLRD        SAPRD       ADBERD        IBMRD
ORCLRD 0.0003047189 0.0001829241 0.0002035892 0.0001412052
SAPRD  0.0001829241 0.0003068716 0.0001929072 0.0001299595
ADBERD 0.0002035892 0.0001929072 0.0004310746 0.0001431262
IBMRD  0.0001412052 0.0001299595 0.0001431262 0.0001956820
cor(M, method = "pearson")
          ORCLRD     SAPRD    ADBERD     IBMRD
ORCLRD 1.0000000 0.5981953 0.5617314 0.5782631
SAPRD  0.5981953 1.0000000 0.5303882 0.5303399
ADBERD 0.5617314 0.5303882 1.0000000 0.4927964
IBMRD  0.5782631 0.5303399 0.4927964 1.0000000
  

##### 1B) Calculate the three types of volatility for a particular stock of your choice. Consider a time window extending one year back from most recent obtainable closing day price. Order the three estimates from low to high volatility and explain how the ordering makes sense.

nflx=NFLX['2017-11/2018-11'];m=length(nflx$NFLX.Close);
ohcl<-nflx[,c("NFLX.Open","NFLX.High","NFLX.Low","NFLX.Close")]
vClose<-volatility(ohcl,n=m,calc="close",N=252)
vParkinson<-volatility(ohcl,n=m,calc="parkinson", N=252)
vGK<-volatility(ohcl,n=m,calc="garman",N=252)
vClose[m];vParkinson[m];vGK[m]
                [,1]
2018-11-30 0.4257689
                [,1]
2018-11-30 0.3756358
                [,1]
2018-11-30 0.3708174

The volatilities above show that, the more we stabalize this stock, lesser is the volatility. if we just consider the opening and closing price it shows a very high volatility but as we add more factors like random walk, highs and lows to it, the number starts to diffuse. This says that although the stock is volatile, the numbers of volatility may be a bit inflated.

Task 2: Auto-Correlation and Auto-Regression

Follow the example in the book R Example 3.2/p. 74 and R Example 4.1/p. 115

##### 2A) Calculate the ACF for a stock of your choice. Consider both the log return and squared log return. Interpret your results in terms of possible existence of autocorrelation.

1,13,17,29 show some linear dependence on past values. We do see some level of autocorrelation

getSymbols("ORCL",src="yahoo")
[1] "ORCL"
ORCLRD=dailyReturn(ORCL,periods="daily", type="log")
ORCLRD1<- ORCLRD^2
acf(ORCLRD)

pacf(ORCLRD)

acf(ORCLRD1)

pacf(ORCLRD1)

##### 2B) Plot the exchange rate for USD versus another currency of your choice. Interpret your results in terms of behavior.

As we can see the rates started declining since august and continued to fall rapidly upto october.In october the INR saw it’s lowest point and then started gaining ground as it tried to regain it’s previous position. The decline was a result of pull out by the FIIs.

##### 2C) Test for the possible existence of an underlying AR(1) – Markov process in your exchange rate currency pair. To this end, plot the ACF and the partial ACF (PACF). Interpret your results. Clearly refer to the lags, and their impacts in determining the order.

There exists a Markov process in the currency data as the past rate of the currency on any given date does not affect it’s future rate given the present rate. The future rate is independent and moves based on other factors than the past rate. As we can see in the ACF and PACF plots there is no substantial linear correlation.

Task 3: Granger Causality Test

To conduct this test the package lmtest will be required, as already done in the code chunk below.

grangertest(chicken ~ egg, order = 3, data = ChickEgg)
Granger causality test

Model 1: chicken ~ Lags(chicken, 1:3) + Lags(egg, 1:3)
Model 2: chicken ~ Lags(chicken, 1:3)
  Res.Df Df     F   Pr(>F)   
1     44                     
2     47 -3 5.405 0.002966 **
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
grangertest(egg ~ chicken, order = 3, data = ChickEgg)
Granger causality test

Model 1: egg ~ Lags(egg, 1:3) + Lags(chicken, 1:3)
Model 2: egg ~ Lags(egg, 1:3)
  Res.Df Df      F Pr(>F)
1     44                 
2     47 -3 0.5916 0.6238

##### 3A) Include below the code chunk to solve for 3.5.7 R Lab/p. 106. Write your conclusions.

grangertest(chicken ~ egg, order = 3, data = ChickEgg)
Granger causality test

Model 1: chicken ~ Lags(chicken, 1:3) + Lags(egg, 1:3)
Model 2: chicken ~ Lags(chicken, 1:3)
  Res.Df Df     F   Pr(>F)   
1     44                     
2     47 -3 5.405 0.002966 **
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
grangertest(egg ~ chicken, order = 3, data = ChickEgg)
Granger causality test

Model 1: egg ~ Lags(egg, 1:3) + Lags(chicken, 1:3)
Model 2: egg ~ Lags(egg, 1:3)
  Res.Df Df      F Pr(>F)
1     44                 
2     47 -3 0.5916 0.6238

In the first case of chicken causing egg, the p-value is 0.002966 which is less than the critical value, so we reject the null hypothesis and say that chicken does depend on eggs and chicken.

In the second case of eggs causing chicken, the p-value is 0.6238 which is above the critical value, so we cannot reject the null hypothesis that eggs may not depend on chicken.

##### 3B) Briefly describe the data in terms of time range and variables. Similar to the linear autoegressive model described in class, write the mathematical regression model solved in each Granger test, including the proper order. Use naming conventions, and notations more reflective of the data set considered for ChickEgg.

Task 4: Mathematical Proof

##### 4A) Prove the two results in Eq (2.32)/p. 53. No R-coding is needed here. Clearly show your steps. Hint: Use the definition of \(E(X^n)\) for X-log normally distributed. Observe also that \(Var(X) = E(X^2)-E^2(X)\) for any random variable X.

```

*http://computationalfinance.lsi.upc.edu

---
title: "FINC621 Winter 2018-19 Lab Worksheet 03"
author: "Sangamitra AGrawal"
date: "12/05/2018"
output:
  html_notebook: default
  html_document: default
subtitle: Variance, Covariance, Correlation & Causality (finc621-lab03)
---

### About

In this worksheet we look at different variance, covariance, volatility, and causality calculations. We finish with a short matematical proof (no R required).  

### Setup

Remember to always set your working directory to the source file location. Go to 'Session', scroll down to 'Set Working Directory', and click 'To Source File Location'. Read carefully the below and follow the instructions to complete the tasks and answer any questions.  Submit your work to RPubs as detailed in previous notes. 

### Note

For clarity, tasks/questions to be completed/answered are highlighted in red color (color visible only in preview mode) and numbered according to their particular placement in the task section.  Type your answers outside the red color tags!

Quite often you will need to add your own code chunk. Execute sequentially all code chunks, preview, publish, and submit link on Sakai following the naming convention. Make sure to add comments to your code where appropriate. Use own language!

**Any sign of plagiarism, will result in dissmissal of work!**

--------------

### Task 1: Variance, Covariance, and Volatility

This task follows the two examples in the book `R Example 2.5/p. 58` and `R Example 2.6/p. 66` 

```{r}
# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("quantmod",quietly = TRUE))
  install.packages("quantmod",dependencies = TRUE, repos = "https://cloud.r-project.org")
```


<span style="color:red">
##### 1A) Calculate the correlation and covariance matrix of the adjusted daily log returns for four different stocks of your choice. Explain your observations in terms of potential relationships.
</span>

Here the 4 shares show a positive covariance, which indicates that if one share goes up, so does the other. a covariance lias between -1 and +1, we can decide the strength of the covariance from the fact that how close it is to 1. closer to 1, stronger the covarience.
the correlation between these 4 stocks is very high in most cases it is more than 50%. Which means they are very similar and mostly from the same industry.


```{r}
 getSymbols("ORCL",src="yahoo")
ORCLRD=dailyReturn(ORCL,periods="daily", type="log")

getSymbols("SAP",src="yahoo")
SAPRD=dailyReturn(SAP,periods="daily", type="log")

getSymbols("ADBE",src="yahoo")
ADBERD=dailyReturn(ADBE,periods="daily", type="log")

getSymbols("IBM",src="yahoo")
IBMRD=dailyReturn(IBM,periods="daily", type="log")


M<- cbind(ORCLRD,SAPRD,ADBERD,IBMRD)

colnames(M)<- c("ORCLRD","SAPRD","ADBERD","IBMRD")
colnames(M)

cov(M)

cor(M, method = "pearson")
  

```

<span style="color:red">
##### 1B) Calculate the three types of volatility for a particular stock of your choice. Consider a time window extending one year back from most recent obtainable closing day price. Order the three estimates from low to high volatility and explain how the ordering makes sense.
</span>

```{r}
# For this task make sure you understand well what the variables n,m represent in the book's referenced example.

getSymbols("NFLX", src='yahoo')
nflx=NFLX['2017-11/2018-11'];m=length(nflx$NFLX.Close);
ohcl<-nflx[,c("NFLX.Open","NFLX.High","NFLX.Low","NFLX.Close")]
vClose<-volatility(ohcl,n=m,calc="close",N=252)
vParkinson<-volatility(ohcl,n=m,calc="parkinson", N=252)
vGK<-volatility(ohcl,n=m,calc="garman",N=252)
vClose[m];vParkinson[m];vGK[m]
```
The volatilities above show that, the more we stabalize this stock, lesser is the volatility. if we just consider the opening and closing price it shows a very high volatility but as we add more factors like random walk, highs and lows to it, the number starts to diffuse. This says that although the stock is volatile, the numbers of volatility may be a bit inflated.

### Task 2: Auto-Correlation and Auto-Regression

Follow the example in the book  `R Example 3.2/p. 74` and `R Example 4.1/p. 115`

<span style="color:red">
##### 2A) Calculate the ACF for a stock of your choice. Consider both the log return and squared log return. Interpret your results in terms of possible existence of autocorrelation.  

1,13,17,29 show some linear dependence on past values.
We do see some level of autocorrelation

</span>
```{r}
 getSymbols("ORCL",src="yahoo")
ORCLRD=dailyReturn(ORCL,periods="daily", type="log")

ORCLRD1<- ORCLRD^2
acf(ORCLRD)
pacf(ORCLRD)

acf(ORCLRD1)
pacf(ORCLRD1)

  
```

<span style="color:red">
##### 2B) Plot the exchange rate for USD versus another currency of your choice. Interpret your results in terms of behavior.
</span>
```{r}
getFX("INR/USD")
  plot(INRUSD)
```
As we can see the rates started declining since august and continued to fall rapidly upto october.In october the INR saw it's lowest point and then started gaining ground as it tried to regain it's previous position. The decline was a result of pull out by the FIIs.


<span style="color:red">
##### 2C) Test for the possible existence of an underlying AR(1) – Markov process in your exchange rate currency pair. To this end, plot the ACF and the partial ACF (PACF). Interpret your results.  Clearly refer to the lags, and their impacts in determining the order.
</span>
```{r}
  acf(INRUSD)

```
```{r}
pacf(INRUSD)
```
There exists a Markov process in the currency data as the past rate of the currency on any given date does not affect it's future rate given the present rate. The future rate is independent and moves based on other factors than the past rate. As we can see in the ACF and PACF  plots there is no substantial linear correlation.


### Task 3: Granger Causality Test

To conduct this test the package `lmtest` will be required, as already done in the code chunk below.

```{r}
# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("lmtest",quietly = TRUE))
  install.packages("lmtest",dependencies = TRUE, repos = "https://cloud.r-project.org")
library(lmtest)


```

<span style="color:red">
##### 3A) Include below the code chunk to solve for 3.5.7 R Lab/p. 106.  Write your conclusions.
</span>

```{r}
# More information about the data used in testing for causality can be obtained by typing the name of the data set `ChickEgg` in the R Help menu.

grangertest(chicken ~ egg, order = 3, data = ChickEgg)
grangertest(egg ~ chicken, order = 3, data = ChickEgg)


```
In the first case of chicken causing egg, the p-value is 0.002966 which is less than the critical value, so we reject the null hypothesis and say that chicken does depend on eggs and chicken.

In the second case of eggs causing chicken, the p-value is 0.6238 which is above the critical value, so we cannot reject the null hypothesis that eggs may not depend on chicken.



<span style="color:red">
##### 3B) Briefly describe the data in terms of time range and variables. Similar to the linear autoegressive model described in class, write the mathematical regression model solved in each Granger test, including the proper order. Use naming conventions, and notations more reflective of the data set considered for  `ChickEgg`.
</span>


### Task 4: Mathematical Proof

<span style="color:red">
##### 4A) Prove the two results in Eq (2.32)/p. 53.  No R-coding is needed here.  Clearly show your steps. Hint: Use the definition of $E(X^n)$ for X-log normally distributed.   Observe also that $Var(X) = E(X^2)-E^2(X)$ for any random variable X.
</span>




```


*[http://computationalfinance.lsi.upc.edu ](http://computationalfinance.lsi.upc.edu)
