About
This worksheet has three main taks: analyze the time series of returns, assess for normal distribution, and check for exponential behavior of prices time series.
Setup
Remember to always set your working directory to the source file location. Go to ‘Session’, scroll down to ‘Set Working Directory’, and click ‘To Source File Location’. Read carefully the below and follow the instructions to complete the tasks and answer any questions. Submit your work to RPubs as detailed in previous notes.
Note
For clarity, tasks/questions to be completed/answered are highlighted in red color (color visible only in preview mode) and numbered according to their particular placement in the task section. Type your answers outside the red color tags!
Quite often you will need to add your own code chunk. Execute sequentially all code chunks, preview, publish, and submit link on Sakai following the naming convention. Make sure to add comments to your code where appropriate. Use own language!
Any sign of plagiarism, will result in dissmissal of work!
Task 1: Plot of Returns & Testing for Normality Distribution
In this task we will look at various type of daily returns calculations for comparison and to test normality.
# Require will load the package only if not installed
# Dependencies = TRUE makes sure that dependencies are install
if(!require("quantmod",quietly = TRUE))
install.packages("quantmod",dependencies = TRUE, repos = "https://cloud.r-project.org")
##### 1A) Follow the instructions on p.41 to generate four plots for daily, daily log, weekly, and monthly returns. Select a stock of your choice and a time period long enough (5-10 years) to capture the returns behavior.
getSymbols('AAPL',src='yahoo', from="2008-01-01", to="2018-12-30")
[1] "AAPL"
aplRd = periodReturn(AAPL,period="daily")
plot(aplRd, main="APPLE daily returns")

aplRd2 = periodReturn(AAPL,period="daily",type="log")
plot(aplRd, main="APPLE daily log returns")

aplRd3 = periodReturn(AAPL,period="weekly")
plot(aplRd, main="APPLE weekly returns")

aplRd4 = periodReturn(AAPL,period="monthly")
plot(aplRd, main="APPLE monthly returns")

##### 1B) For the case of daily log returns only, write down the mathematical formula representing the calculation in the code. Confirm integrity of your mathematical formula by selecting a recent data point from your time series object, substituting the corresponding values in the formula to manually calculate the log return, and comparing both results.
The mathematical formula representing the calculation in the code is \(r_{t}\)=ln(1+\(R_{t}\))=ln(\(\frac{P_{t}}{P_{t-1}}\))=ln\(P_{t}\)-ln\(P_{t-1}\), in which \(r_{t}\) is daily log return, \(R_{t}\) is daily return, \(P_{t}\) is daily price of APPL. To confirm the integrity of my mathematical formula, I select a recent data point from time series object: the daily price on 11-12-2018 is 194.17, and the daily price on 11-13-2018 is 192.23. The daily return on 11-13-2018 is -9.991255\(10^{-3}\), and the calculated daily log return is ln(192.23)-1n(194.17)=-1.004149\(10^{-2}\), which is almostly the same as the data -1.004150*\(10^{-2}\) shown on the daily-log-returns plot.
##### 1C) Check the normality of the daily returns using the R function qqnorm() to generate a Q-Q plot. For the function to work properly, you will need to extract first the numeric values from the time series object. Note that a time series object contains both a date and a corresponding value. To extract the numerical value only, on can use the R function as.numeric() on the time series object. Explain what the Y and X axis of the Q-Q plot represent, and share your observation on the normality of the returns distribution.
getSymbols('AAPL',src='yahoo', from="2007-01-01", to="2017-12-30")
[1] "AAPL"
aplRd = periodReturn(AAPL,period="daily")
numericv=as.numeric(aplRd)
#dsd=density(aplRd) #estimate density of daily log ret
ylim=c(-0.05,0.05) #set y limit
qqnorm(numericv, ylim, main = "Normal Q-Q Plot",
xlab = "Theoretical Quantiles", ylab = "Sample Quantiles",
plot.it = TRUE, datax = FALSE)
qqline(numericv, datax = FALSE, distribution = qnorm,
probs = c(0.25, 0.75), qtype = 7)

The X axis of the Q-Q plot represents the theoretical quantiles of daily-returns data set, the Y axis of the Q-Q plot represents the sample quantiles of daily-returns data set. Since the plot we draw is near the qqline, which is the Q-Q plot of normal distrubution, we can conclude that although the distribution of daily returns of APPL stock is symmetric with fat tails, it is still similar to the normal distribution.
Task 2: Density Distribution
Another way to assess the normality of a distribution, other than a Q-Q plot, is to look at the actual density distribution and compare to a normal distribution.
##### 2A) Follow the example in R Lab 2.7.9/p. 70 to generate the density distribution for your stock of choice. Comment on your results.
require(quantmod)
getSymbols("AAPL",src="yahoo")
[1] "AAPL"
appl=AAPL
apRd= periodReturn(appl,period="daily",type="log")
dsd=density(apRd) #estimate density of daily log ret
yl=c(min(dsd$y),max(dsd$y)) #set y limits
plot(dsd,main=NULL,ylim=yl)
##plot the normal density with mean, stdv of apRd
a=seq(min(apRd),max(apRd),0.001)
points(a,dnorm(a,mean(apRd),sd(apRd)), type="l",lty=2)

apRd= periodReturn(appl,period="weekly",type="log")
dsd=density(apRd) #estimate density of weekly log ret
yl=c(min(dsd$y),max(dsd$y)) #set y limits
plot(dsd,main=NULL,ylim=yl)
##plot the normal density with mean, stdv of apRd
a=seq(min(apRd),max(apRd),0.001)
points(a,dnorm(a,mean(apRd),sd(apRd)), type="l",lty=2)

apRd= periodReturn(appl,period="monthly",type="log")
dsd=density(apRd) #estimate density of monthly log ret
yl=c(min(dsd$y),max(dsd$y)) #set y limits
plot(dsd,main=NULL,ylim=yl)
##plot the normal density with mean, stdv of apRd
a=seq(min(apRd),max(apRd),0.001)
points(a,dnorm(a,mean(apRd),sd(apRd)), type="l",lty=2)

From the plots of results above, we can see the density distributions for AAPL stock’s daily, weekly and monthly returns. And we can also conclude that the aggregational normality holds in stock returns, which refers to the fact that as one increases the time scale (from daily to monthly), the distribution of the related time-period returns looks more like a normal distribution.
Task 3: Exponential Behavior of Prices & Curve Fitting
In general, the price history of a stock, over a sufficiently large time window, tends to follow an exponential curve. Many other economic indicators like GDP, population growth, and inflation also follow exponential growth over a long time. Keep in mind that for investment purposes we care more about returns and not prices.
##### 3A) Follow the example in R Lab 2.7.2/p. 67 or R Labs 2 from bookâs website (*) to generate an exponential fit for the Dow Jones Industrial Average DJIA. In case the suggested command in the book does not work, consider using instead the command in the code chunk below to capture the DIJA prices.
#Federal Reserve Bank of St Louis
getSymbols.FRED("DJIA",env=globalenv())
[1] "DJIA"
serie=DJIA["2008/2018"]
price=as.numeric(serie) #extract numeric values of price
time = index(serie) #extract the indices
x=1:length(price)
model=lm(log(price)~x)
expo=exp(model$coef[1]+model$coef[2]*x)
plot(x=time,y=price, main="Dow Jones",type="l")
lines(time,expo,col=2,lwd=2)

##### 3B) Write down the mathematical form representing the exponential function in the code. Substitute for the exact coefficients in the exponential form and clearly label the variables in the function, in particular the time index.
The mathematical form representing the exponential function in the code is expo=\(e^{coef[1]+coef[2]*x}\), in which expo is the fitted price variable of the DJIA, coef[1] and coef[2] are the estimated coefficients of the function log(price)=coef[1]+coef[2]*x, x is an continuous integer vector that has number from 1 to the length of the DJIA’s price data set. Price is the real daily numeric value of DJIA, and time index is a data collection that contains the date of the daily value of DJIA.
##### 3C) Repeat the exercise in 3A) for AAPL Adjusted prices.
getSymbols('AAPL',src='yahoo', from="2008-01-01", to="2018-12-30")
[1] "AAPL"
##to see only the adjusted close price for that period
applAdj = AAPL$AAPL.Adjusted["2008-01-01/2018-12-30"]
serie=applAdj["2008/2018"]
price=as.numeric(serie) #extract numeric values of price
time = index(serie) #extract the indices
x=1:length(price)
model=lm(log(price)~x)
expo=exp(model$coef[1]+model$coef[2]*x)
plot(x=time,y=price, main="APPL stock",type="l")
lines(time,expo,col=2,lwd=2)

*http://computationalfinance.lsi.upc.edu
---
title: "FINC621 Winter 2018-19 Lab Worksheet 02"
author: "Yue Huang"
date: "11-21-2018"
output:
  html_notebook: default
  html_document: default
subtitle: Time Series Distributions & Normality (finc621-lab02)
---

### About

This worksheet has three main taks: analyze the time series of returns, assess for normal distribution, and check for exponential behavior of prices time series.

### Setup

Remember to always set your working directory to the source file location. Go to 'Session', scroll down to 'Set Working Directory', and click 'To Source File Location'. Read carefully the below and follow the instructions to complete the tasks and answer any questions.  Submit your work to RPubs as detailed in previous notes. 

### Note

For clarity, tasks/questions to be completed/answered are highlighted in red color (color visible only in preview mode) and numbered according to their particular placement in the task section.  Type your answers outside the red color tags!

Quite often you will need to add your own code chunk. Execute sequentially all code chunks, preview, publish, and submit link on Sakai following the naming convention. Make sure to add comments to your code where appropriate. Use own language!

**Any sign of plagiarism, will result in dissmissal of work!**

--------------

### Task 1: Plot of Returns & Testing for Normality Distribution

In this task we will look at various type of daily returns calculations for comparison and to test normality.

```{r}
# Require will load the package only if not installed 
# Dependencies = TRUE makes sure that dependencies are install
if(!require("quantmod",quietly = TRUE))
  install.packages("quantmod",dependencies = TRUE, repos = "https://cloud.r-project.org")
```


<span style="color:red">
##### 1A) Follow the instructions on p.41 to generate four plots for daily, daily log, weekly, and monthly returns. Select a stock of your choice and a time period long enough (5-10 years) to capture the returns behavior. 
</span>

```{r}
getSymbols('AAPL',src='yahoo', from="2008-01-01", to="2018-12-30")
aplRd = periodReturn(AAPL,period="daily")
plot(aplRd, main="APPLE daily returns")
aplRd2 = periodReturn(AAPL,period="daily",type="log")
plot(aplRd, main="APPLE daily log returns")
aplRd3 = periodReturn(AAPL,period="weekly")
plot(aplRd, main="APPLE weekly returns")
aplRd4 = periodReturn(AAPL,period="monthly")
plot(aplRd, main="APPLE monthly returns")
```


<span style="color:red">
##### 1B) For the case of **daily log returns**  only, write down the mathematical formula representing the calculation in the code. Confirm integrity of your mathematical formula by selecting a recent data point from your time series object, substituting the corresponding values in the formula to manually calculate the log return, and comparing both results.
</span>

The mathematical formula representing the calculation in the code is $r_{t}$=ln(1+$R_{t}$)=ln($\frac{P_{t}}{P_{t-1}}$)=ln$P_{t}$-ln$P_{t-1}$, in which $r_{t}$ is daily log return, $R_{t}$ is daily return, $P_{t}$ is daily price of APPL. To confirm the integrity of my mathematical formula, I select a recent data point from time series object: the daily price on 11-12-2018 is 194.17, and the daily price on 11-13-2018 is 192.23. The daily return on 11-13-2018 is -9.991255*$10^{-3}$, and the calculated daily log return is
ln(192.23)-1n(194.17)=-1.004149*$10^{-2}$,
which is almostly the same as the data -1.004150*$10^{-2}$ shown on the daily-log-returns plot. 

<span style="color:red">
##### 1C) Check the normality of the **daily returns** using the R function `qqnorm()` to generate a Q-Q plot. For the function to work properly, you will need to extract first the numeric values from the time series object. Note that a time series object contains both a date and a corresponding value. To extract the numerical value only, on can use the R function `as.numeric()` on the time series object. Explain what the Y and X axis of the Q-Q plot represent, and share your observation on the normality of the returns distribution.
</span>

```{r}
getSymbols('AAPL',src='yahoo', from="2007-01-01", to="2017-12-30")
aplRd = periodReturn(AAPL,period="daily")
numericv=as.numeric(aplRd)
ylim=c(-0.05,0.05) #set y limit
qqnorm(numericv, ylim, main = "Normal Q-Q Plot",
       xlab = "Theoretical Quantiles", ylab = "Sample Quantiles",
       plot.it = TRUE, datax = FALSE)
qqline(numericv, datax = FALSE, distribution = qnorm,
       probs = c(0.25, 0.75), qtype = 7)
```
The X axis of the Q-Q plot represents the theoretical quantiles of daily-returns data set, the Y axis of the Q-Q plot represents the sample quantiles of daily-returns data set. Since the plot we draw is near the qqline, which is the Q-Q plot of normal distrubution, we can conclude that although the distribution of daily returns of APPL stock is symmetric with fat tails, it is still similar to the normal distribution.

### Task 2: Density Distribution

Another way to assess the normality of a distribution, other than a Q-Q plot, is to look at the actual density distribution and compare to a normal distribution.

<span style="color:red">
##### 2A) Follow the example in R Lab 2.7.9/p. 70 to generate the density distribution for your stock of choice. Comment on your results.
</span>

```{r}
require(quantmod)
getSymbols("AAPL",src="yahoo")
appl=AAPL
apRd= periodReturn(appl,period="daily",type="log")
dsd=density(apRd) #estimate density of daily log ret
yl=c(min(dsd$y),max(dsd$y)) #set y limits
plot(dsd,main=NULL,ylim=yl)
##plot the normal density with mean, stdv of apRd
a=seq(min(apRd),max(apRd),0.001)
points(a,dnorm(a,mean(apRd),sd(apRd)), type="l",lty=2)
apRd= periodReturn(appl,period="weekly",type="log")
dsd=density(apRd) #estimate density of weekly log ret
yl=c(min(dsd$y),max(dsd$y)) #set y limits
plot(dsd,main=NULL,ylim=yl)
##plot the normal density with mean, stdv of apRd
a=seq(min(apRd),max(apRd),0.001)
points(a,dnorm(a,mean(apRd),sd(apRd)), type="l",lty=2)
apRd= periodReturn(appl,period="monthly",type="log")
dsd=density(apRd) #estimate density of monthly log ret
yl=c(min(dsd$y),max(dsd$y)) #set y limits
plot(dsd,main=NULL,ylim=yl)
##plot the normal density with mean, stdv of apRd
a=seq(min(apRd),max(apRd),0.001)
points(a,dnorm(a,mean(apRd),sd(apRd)), type="l",lty=2)
```
From the plots of results above, we can see the density distributions for AAPL stock's daily, weekly and monthly returns. And we can also conclude that the aggregational normality holds in stock returns, which refers to the fact that as one increases the time scale (from daily to monthly), the distribution of the related time-period returns looks more like a normal distribution.

### Task 3: Exponential Behavior of Prices & Curve Fitting

In general, the price history of a stock, over a sufficiently large time window, tends to follow an exponential curve. Many other economic indicators like GDP, population growth, and inflation also follow exponential growth over a long time. Keep in mind that for investment purposes we care more about returns and not prices.

<span style="color:red">
##### 3A) Follow the example in R Lab 2.7.2/p. 67 or R Labs 2 from book’s website (*) to generate an exponential fit for the Dow Jones Industrial Average DJIA. In case the suggested command in the book does not work, consider using instead the command in the code chunk below to capture the DIJA prices.  
</span>

```{r}
#Federal Reserve Bank of St Louis
getSymbols.FRED("DJIA",env=globalenv())
serie=DJIA["2008/2018"]
price=as.numeric(serie) #extract numeric values of price
time = index(serie) #extract the indices
x=1:length(price)
model=lm(log(price)~x)
expo=exp(model$coef[1]+model$coef[2]*x)
plot(x=time,y=price, main="Dow Jones",type="l")
lines(time,expo,col=2,lwd=2)

```

<span style="color:red">
##### 3B) Write down the mathematical form representing the exponential function in the code. Substitute for the exact coefficients in the exponential form and clearly label the variables in the function, in particular the time index. 
</span>

The mathematical form representing the exponential function in the code is expo=$e^{coef[1]+coef[2]*x}$, in which expo is the fitted price variable of the DJIA, coef[1] and coef[2] are the estimated coefficients of the function log(price)=coef[1]+coef[2]*x, x is an continuous integer vector that has number from 1 to the length of the DJIA's price data set. Price is the real daily numeric value of DJIA, and time index is a data collection that contains the date of the daily value of DJIA.


<span style="color:red">
##### 3C) Repeat the  exercise in 3A) for AAPL Adjusted prices.
</span>

```{r}
getSymbols('AAPL',src='yahoo', from="2008-01-01", to="2018-12-30")
##to see only the adjusted close price for that period
applAdj = AAPL$AAPL.Adjusted["2008-01-01/2018-12-30"]
serie=applAdj["2008/2018"]
price=as.numeric(serie) #extract numeric values of price
time = index(serie) #extract the indices
x=1:length(price)
model=lm(log(price)~x)
expo=exp(model$coef[1]+model$coef[2]*x)
plot(x=time,y=price, main="APPL stock",type="l")
lines(time,expo,col=2,lwd=2)

```





*[http://computationalfinance.lsi.upc.edu ](http://computationalfinance.lsi.upc.edu)
