In this project you will investigate the exponential distribution in R and compare it with the Central Limit Theorem. The exponential distribution can be simulated in R with rexp(n, lambda), where lambda is the rate parameter. The mean of the exponential distribution is 1/lambda, and its standard deviation is also 1/lambda. Set lambda = 0.2 for all of the simulations. You will investigate the distribution of averages of 40 exponentials. Note that you will need to run 1,000 simulations.
lambda <- 0.2
nSample <- 40
nSimulation <- 1000
hist(rexp(1000, lambda))
# Average of nSample exponentials, repeated nSimulation times
data <- replicate(nSimulation, mean(rexp(nSample, lambda)))
hist(data)
sample_mean <- mean(data)
theory_mean <- 1/lambda
print(sample_mean)
## [1] 4.984811
print(theory_mean)
## [1] 5
sample_var <- var(data)
theory_sd <- (1/lambda)/sqrt(nSample)
theory_var <- theory_sd^2
print(sample_var)
## [1] 0.6178577
print(theory_var)
## [1] 0.625
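The theoretical value printed above can be checked by hand. As a short worked derivation, writing n for the number of exponentials per average (nSample in the code) and using the standard result that the variance of a mean of n independent draws is the population variance divided by n:

```latex
\operatorname{Var}(\bar{X}) = \frac{\sigma^2}{n}
  = \frac{(1/\lambda)^2}{n}
  = \frac{(1/0.2)^2}{40}
  = \frac{25}{40}
  = 0.625
```

This matches theory_var, and the simulated sample_var is close to it.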
How can one tell that the distribution is approximately normal?
library(ggplot2)
data <- data.frame(data)
g <- ggplot(data, aes(x = data)) +
  geom_histogram(aes(y = ..density..)) +
  stat_function(fun = dnorm, args = list(mean = mean(data$data), sd = sd(data$data)), color = "blue")
print(g)
## stat_bin: binwidth defaulted to range/30. Use 'binwidth = x' to adjust this.
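A normal Q-Q plot is another common way to judge approximate normality, in addition to the histogram with a normal curve overlaid. The sketch below is self-contained and is not part of the original analysis: it regenerates the averages under the hypothetical name averages, and the seed is an assumption chosen only for reproducibility. If the points fall close to the reference line, the distribution of averages is approximately normal.

```r
# Minimal sketch: normal Q-Q plot for the averages of 40 exponentials.
# 'averages' and the seed are assumptions, not taken from the original report.
set.seed(1)
lambda <- 0.2
nSample <- 40
nSimulation <- 1000

# One average of nSample exponentials per simulation
averages <- replicate(nSimulation, mean(rexp(nSample, lambda)))

# Sample quantiles against theoretical normal quantiles, with a reference line
qqnorm(averages)
qqline(averages, col = "blue")
```

Some departure from the line in the upper tail is plausible here, since the exponential distribution is right-skewed and 40 draws per average may not remove that skew entirely.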