Syeda Hafsa Mujeeb (s3796697) Vaishnaavee Rajaraman (s3813544) Vidavaluru Sai Haneesha (s3799365)
Last updated: 27 October, 2019
The aim of this report is to investigate if there is a decrease in forest land area. The data was collected for east-asia and pacific regions for the years 2010 and 2015.
The sample was measured twice, in 2010 and in 2015. This makes it dependent or paired samples.
Since these two are dependent samples, paired sample t-test is used to determine if there was any statistically significant change in the forest land area in these regions.
The data was collected from worldbank website using world development indicators database https://databank.worldbank.org/source/world-development-indicators#
Countries under the region East asia and Pacific were selected for the investigation and the forest area(in sq km) of these countries were considered for the years 2010 and 2015
For year ‘2010’
ForestArea %>% summarise(Min = min(`2010`,na.rm = TRUE),
Q1 = quantile(`2010`,probs = .25,na.rm = TRUE),
Median = median(`2010`, na.rm = TRUE),
Q3 = quantile(`2010`,probs = .75,na.rm = TRUE),
Max = max(`2010`,na.rm = TRUE),
Mean = mean(`2010`, na.rm = TRUE),
SD = sd(`2010`, na.rm = TRUE),
n = n(),
Missing = sum(is.na(`2010`))) -> table1
knitr::kable(table1)| Min | Q1 | Median | Q3 | Max | Mean | SD | n | Missing |
|---|---|---|---|---|---|---|---|---|
| 0 | 353.1 | 9929 | 151885 | 2006103 | 182015.9 | 410444.5 | 42 | 7 |
For year ‘2015’
ForestArea %>% summarise(Min = min(`2015`,na.rm = TRUE),
Q1 = quantile(`2015`,probs = .25,na.rm = TRUE),
Median = median(`2015`, na.rm = TRUE),
Q3 = quantile(`2015`,probs = .75,na.rm = TRUE),
Max = max(`2015`,na.rm = TRUE),
Mean = mean(`2015`, na.rm = TRUE),
SD = sd(`2015`, na.rm = TRUE),
n = n(),
Missing = sum(is.na(`2015`))) -> table2
knitr::kable(table2)| Min | Q1 | Median | Q3 | Max | Mean | SD | n | Missing |
|---|---|---|---|---|---|---|---|---|
| 0 | 349 | 10172 | 155860 | 2083213 | 183224.7 | 419745.6 | 42 | 7 |
ForestArea<- na.omit(ForestArea)
ForestArea %>% select(`2010`:`2015`) %>% boxplot(horizontal = TRUE)For 2015:
\[H_0: \mu_\Delta= 0 \]
\[H_A: \mu_\Delta \ne 0\] - For paired sample t-test we use C.I.= 95% and α=0.05
#paired t-test:
t.test(
ForestArea$`2010`, ForestArea$`2015`,
paired = TRUE,
alternative = "two.sided"
)##
## Paired t-test
##
## data: ForestArea$`2010` and ForestArea$`2015`
## t = -0.45411, df = 34, p-value = 0.6526
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -6618.576 4200.919
## sample estimates:
## mean of the differences
## -1208.828
From the paired sample t-test the t value is t = -0.45411 and p value reported is 0.6526. The p value is greater than α=0.05, so we fail to reject H0. Also the 95% CI is [-6618.576 , 4200.919] which captures the null hypothesis H0, so we fail to reject H0.
From the above results, it was found that there was no statistically significant difference in the forest area from the year 2010 to 2015 in east asia and pacific region countries.
We can say that these countries were sucessfull in maintaining their forest area over a period of 5 years and it has neither increased nor decreased.