In this module we will discuss the foundation of statistics and data science.
statistics is science of data collection, organization, presentation,s summarizing, interpreting , drawing, calculation and prescribing the results. the subject is broadly classified into two main fields namely:
descriptive statistics is simply defined as the sample to sample statistics. It is mainly classified as the study of five different measurements including:
##Data management
library(haven)
## Warning: package 'haven' was built under R version 4.3.2
data<-read_dta("Togdheer2.dta")
library(psych)
## Warning: package 'psych' was built under R version 4.3.2
describe(data$numberofdeath)
boxplot(data$mothersageatfirstbirth,col="blue")
hist(data$mothersageatfirstbirth, col="green")
plot(density(data$mothersageatfirstbirth),col="red")
polygon(density(data$mothersageatfirstbirth),col="cyan",border="magenta")