In this module we discuss the foundations of statistics and data science.
Statistics is the science of collecting, organizing, presenting, summarizing, and interpreting data, and of drawing conclusions and reporting results.
The subject is broadly classified into two main fields, namely descriptive statistics and inferential statistics.
Descriptive statistics summarizes and describes the data in a sample, without generalizing beyond that sample.
It is mainly classified as the study of five different measurements, including:
Importing and exporting data sets from different files and statistical software.
We need the haven package to easily import data files produced by different statistical software.
library(haven)
## Warning: package 'haven' was built under R version 4.3.2
# Read the Stata file into a data frame (tibble)
data <- read_dta("Togdheer2.dta")
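Since this part is about both importing and exporting, here is a minimal round-trip sketch with haven. The toy data frame, its column names, and the temporary file names are assumptions made up for illustration; they are not part of the Togdheer data set.

```r
library(haven)

# Hypothetical toy data; the column names are invented for this example.
toy <- data.frame(
  id = 1:5,
  mothers_age = c(18, 21, 19, 25, 22)
)

# Export to Stata (.dta) and SPSS (.sav) files in a temporary directory.
dta_path <- file.path(tempdir(), "toy.dta")
sav_path <- file.path(tempdir(), "toy.sav")
write_dta(toy, dta_path)
write_sav(toy, sav_path)

# Import the files again; both calls return a tibble.
from_stata <- read_dta(dta_path)
from_spss  <- read_sav(sav_path)
```

The same pattern should carry over to other formats that haven supports, for example read_sas() for SAS files.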
library(psych)
## Warning: package 'psych' was built under R version 4.3.2
# Summary statistics (n, mean, sd, median, min, max, ...) for the number of deaths
describe(data$numberofdeath)
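To see what describe() summarizes, some of the statistics it reports (n, mean, sd, median, min, max, range, and se) can be computed by hand in base R. This is only a sketch on a hypothetical vector; the values of x are made up and do not come from the data set.

```r
# Hypothetical values standing in for a numeric survey variable.
x <- c(2, 0, 1, 3, NA, 1, 0, 4)

# Drop missing values before computing the statistics.
x_obs <- x[!is.na(x)]

summary_stats <- c(
  n      = length(x_obs),
  mean   = mean(x_obs),
  sd     = sd(x_obs),
  median = median(x_obs),
  min    = min(x_obs),
  max    = max(x_obs),
  range  = max(x_obs) - min(x_obs),
  se     = sd(x_obs) / sqrt(length(x_obs))  # standard error of the mean
)
print(round(summary_stats, 2))
```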
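# Box plot, histogram, and index plot of mother's age at first birth
boxplot(data$mothersageatfirstbirth, col = "blue")
hist(data$mothersageatfirstbirth, col = "green")
plot(data$mothersageatfirstbirth, col = "red")
# polygon() draws on the plot above; given a single vector, it uses the
# observation index as x and the values as y
polygon(data$mothersageatfirstbirth, col = "cyan", border = "magenta")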
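The polygon() call above connects the raw observations in index order. If a filled density curve is what is wanted instead, one possible approach is to estimate the density first and pass its coordinates to polygon(). The sketch below uses simulated ages, which are an assumption for illustration only and not the survey data.

```r
set.seed(1)

# Hypothetical ages, simulated only for illustration.
age <- round(rnorm(200, mean = 20, sd = 3))

# Kernel density estimate; d$x and d$y hold the curve coordinates.
d <- density(age)

# Draw the density curve, then fill the area under it.
plot(d, main = "Density of simulated ages", xlab = "Age")
polygon(d$x, d$y, col = "cyan", border = "magenta")
```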