Module I: introduction to statistics

In this module we will discuss the foundation of statistics and data science.

Fundamental concepts of statistics

statistics is science of data collection, organization, presentation,s summarizing, interpreting , drawing, calculation and prescribing the results. the subject is broadly classified into two main fields namely:

  1. descriptive statistics, and
  2. inferential statistics.

Descriptive statistics

descriptive statistics is simply defined as the sample to sample statistics. It is mainly classified as the study of five different measurements including:

  1. measure of central tendency
  2. measure of dispersion
  3. measure of position
  4. measure of shapes
  5. measure of frequency

##Data management

library(haven)
## Warning: package 'haven' was built under R version 4.3.2
data<-read_dta("Togdheer2.dta")
library(psych)
## Warning: package 'psych' was built under R version 4.3.2
describe(data$numberofdeath)
boxplot(data$mothersageatfirstbirth,col="blue")

hist(data$mothersageatfirstbirth, col="green")

plot(density(data$mothersageatfirstbirth),col="red")
polygon(density(data$mothersageatfirstbirth),col="cyan",border="magenta")