Module 1: Introduction to Statistics

In this module, we discuss the foundations of statistics and data science.

Fundamental concepts of statistics

Statistics is the science of collecting, organizing, presenting, summarizing, and interpreting data, and of drawing conclusions and reporting results from it.

The subject is broadly classified into two main fields, namely:

  1. Descriptive statistics, and
  2. Inferential statistics.
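
The distinction between the two fields can be sketched in R. This is only an illustration: the vector `ages` holds made-up values, not the course data. The first two calls describe the sample itself, while the last one uses the sample to estimate a range for the population mean.

```r
# Hypothetical sample: ages of ten mothers at first birth (made-up values)
ages <- c(18, 21, 19, 24, 22, 20, 17, 25, 23, 21)

# Descriptive statistics: summarize the sample itself
sample_mean <- mean(ages)
sample_sd   <- sd(ages)

# Inferential statistics: use the sample to say something about the population,
# here a 95% confidence interval for the population mean (one-sample t-test)
ci <- t.test(ages)$conf.int

print(sample_mean)
print(sample_sd)
print(ci)
```

The confidence interval is wider than a single number because it reflects the uncertainty of generalizing from ten observations to a whole population.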

Descriptive statistics

Descriptive statistics summarizes and describes the data in a sample without generalizing beyond it; for this reason it is sometimes called sample-to-sample statistics.

It is mainly organized as the study of five kinds of measures:

  1. Measures of central tendency,
  2. Measures of dispersion,
  3. Measures of position,
  4. Measures of shape, and
  5. Measures of frequency.
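
As a rough sketch, one statistic from each of the five groups can be computed with base R. The vector `x` is made-up data, and the skewness line uses one common moment-based definition; packages such as psych may apply a small-sample adjustment, so their values can differ slightly.

```r
# Hypothetical sample (made-up values, for illustration only)
x <- c(2, 4, 4, 5, 7, 9, 11)

# 1. Central tendency: mean and median
x_mean   <- mean(x)
x_median <- median(x)

# 2. Dispersion: sample standard deviation and interquartile range
x_sd  <- sd(x)
x_iqr <- IQR(x)

# 3. Position: quartiles
x_quartiles <- quantile(x, probs = c(0.25, 0.50, 0.75))

# 4. Shape: moment-based skewness (third central moment over
#    the second central moment raised to the power 1.5)
dev    <- x - x_mean
x_skew <- mean(dev^3) / mean(dev^2)^1.5

# 5. Frequency: how often each distinct value occurs
x_freq <- table(x)

print(c(mean = x_mean, median = x_median, sd = x_sd, iqr = x_iqr, skew = x_skew))
print(x_quartiles)
print(x_freq)
```

A positive skewness value here indicates a longer tail toward the larger values, which agrees with the mean lying above the median.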

Data Management

This section covers importing and exporting data sets from different file formats and statistical software packages.

Importing data sets

We need the haven package, which makes it easy to import data files produced by other statistical software such as Stata, SPSS, and SAS.

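# Load haven for reading Stata, SPSS, and SAS files
library(haven)
# Read the Stata data set (the file must be in the working directory)
data <- read_dta("Togdheer2.dta")

# Load psych for its describe() summary function
library(psych)
# Summary statistics (n, mean, sd, median, min, max, skew, kurtosis, ...)
describe(data$numberofdeath)

# Box plot of mother's age at first birth
boxplot(data$mothersageatfirstbirth, col = "blue")

# Histogram of the same variable
hist(data$mothersageatfirstbirth, col = "green")

# Index plot: each observation plotted against its row position
plot(data$mothersageatfirstbirth, col = "red")
# Overlay a filled polygon through the same points on the open plot
polygon(data$mothersageatfirstbirth, col = "cyan", border = "magenta")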
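
The Data Management section mentions exporting as well as importing. A minimal sketch of a round trip with haven could look like the following. The data frame `toy`, its column names, and the temporary file are all made up for illustration. `write_dta()` and `read_dta()` are haven's Stata writer and reader; `write_sav()` and `read_sav()` play the same role for SPSS files.

```r
library(haven)

# Hypothetical data frame (made-up values, not the course data)
toy <- data.frame(
  id = 1:3,
  age_at_first_birth = c(19, 23, 21)
)

# Export to a Stata .dta file in a temporary location
path <- tempfile(fileext = ".dta")
write_dta(toy, path)

# Import the file again and confirm the values survived the round trip
toy_back <- read_dta(path)
print(toy_back)
```

Writing to `tempfile()` keeps the example from overwriting any real file; in practice the path would point to the folder where the exported data set should live.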