setwd(“~/Desktop”) read.csv(“income.csv”)
rawdata <- (“income.csv”) summary(rawdata)
title: "income output: html_document: css: ~/Desktop/cssfile.css
Income dataset description
Some of the variables in the income data set are:
Industry - The categorical variable showing the different industries of the Workers and their income
Occupation - The Occupation title the workers belong to categorised by industry
Workers - The volume of workers from the income dataset based on occupation title and the industry they belong to
Salary - The Dollar value of the workers based on Occupation title
\[ Industry gender preference = (Total income of Male - Total income of Female)/industry category \]
Gap of income across different industries. Does the difference happen by chance or statsitically significant
\[ \]