Earnings Differentials Between US Born and Migrant Workers
Econometrics Data Analysis Project
Author
Affiliations
John Karuitha
Karatina University, School of Business
Graduate School of Business Administration, University of the Witwatersrand
Published
September 7, 2023
Modified
September 7, 2023
Abstract
In this analysis, I examine the drivers of the differential in earnings between US born and migrant workers in the United States. The analysis shows that migrants have the lowest earnings in their first year of arrival. Earnings increase steeply in the first five years, followed by a gradual decline. The drivers of income differentials include gender, age, race, education, certification, hours worked, job location (rural vs urban), and occupation. The data has a severe case of missing data making analysis challenging. After including more control variables. it appears that all else remaining the same, foreign born workers are likely to earn more than US born workers. There is potential for omitted variable bias.
Introduction
The aim of the project is to study the differences in real wages in 2019 US$ (rw) between immigrants and non-immigrants, and draw some conclusions as to what explains these differences. The issue of migrant wages features prominently in the United States, a country that receives a significant proportion of migrants from around the world. A major issue is that migrants contribute significantly to wage inequality in the United States. While some researchers argue that migrants do exacerbate wage inequality, other researchers opine that the observation only holds for unskilled migrants (Lin & Weiss, 2019).
In light of these concerns, the objectives of this analysis are as follows;
To quantify the wage gap between immigrants and non-immigrants in the United States.
To investigate whether and how the wage gap varies by time since immigrants entered the US.
To identify possible drivers of the migrant wage gap using regression analysis.
The data has observations for variables. I start by eliminating variables that have no data which leaves 133 variables. The data has substantial missing values. The target variable, real wages (rw) has 137,111 data points missing, which is 47.054119908027 %. I drop all the observations that have a missing value for the target variable, real wages (rw), which leaves us with 154279 observations (Amo-Agyei & Office, 2020).
[1] 154279
I visualise the missing data below. There is still 23% of the data missing.
The “Data” section give an overview of the analysis dataset you constructed, and describe the differences between immigrants and non-immigrants.
In the appendix below you will find a list of variables and description. Categorical variables are imported as factor variables with descriptive labels, and the file should be self explanatory.
The dataset contains many demographic and employment related variables that may explain wages. You should carefully go over the variable list to see what information is available and tabulate variables to see how they are coded.
Empirical Approach
I first do exploratory analysis to quantify the wage gap between migrant and non-migrant workers in the United States. specifically, I construct plots and calculate summary statistics.
Next, I run regressions to examine the factors that have a relationship with real wages between migran and non-migrant workers. In this analysis, I use nationality and the length that migrants have been in the United States as controls, over and above personal and household variables that may affect income. From theory and past empirical analysis, I find that the following factors have a significant relationship with wage differentials between migrants and non-immigrants.
Quantifying the Wage Gap Between Migrants and Non_Immigrants
I first start by running summary statistics regarding the earnings of migrants versus non-immigrants.
Summary of Wages for US Citizens and Migrants
skim_type
forborn
Mean
SD
Q1
Median
Q3
Max
numeric
US
25.94918
19.59816
13.95
20
31.25000
392.3050
numeric
Foreign
25.31543
20.56011
13.00
18
29.81375
288.3333
Table 1 shows a notable disparity in wages between migrants and citizens. We can visualise this disparity. The Figure below shows that we fail to reject the null hypothesis that there is no wage disparity and accept the alternative hypothesis that there is indeed a significant wage disparity between migrants and US born residents at 1% significance level.
Wage Differential Between Migrants and Non-Immigrants
Which factors Have a Notable relationship with the Wage Gap Between Migrants and Non-Immigrants?
In this section, I use data visualization to explore factors that may have a significant relationship with wages for migrants and non-immigrants.
I start by doing a pairs plot for all the variables that I hypothesize have a relationship with the wages gap.
The data shows that there is a notable difference in the incomes of citizens and immigrants. Citizens have a higher mean and median income compared to immigrants.
The income of immigrants varies by the time of arrival. New arrivals receive substantially less income compared to other people. However, the income improves markedly after the second year. After the 5th year, there is a gradual decline in income.
The significant drivers of wage differential between migrants and non-migrants are;
Foreign Born (forborn).
The level of education (educ).
Gender (female).
Occupation, which may have high correlation with the level of education (docc03).
Age, which may proxy experience (age).
Race (wbhaom).
Certification (wbhaom).
Hours worked (hourslw).
Family income (faminc).
Time of arrival in the United States (arrived).
Pairs Plots
Regression Analysis
This section presents the findings from my regression models which look at
The immigrant wage gap and possible explanations
Whether and how the wage gap varies by time since immigrants entered the US.
In this analysis, I regress the income (rw) of individuals and several variables. The summary of regression results is in the table below. The analysis shows the following variables to be the significant drivers of wages.
Age: This variable has a positive relationship with income. Meaning that older employees with greater experience tend to have more pay, ceteris paribus.
Education: More educated employees have better pay compared to less educated employees, all else remaining the same.
Certification: Employees with a professional certification eran more on average, all else remaing the same.
Race: White people tend to have better pay compared to employees from other races hodling other factors constant.
Hours worked (hourslw): The more hours an employee puts in , the higher the average pay holding other factors constant.
Rural: Employees in rural locations tend to get less pay.
Occupation: Some occupations are more lucrative than others. Given that immigrants are likely to occupy unskilled occupations, it is likely that they have lower average salaries.
Year of arrival in the united states: Very new arrivals in the US have notably lower pay compared to those that arrived earlier. The income rises after the first year and then declines gradually.
We start by running a simple model of real wages against the forborn, a variable that captures whether or not the person was born outside the United States.
Call:
lm(formula = rw ~ forborn, data = my_df)
Residuals:
Min 1Q Median 3Q Max
-24.95 -12.20 -6.08 5.30 366.36
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 25.94918 0.05429 477.953 < 2e-16 ***
forbornForeign -0.63374 0.14345 -4.418 9.98e-06 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 19.74 on 154277 degrees of freedom
Multiple R-squared: 0.0001265, Adjusted R-squared: 0.00012
F-statistic: 19.52 on 1 and 154277 DF, p-value: 9.977e-06
This simple model shows that foreign born US residents earn significantly less than the people born in the United States. However, the explanatory power of the model is too low. I add more variables in the next model.
The explanatory power of the model improves significantly. It appears that all else remaining the same, foreign born workers earn significantly more than US citizens. That means that given two people with the same characteristics (the variables in the model), except that one is US born and the other is not, the foreign born worker is likely to be earning more.
Summary and conclusion
In this analysis, I examined the drivers of the differential in earnings between US born and migrant workers in the United States. The data shows that migrants have the lowest earnings in their first year of arrival. Earnings increase steeply in the first five years, followed by a gradual decline. The drivers of income differentials include gender, age, race, education, certification, hours worked, job location (rural vs urban), and occupation. The data has a severe case of missing data making analysis challenging. After including more control variables. it appears that all else remaining the same, foreign born workers are likely to earn more than US born workers. There is potential for omitted variable bias.
References
Amo-Agyei, S., & Office, I. L. (2020). The migrant pay gap: Understanding wage differences between migrants and nationals. International Labour Organisation (ILO).
Lin, K.-H., & Weiss, I. (2019). Immigration and the wage distribution in the united states. Demography, 56(6), 2229–2252.