Susan Li

April 12, 2017

The H-1B program allows employers to temporarily employ foreign workers in the U.S on a nonimmigrant basis in specialty occupations. This is the most common visa status applied by international students after they complete higher education in the U.S and work in a full-time position. For those graduates to apply for H-1B visa, their employers must offer a job and petition for H-1B visa with the US immigration department.

The Office of Foreign Labor Certification (OFLC) generates program data. However, I downloaded the dataset from Kaggle directly after it has been mostly cleaned. To make it as relevant as possible, I will be only looking at the data from 2016.

I decide to remove all missing values, so about 3% of applications were omitted.

## 'data.frame':    629311 obs. of  11 variables:
##  $ X                 : int  1 2 3 4 5 6 7 8 10 11 ...
##  $ CASE_STATUS       : chr  "CERTIFIED-WITHDRAWN" "CERTIFIED-WITHDRAWN" "CERTIFIED-WITHDRAWN" "CERTIFIED-WITHDRAWN" ...
##  $ EMPLOYER_NAME     : chr  "UNIVERSITY OF MICHIGAN" "GOODMAN NETWORKS, INC." "PORTS AMERICA GROUP, INC." "GATES CORPORATION, A WHOLLY-OWNED SUBSIDIARY OF TOMKINS PLC" ...
##  $ SOC_NAME          : chr  "BIOCHEMISTS AND BIOPHYSICISTS" "CHIEF EXECUTIVES" "CHIEF EXECUTIVES" "CHIEF EXECUTIVES" ...
##  $ JOB_TITLE         : chr  "POSTDOCTORAL RESEARCH FELLOW" "CHIEF OPERATING OFFICER" "CHIEF PROCESS OFFICER" "REGIONAL PRESIDEN, AMERICAS" ...
##  $ FULL_TIME_POSITION: chr  "N" "Y" "Y" "Y" ...
##  $ PREVAILING_WAGE   : num  36067 242674 193066 220314 157518 ...
##  $ YEAR              : int  2016 2016 2016 2016 2016 2016 2016 2016 2016 2016 ...
##  $ WORKSITE          : chr  "ANN ARBOR, MICHIGAN" "PLANO, TEXAS" "JERSEY CITY, NEW JERSEY" "DENVER, COLORADO" ...
##  $ lon               : num  -83.7 -96.7 -74.1 -105 -90.2 ...
##  $ lat               : num  42.3 33 40.7 39.7 38.6 ...

Throughout the analysis, I will attempt to answer the following questions:

What are the top occupations?

Technology related professions such as software developer, computer system analyst, programmer are among the most in demand occupations, analyst, accountant, engineer are among the second most in demand occupations.

Who are the top employers that submit the most application?

Infosys Limited leads by a large margin and submitted over 25000 applications last year. As a matter of fact, eight of the top 20 employers are Indian multinational IT companies.

Technology leads and technology analysts are in huge demand at Infosys, developers and programmers are liked by Tata, Google is mainly interested in software engineers. Deloitte and Ernst & Young apply visa for consultants and advisors.

What occupations make the most Money?

When I look at prevailing wage distribution, I found something interesting.

##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max. 
##         0     57512     68411     89016     85176 329139200

Minimum wage is 0 and maximum wage is 329100000. I suspected ’0’s were missing values, and there are not many of them. But let’s have a look which company offered $329100000.

##         X CASE_STATUS      EMPLOYER_NAME           SOC_NAME
## 5275 5580      DENIED E AND D MEDIA INC. MARKETING MANAGERS
##                               JOB_TITLE FULL_TIME_POSITION PREVAILING_WAGE
## 5275 DIRECTOR, SOCIAL AND DIGITAL MEDIA                  Y       329139200
##      YEAR                 WORKSITE       lon      lat
## 5275 2016 SANTA MONICA, CALIFORNIA -118.4912 34.01945

For a marketing manager? I don’t understand. The application was denied anyway.

Majority of the prevailing wages were between 50K and 100K USD per annum.

physicians and surgeons enjoy the highest average prevailing wages that almost reach $175K per annum last year, computer information systems managers and electrical engineers take the second spot make approximate $163K per annum last year.

Which Employers pay the most?

When comes to the job title and employer, consultants hired by Deloitte enjoy the highest average prevailing wage, software engineer hird by Google are paid far more than the same job title hired by the other companeis.

## top_employer_df$EMPLOYER_NAME: ACCENTURE LLP
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   21299   64938   74006   76417   84723  214947 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: AMAZON CORPORATE LLC
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   39042   83138   91187   93594  101088  173056 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: CAPGEMINI AMERICA INC
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   21050   69493   78395   81108   90792  229382 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: COGNIZANT TECHNOLOGY SOLUTIONS U.S. CORPORATION
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   29557   59738   70554   72923   83388  163883 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: DELOITTE CONSULTING LLP
##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max. 
##     32219     62754     74963     94997     87818 136757920 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: ERNST & YOUNG U.S. LLP
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   32531   58178   72654   74122   85114  200928 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: GOOGLE INC.
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   32552   98342  103145  105294  119350  221291 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: HCL AMERICA, INC.
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   17264   60570   74090   75025   87838  165568 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: IBM CORPORATION
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   33966   71578   78749   80925   88327  210413 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: IBM INDIA PRIVATE LIMITED
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   42245   63315   68869   69878   74630 3277248 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: IGATE TECHNOLOGIES INC.
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   40435   63690   68869   69722   74693  129334 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: INFOSYS LIMITED
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   21299   59738   69971   72065   82202  180315 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: L&T TECHNOLOGY SERVICES LIMITED
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   16598   47674   56763   56659   63170  138299 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: LARSEN & TOUBRO INFOTECH LIMITED
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   27456   53518   62046   64043   73424  171413 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: MICROSOFT CORPORATION
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   37752   85176  101088   99116  110630  173139 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: SYNTEL CONSULTING INC.
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   34174   58677   66872   68783   74844  143749 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: TATA CONSULTANCY SERVICES LIMITED
##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max. 
##     15222     57928     63461     73699     67891 133209440 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: TECH MAHINDRA (AMERICAS),INC.
##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max. 
##     21202     64043     71427    189128     77605 170460160 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: UST GLOBAL INC
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   27456   52187   56555   57503   60112  145350 
## -------------------------------------------------------- 
## top_employer_df$EMPLOYER_NAME: WIPRO LIMITED
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   51979   63149   69867   72754   77771  183872

Microsoft and Google offer the highest wages to their H-1B Visa workers. Their median wage exceeded $100K per annum, while the median wage of UST Global and L&T Technology are less than $60K per annum.

Which states and cities apply the most H-1B visas?

Expectedly, California hires the most workers on H-1B visas, followed by Texas, New York, New Jersey and Illinois.

This time New York City takes the lead by a large margin in the number of H-1B Visa applications. Not only high tech companies in New York hire H-1B visa workers, but also New York’s fashion industry is heavily reliant on immigrants, from top designers to creative staffs to the sewing workers.

A Map of H-1B Visa Applications

states_map <- map_data(“state”) ggplot(visa_state, aes(map_id = region)) + geom_map(aes(fill = count), map = states_map) + scale_colour_brewer(palette=‘Greens’) + expand_limits(x = states_map\(long, y = states_map\)lat) + xlab(‘longitude’) + ylab(‘latitude’) + ggtitle(‘State H-1B Visa Applications’)

The End

It has been a great experience exploring 2016 H-1B visa petition data. With Trump’s Cracking down on the H-1B Visa program that Silicon Valley loves, I can’t wait to learn the data for 2017.