title: “US Perm Visa Denied” author: “CT” date: “2/26/2020” output: html_document —## US Visa Application for Labor Certification Dataset US Visa Application for Labor Certification Dataset Background: This dataset contains administrative data from employers’ Applications for Permanent Employment Certification (ETA Form 9089) and certification determinations processed by the Department’s Office of Foreign Labor Certification, Employment and Training Administration, where the date of the determination was issued on or after October 1, 2018, and on or before September 30, 2019.
The process is that the employers file and not the employee. In general, the DOL works to ensure that the admission of foreign workers to work in the U.S. will not adversely affect the job opportunities, wages and working conditions of U.S. workers. Once a permanent labor certification application has been approved by the DOL, the employer will need to seek the immigration authorization from the U.S. Citizenship and Immigration Services (USCIS). DOL processes Applications for Permanent Employment Certification, ETA Form 9089, except for Schedule A and sheepherder applications which are filed under 20 CFR § 656.16. The date the labor certification application is received by the DOL is known as the filing date and is used by USCIS and the Department of State as the priority date. After the labor certification application is certified by DOL, it is valid for 180 days and it should be submitted to the appropriate USCIS Service Center with a Form I-140, Immigrant Petition for Alien Worker. Purpose: The purpose of this project was to practice data visualization techniques using R. as a beginner. The process: Data cleaning, data subset and variable selection Since the dataset contained originally 154 columns with over 50,000 observations, a subset was selected for the analysis: visa applications “denied”. This step narrowed down the dataset close to 25,000 observations. From there, some 29 variables were selected to explore some key trends in the visa denied subset. Subsequently the following variables were retained. The ones used to produce the graphs include:
case_number, case_status employer_name, employer_state, pw_soc_code pw_soc_title job_info_education job_info_major, job_info_alt_field, job_info_experience, job_info_foreign_ed, job_info_job_req_normal, country_of_citizenship, recr_info_professional_occ, foreign_worker_info_education, pw_job_title_9089, ri_coll_tch_basic_process, job_info_foreign_lang_req
Read the data set setwd(“C://Users/rande/Documents/Data visualization/Spring2020”)
fileURL<-“C://Users/rande/Documents/Data visualization/Spring2020”
getwd() visaData <- read.csv(“uspermvisas.csv”) visaData ## R Markdown
This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.
When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:
r}
```{rvisaData <- read.csv("uspermvisas.csv")
visaData}
```
You can also embed plots, for example:
Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.