Approach

My approach for Assignment 5A Air delays is to create the data frame based on the table in the pdf attachment as a CSV file. Upload that csv file then code then display it and perform count analyst. Checking for which airline and in which cities experienced the most delays and the most on times. After exploring more of the data I may also bring up any other interesting counts the I notice. Then transform the data from wide to long. The end I will show the percentage of on time and arrivals for each city split based on airlines.

Challenges

I don’t expected the recreation of the file from the table will be too challenging but I will know when I attempted to recreate the same results. Transforming the data from wide to long is new to me but I do not expect that to be too difficult either. From what I’ve read I just need to use the pivot_longer function in tidyr to transform it. The last thing I expect may be tricky is to accurate calculate and interpreted the percentage ratios of the airline flight arrival and delay time. I worry of not accurately speaking correctly about what the data says and incorrectly calculating what I am trying to prove. Trying to avoid this it is best to take time and go over what I have written and getting a seconded opinion.