Please indicate
Plot a “time series” of the proportion of flights that were delayed by > 30 minutes on each day. i.e.
Using this plot, indicate describe the seasonality of when delays over 30 minutes tend to occur.
Interpretation The delays per day show a seasonal increase around the summer months and the winter holidays. Looking at the trend line, it is clear that there is an increase in delays greater than 30 minutes at these two periods.
Some people prefer flying on older planes. Even though they aren’t as nice, they tend to have more room. Which airlines should these people favor?
Interpretation The two carriers with the oldest set of airplanes is MQ and AA, which are American Eagle and American Airlines. Though MQ has the oldest planes, there are only two physical planes in the data set. That makes American Airlines the oldest carrier with a substantial number of planes.
For example, Southwest Airlines Flight 60 to Dallas consists of a single flight path, but since it flew 299 times in 2013, it would be counted as 299 flights.
Interpretation For both flights and flight paths, there were the most flights within Texas for flights to and from Houston. The majority of the remaining flights and flightpaths are within the south, like Florida, Louisiana and Oklahoma. California and Illinois also has a fair amount of flights to it. I also removed cancelled flights from my analysis since those cancelled flights were never completed.
I want to know proportionately what regions (NE, south, west, midwest) each carrier flies to/from Houston in the month of July. Consider the month() function from the lubridate package.
Interpretation The majority of flights from Texas remain within the south. As we saw earlier, the majority of flights are just within Texas to begin with. For example, though American Airlines is an international carrier, its largest hub is in Dallas, so that is likely why American Airlines only flies within the south from Houston to Dallas.