Homework4 introduces us to new packages/ libraries. It also makes us understand different methods for creating data visualization using different datasets. Group has demonstarted plots such as: density plot, heat maps, alluvial plot, tree map stacked bar plot and donut chart.

In Task 1 we have generated density plot by pivoting the data set such that each delay type has different color and transformed x axis with a log 10 by using ggplot2. All the delay types are specified in a way that the data spread is clearly visualized; color codes are used for each of the delay types using fill. We found that delays due to security checks are frequent but short.

In Task 2, we have generated a correlation plot using corrplot.mixed() for the types of delays with the carriers. The plot is properly labelled without any overlapping. We found that,there is a very strong correlation between all kinds of delay except Security Delay.

In Task3, we have plotted a heat map which shows the difference between male and female according to their occupation and year. The negative value which is displayed in blue color represents number of female are more than male employees and red indicates more male than female.

In Task 4, we have generated alluvial chart to visualizes frequency distributions over time or frequency tables involving several categorical variables. Libraries like ggalluvial are used to plot population vs gender according to years from 2014- 2018.

In Task 5, we have plotted the stacked bar graph to show the average wage of male and female according to their occupation and average wage. The geom_bar() function is used to plot the bar chart.

In Task 6, we have plotted tree map to display hierarchical data as a set of nested rectangles. The group is represented by a rectangle, which area is proportional to the number of people working in that Detailed Occupation. In the above plot we can see the area of waiters and waitress is more indicating the domination of that particular occupation.

In Task 8, we created a Donut Chart using ggplot and filled it using hex colors.

In the Homework4 we have used ggplot() to visualize the data according to the required problem statement. Interpreted the data using density plot, heat maps, tree map and bar/ alluvial charts.