Warning: Removed 37 rows containing non-finite outside the scale range
(`stat_smooth()`).
Warning: Removed 37 rows containing missing values or values outside the scale range
(`geom_point()`).
The fifth graph is a histogram showing the distribution of daily air temperatures. Whereas the prior graphs visualize relationships between ozone levels and the other variables in the data, this graph shows the frequency of temperature values only. The x-axis represents air temperature in degrees Fahrenheit, and the y-axis represents the frequency (number of days) in each temperature range.
This plot suggests that a large share of the temperatures fall in the mid-70s to mid-80s range, so the data mostly represent warm summer days. Few values lie at either extreme, so very cool and very hot days are relatively rare.
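As a rough numeric check on that reading, the share of days in the mid-70s to mid-80s can be computed directly. This is only a sketch: it assumes the data are R's built-in `airquality` dataset with temperatures in the `Temp` column, and the cutoffs of 75 and 85 are my own illustrative choice for "mid-70s to mid-80s".

```r
# Assumption: the report uses R's built-in `airquality` data frame,
# where `Temp` is the daily temperature in degrees Fahrenheit.
temps <- airquality$Temp

# Illustrative cutoffs for the "mid-70s to mid-80s" band (assumed, not from the report).
in_band <- temps >= 75 & temps <= 85

# Proportion of days whose temperature falls inside the band.
share_in_band <- mean(in_band)
share_in_band

# Five-number summary plus mean, to see how far the extremes reach.
summary(temps)
```

A value of `share_in_band` well above the shares in the tails would support the description of mostly warm summer days.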
For this plot, I used the ggplot() function with geom_histogram(). The x aesthetic was mapped to the temperature variable, and the number of bins was set to 15 to produce a clearly visible distribution and avoid excessive smoothing. Labels and a caption were added with the labs() function to provide context and the source of the data. This plot helps to summarize the dataset, providing context for relating temperature to air quality.
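The steps described above can be sketched roughly as follows. This is a minimal reconstruction, not the exact code behind the figure: it assumes the built-in `airquality` dataset and its `Temp` column, and the title, axis labels, caption text, and fill colors are placeholders I chose for illustration.

```r
library(ggplot2)

# Assumption: data come from R's built-in `airquality` dataset;
# `Temp` holds daily temperatures in degrees Fahrenheit.
temp_hist <- ggplot(airquality, aes(x = Temp)) +
  # 15 bins, as described in the text; colors are illustrative only.
  geom_histogram(bins = 15, fill = "steelblue", color = "white") +
  # Label text below is a placeholder, not the report's actual wording.
  labs(
    title = "Distribution of Daily Air Temperatures",
    x = "Temperature (degrees F)",
    y = "Frequency",
    caption = "Source: airquality dataset (assumed)"
  )

# Printing the object draws the histogram.
temp_hist
```

Setting `bins` explicitly avoids ggplot2's default of 30 bins, which can make a sample of this size look noisy. Using fewer bins would smooth the shape further.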