This is a report showing the distribution of the square footage of enclosed vs. unenclosed sidewalk cafes. It is based on the latest release of data by NYC at the time of the report generation. This report was generated on rdate().

The data was split into two data frames containing the enclosed and unenclosed data using the following code:

library(dplyr)
enclosed <- filter(sidewalk, sidewalk$Sidewalk.Cafe.Type == 'Enclosed')
unenclosed <- filter (sidewalk, sidewalk$Sidewalk.Cafe.Type == 'Unenclosed')

Note, the use of the filter function from the dplyr package. It works on tibbles and is identical to the subset function.

The data was then plotted with this code:

boxplot(enclosed$Lic.Area.Sq.Ft, unenclosed$Lic.Area.Sq.Ft, names=c("Enclosed","Unenclosed"),main = "NYC Sidewalk Cafe Size")