This is a report showing distribution of sq footage of enclosed vs. unenclosed sidewalk cafes. It is based on the latest release of data by NYC at the time of report generation. This report was generated on Wed Sep 10 20:01:30 2025.

A code chunk highlighted in RStudio. Now we have to process the data, specifically we want to split the sidewalk café data set into two sets, representing the enclosed and unenclosed data.

The data was split into two data frames containing hte enclosed and unenclosed data using the following code:

library(dplyr)
enclosed <- filter(sidewalk, sidewalk$Sidewalk.Cafe.Type == 'Enclosed')
unenclosed <- filter(sidewalk, sidewalk$Sidewalk.Cafe.Type == 'Unenclosed')

Note, the use of filter function from the dplyr package. It works on tibbles and is identifcal to the subset function.

The data was then plotted with this code:

boxplot(enclosed$Lic.Area.Sq.Ft, unenclosed$Lic.Area.Sq.Ft, names=c("Enclosed", "Unenclosed"), main="NYC Sidewalk Cafe Size")