This report shows the distribution of the square footage of enclosed vs. unenclosed sidewalk cafes. It is based on the latest data released by NYC at the time the report was generated. This report was generated on Tue Feb 18 14:48:43 2025.

The data was split into two data frames containing the enclosed and unenclosed data using the following code:

library(dplyr)
## 
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union
enclosed <- filter(sidewalk, Sidewalk.Cafe.Type == 'Enclosed')
unenclosed <- filter(sidewalk, Sidewalk.Cafe.Type == 'Unenclosed')

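Note the use of the filter function from the dplyr package. It works on both ordinary data frames and tibbles, and for simple row selection like this it behaves much like base R's subset function: both keep only the rows where the condition is TRUE and drop rows where it is NA. The two are not identical, however; subset can also select columns through its select argument, while filter only selects rows.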
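To illustrate how filter and subset compare for this kind of row selection, here is a minimal sketch on a small made-up data frame. The data frame `toy` and its values are hypothetical; only the column names mirror those used in the report.

```r
library(dplyr)

# Hypothetical toy data; only the column names match the report's `sidewalk` data.
toy <- data.frame(
  Sidewalk.Cafe.Type = c("Enclosed", "Unenclosed", "Unenclosed", NA),
  Lic.Area.Sq.Ft     = c(300, 150, 220, 180)
)

# Select the unenclosed rows with dplyr::filter and with base::subset.
by_filter <- filter(toy, Sidewalk.Cafe.Type == "Unenclosed")
by_subset <- subset(toy, Sidewalk.Cafe.Type == "Unenclosed")

# Both keep only rows where the condition is TRUE, so the NA row is dropped.
nrow(by_filter)
nrow(by_subset)

# The selected values agree between the two approaches.
identical(by_filter$Lic.Area.Sq.Ft, by_subset$Lic.Area.Sq.Ft)
```

Either function would work for the split above; filter is used here because the rest of the dplyr toolkit is already loaded.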

The data was then plotted with this code:

boxplot(enclosed$Lic.Area.Sq.Ft, unenclosed$Lic.Area.Sq.Ft, 
        names=c("Enclosed","Unenclosed"), 
        main="NYC Sidewalk Cafe Size")
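
The numbers behind each box can also be inspected directly. The following is a sketch using base R's boxplot.stats on two made-up vectors; `enclosed_area` and `unenclosed_area` are hypothetical stand-ins for enclosed$Lic.Area.Sq.Ft and unenclosed$Lic.Area.Sq.Ft, not values from the NYC data.

```r
# Hypothetical values standing in for the two Lic.Area.Sq.Ft columns.
enclosed_area   <- c(120, 200, 250, 310, 400)
unenclosed_area <- c(80, 100, 150, 180, 900)

# boxplot.stats() returns the statistics that boxplot() uses to draw each box.
stats_enclosed   <- boxplot.stats(enclosed_area)
stats_unenclosed <- boxplot.stats(unenclosed_area)

# $stats: lower whisker, lower hinge, median, upper hinge, upper whisker.
stats_enclosed$stats
stats_unenclosed$stats

# $out: values beyond the whiskers, drawn as individual points on the plot.
stats_enclosed$out
stats_unenclosed$out
```

Looking at the out component is a quick way to check whether a few very large cafes are stretching the scale of the plot.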