The BARI Geographic Infrastructure is essentially just property records data along with aggregate versions of these data. BARI releases these at the property, parcel, Census block, Census blockgroup, and Census tract levels.
I wanted to explore some of the dynamics in the use of individual parcels in the property records. For this I’m going to start with the individual parcel-level dataset (Land.Parcels.2018.csv, posted on the BARI Dataverse here ####) rather than one of the geographic aggregated datasets.
# check out the data
names(parcel_data)
[1] "X.1" "Land_Parcel_ID" "full_address" "property_N" "unit_N" "unit_N_orig" "AV_LAND"
[8] "AV_BLDG" "AV_TOTAL" "LAND_SF" "GROSS_AREA" "LIVING_AREA" "LU" "OWN_OCC"
[15] "NUM_FLOORS" "YR_BUILT" "YR_REMOD" "R_BLDG_STYL" "owner_address" "comcenter" "medhos"
[22] "supermkt" "parking" "vacant" "rel" "lib" "bps" "police"
[29] "fire" "private" "X" "Y" "TLID" "Blk_ID_10" "BG_ID_10"
[36] "CT_ID_10" "ALAND10" "AWATER10" "POP100" "HU100" "Type" "Res"
[43] "CT_ID_10.y" "BRA_PD_ID" "BRA_PD" "City_Counc" "WARD" "ISD_Nbhd" "Police_Dis"
[50] "Fire_Distr" "PWD" "geometry"
Neighborhood dynamics
First I’m going to summarize the parcel data using one of the included geographic identifiers - Census Tracts.
parcel_tractsumm <- parcel_data %>%
group_by(CT_ID_10) %>%
summarize(total_area = sum(GROSS_AREA),
perc_parcels_parking = mean(parking,na.rm=T),
perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
summary(parcel_tractsumm$perc_area_parking)
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
0.000000 0.000000 0.000000 0.006793 0.000000 0.287028 1
So, most census tracts are low in their total percentage of parking parcels - with the caveat that these are only counted as parking if they are OFF-street parking and parking alone, rather than, for instance, a residential parcel with two parking spots.
#percentage plot for different types of permits
(ct_area_parking <- ggplot(parcel_tractsumm) +
geom_histogram(aes(x = perc_area_parking)) +
scale_x_continuous("Proportion of tract's area that is parking") +
theme_bw() +
theme(legend.position="none")
)

ggsave(plot=ct_area_parking,"figures/tracts_percarea_parking.pdf",height=7,width=10)
What about how these vary over neighborhoods that are larger in scale than Census Tracts? To look at this I brought in one of the aggregate versions of data that BARI provides with this dataset - the Tract-level data, (BostonTracts2010) - and connect the georgraphic units in that file with the parcel-level data. This enables me to easily summarize the parcel-level information about parking at various geographic levels. I did this by neighborhoods (as defined by the City’s Inspectional Services Department).
# bring in neighborhoods and other variables:
cts <- read_sf(paste0(BARIdrive_path,"Geographical Infrastructure/Data/Geographical Infrastructure 2018/Data/BostonTracts2010/"),"BostonTracts2010")
cts$GEOID10 <- as.numeric(cts$GEOID10)
parcel_data <- left_join(parcel_data,cts,by=c("CT_ID_10" = "GEOID10"))
parcel_nhsumm <- subset(parcel_data,!is.na(ISD_Nbhd)) %>%
group_by(ISD_Nbhd) %>%
summarize(total_area = sum(GROSS_AREA),
perc_parcels_parking = mean(parking,na.rm=T),
perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
parcel_nhsumm
This summarization tells us that the area dedicated to off-site parking parcels varies a lot according to neighborhood. Let’s visualize this:
(nh_area_parking <- ggplot(parcel_nhsumm) +
geom_bar(aes(x=ISD_Nbhd,y = perc_area_parking),stat="identity") +
scale_y_continuous("Percent of area that is parking",labels = paste0(seq(0,1,0.02)*100,"%"),breaks=seq(0,1,0.02)) +
scale_x_discrete("Neighborhood") +
theme_bw() +
theme(legend.position="none",axis.text.x = element_text(angle = 45, hjust = 1))
)

This shows that some of the smallest and densest neighborhoods (e.g. Bay Village and Chinatown) are most heavily dedicated to off-street parking. This is especially interesting considering their proximity to downtown and their proximity to mass transit stations.
ggsave(plot=nh_area_parking,"figures/nh_percarea_parking.pdf",height=7,width=10)
Temporal dynamics
Next, let’s look at how has the existence of parking has fluctuated over time. One way to check this is by seeing when a parcel was developed.
parcel_yearsumm <- parcel_data %>%
group_by(YR_BUILT) %>%
summarize(total_area = sum(GROSS_AREA),
perc_parcels_parking = mean(parking,na.rm=T),
perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
summary(parcel_yearsumm$perc_parcels_parking)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.000000 0.000000 0.000000 0.001259 0.000000 0.042630
summary(parcel_yearsumm$perc_area_parking)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.000000 0.000000 0.000000 0.008046 0.000000 0.430404
parcel_yearsumm_r <- parcel_data %>%
group_by(YR_REMOD) %>%
summarize(total_area = sum(GROSS_AREA),
perc_parcels_parking = mean(parking,na.rm=T),
perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
summary(parcel_yearsumm_r$perc_parcels_parking)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.000000 0.000000 0.000000 0.000455 0.000000 0.011040
summary(parcel_yearsumm_r$perc_area_parking)
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
0.000000 0.000000 0.000000 0.003394 0.000000 0.088492 1
Looks like up to 8% of the total area remodeled in a given year was parking, in some cases! Let’s visualize these patterns.
(yearsumm_parkingprop_plot <- ggplot(parcel_yearsumm_r) +
geom_smooth(aes(x = YR_REMOD, y = perc_area_parking),method="loess",span=0.2,se = F) +
scale_y_continuous('Proportion of total remodeled area in year') +
scale_x_continuous("Year remodeled",limits=c(1970,2018.5),breaks=seq(1900,2020,5)) +
theme_bw()
)
So, overall very little parking square footage is remodeled in most years, with many years having no remodeled parking, but in a few years it jumps up notably – for instance, in 2014.
ggsave(plot=yearsumm_parkingprop_plot,"figures/prop_areaparking_year.pdf",height=7,width=10)
Geographic dynamics
Now let’s see where these parking parcels exist in Boston. To do this we’re going to use the spatial files from BARI.
library(leaflet)
library(RColorBrewer) # color scales
library(classInt) # color scales
Loading required package: spData
To access larger datasets in this package, install the spDataLarge package with:
`install.packages('spDataLarge', repos='https://nowosad.github.io/drat/', type='source'))`
library(scales)
Attaching package: ‘scales’
The following object is masked from ‘package:purrr’:
discard
The following object is masked from ‘package:readr’:
col_factor
parcels_shp <- st_read(dsn=paste0(BARIdrive_path,"Geographical Infrastructure/Data/Geographical Infrastructure 2018/Data/BostonParcels2018"),layer="Parcels2018")
Reading layer `Parcels2018' from data source `/Users/justin/Google Drive/BARI Research Team Data Library/Geographical Infrastructure/Data/Geographical Infrastructure 2018/Data/BostonParcels2018' using driver `ESRI Shapefile'
Simple feature collection with 98930 features and 2 fields
geometry type: MULTIPOLYGON
dimension: XY
bbox: xmin: -71.19091 ymin: 42.22786 xmax: -70.86865 ymax: 42.39701
epsg (SRID): 4326
proj4string: +proj=longlat +datum=WGS84 +no_defs
Now that the data is combined with geographic shapefiles, let’s make some simple visuals
# add parking variable into shapefile:
parcels_shp$parking <- parcel_data$parking[match(parcels_shp$Ln_P_ID,parcel_data$Land_Parcel_ID)]
# pdf("figures/newcon_pp_ct_2017.pdf")
# par(mar=c(2,2,4,2))
plot(st_geometry(subset(parcels_shp,parking==1)),col="grey")

# title(main="Demographic change, pre-2015")
# dev.off()
Now lets make some better maps:
# b1 <- b0 +
# geom_polygon(data=parcels_shp2[which(parcels_shp2$parking==1)], aes(x=lon,y=lat), colour = "red", fill = "grey") +
# geom_polygon(data=parcels_shp2[which(parcels_shp2$parking==0),], aes(x=lon,y=lat), col="blue",fill=NA) +
# theme(legend.position = c(0.85,0.15),plot.margin = unit(c(0.1,0.1,0.1,0.1),"cm"))
(b1 <- b0 +
coord_sf(crs = st_crs(4326)) +
geom_sf(data=cts,col="darkgrey",lwd=0.1,inherit.aes=F) +
geom_sf(data=subset(parcels_shp,parking==1),col=vturq,inherit.aes=F)
)

This map spatially demonstrates what the charts showed: off-street parking (plotted in turquoise on this map) is incredibly concentrated (relative to total area) in the downtown areas of Boston.
ggsave(b1,filename = "figures/parking_parcels_gg.pdf",height=7,width=9)
boston_base_bg <- get_stamenmap(bbox=boston_bbox3, zoom=12, color="bw",
source="stamen",maptype = "toner-lite",crop=T)
b0_bg <- ggmap(boston_base_bg, extent="device",legend="none",maprange=T)
(b1_bg <- b0_bg +
coord_sf(crs = st_crs(4326)) +
geom_sf(data=subset(parcels_shp,parking==1),col=vturq,inherit.aes=F)
)

ggsave(b1_bg,filename = "figures/parking_parcels_lite.pdf",height=7,width=9)
On a policy note, these summaries point out one way in which transportation resources currently exist in the City. As Boston moves towards a more climate-friendly resilience plan, leveraging the area dedicated to cars to instead prioritize non-driving forms of transportation will be important. Currently, dedicating so much of the downtown neighborhoods to parking only serves to increase the number of cars there, which seems counterintuitive for traffic and urban planning more broadly. If the City of Boston wanted to decrease traffic in these central neighborhoods and decrease the number of cars entering such dense neighborhoods, dedicating less of the area in these neighborhoods to cars would be a good first step.
More broadly, these summaries and various visuals together demonstrate the use of BARI’s geographical infrastructure dataset more generally: easily being able to summarize information at various geographic levels across the City.
tm_shape(subset(parcels_shp,parking==1)) +
tm_fill(col = vturq,title = "Parking parcels")
Error in tm_shape(subset(parcels_shp, parking == 1)) :
could not find function "tm_shape"
---
title: 'Where people can pahk their cahr in Boston'
subtitle: 'A demonstration using the Geographical Infrastructure of the City of Boston'
output:
  html_notebook: default
  html_document: default
---

The BARI Geographic Infrastructure is essentially just property records data along with aggregate versions of these data. BARI releases these at the property, parcel, Census block, Census blockgroup, and Census tract levels.

I wanted to explore some of the dynamics in the use of individual parcels in the property records. For this I'm going to start with the individual parcel-level dataset (`Land.Parcels.2018.csv`, posted on the BARI Dataverse here ####) rather than one of the geographic aggregated datasets.


```{r message=FALSE, warning=FALSE}
library(tidyverse)
library(plotly)
library(sf)
# color definitions:
vpurple = '#440154'
vturq = '#21908C'

# importing the data ####
BARIdrive_path <- "~/Google Drive/BARI Research Team Data Library/"
parcel_data <- read.csv(paste0(BARIdrive_path,"Geographical Infrastructure/Data/Geographical Infrastructure 2018/Data/Land.Parcels.2018.csv"))

# check out the data
names(parcel_data) 
```

# Neighborhood dynamics

First I'm going to summarize the parcel data using one of the included geographic identifiers - Census Tracts.
```{r}
parcel_tractsumm <- parcel_data %>%
      group_by(CT_ID_10) %>%
      summarize(total_area = sum(GROSS_AREA),
      					perc_parcels_parking = mean(parking,na.rm=T),
      					perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)

summary(parcel_tractsumm$perc_area_parking)
```
So, most census tracts are low in their total percentage of parking parcels - with the caveat that these are only counted as parking if they are OFF-street parking and parking *alone*, rather than, for instance, a residential parcel with two parking spots.


```{r}
#percentage plot for different types of permits
(ct_area_parking <- ggplot(parcel_tractsumm) + 
    geom_histogram(aes(x = perc_area_parking)) + 
    scale_x_continuous("Proportion of tract's area that is parking") + 
      theme_bw() + 
       theme(legend.position="none")
 )
```

```{r}
ggsave(plot=ct_area_parking,"figures/tracts_percarea_parking.pdf",height=7,width=10)
```

What about how these vary over neighborhoods that are larger in scale than Census Tracts? To look at this I brought in one of the aggregate versions of data that BARI provides with this dataset - the Tract-level data, (`BostonTracts2010`) - and connect the georgraphic units in that file with the parcel-level data. This enables me to easily summarize the parcel-level information about parking at various geographic levels. I did this by neighborhoods (as defined by the City's Inspectional Services Department).

```{r}
# bring in neighborhoods and other variables:
cts <- read_sf(paste0(BARIdrive_path,"Geographical Infrastructure/Data/Geographical Infrastructure 2018/Data/BostonTracts2010/"),"BostonTracts2010")
cts$GEOID10 <- as.numeric(cts$GEOID10)
parcel_data <- left_join(parcel_data,cts,by=c("CT_ID_10" = "GEOID10"))

parcel_nhsumm <- subset(parcel_data,!is.na(ISD_Nbhd)) %>%
      group_by(ISD_Nbhd) %>%
      summarize(total_area = sum(GROSS_AREA),
      					perc_parcels_parking = mean(parking,na.rm=T),
      					perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
parcel_nhsumm
```
This summarization tells us that the area dedicated to off-site parking parcels varies a lot according to neighborhood. Let's visualize this:

```{r}
(nh_area_parking <- ggplot(parcel_nhsumm) + 
    geom_bar(aes(x=ISD_Nbhd,y = perc_area_parking),stat="identity") + 
    scale_y_continuous("Percent of area that is parking",labels = paste0(seq(0,1,0.02)*100,"%"),breaks=seq(0,1,0.02)) + 
 	scale_x_discrete("Neighborhood") + 
      theme_bw() + 
       theme(legend.position="none",axis.text.x = element_text(angle = 45, hjust = 1))
 )

```

This shows that some of the smallest and densest neighborhoods (e.g. Bay Village and Chinatown) are most heavily dedicated to off-street parking. This is especially interesting considering their proximity to downtown and their proximity to mass transit stations.

```{r}
ggsave(plot=nh_area_parking,"figures/nh_percarea_parking.pdf",height=7,width=10)
```

# Temporal dynamics
Next, let's look at how has the existence of parking has fluctuated over time. One way to check this is by seeing when a parcel was developed.

```{r}

parcel_yearsumm <- parcel_data %>%
      group_by(YR_BUILT) %>%
      summarize(total_area = sum(GROSS_AREA),
      					perc_parcels_parking = mean(parking,na.rm=T),
      					perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
summary(parcel_yearsumm$perc_parcels_parking)
summary(parcel_yearsumm$perc_area_parking)

parcel_yearsumm_r <- parcel_data %>%
      group_by(YR_REMOD) %>%
      summarize(total_area = sum(GROSS_AREA),
      					perc_parcels_parking = mean(parking,na.rm=T),
      					perc_area_parking = sum(GROSS_AREA[parking==1],na.rm=T)/total_area)
summary(parcel_yearsumm_r$perc_parcels_parking)
summary(parcel_yearsumm_r$perc_area_parking)
```
Looks like up to 8% of the total area remodeled in a given year was parking, in some cases! Let's visualize these patterns.

```{r, warning=F}
(yearsumm_parkingprop_plot <- ggplot(parcel_yearsumm_r) + 
    geom_smooth(aes(x = YR_REMOD, y = perc_area_parking),method="loess",span=0.2,se = F) + 
    scale_y_continuous('Proportion of total remodeled area in year') + 
       scale_x_continuous("Year remodeled",limits=c(1970,2018.5),breaks=seq(1900,2020,5)) + 
      theme_bw()
 )
```
So, overall very little parking square footage is remodeled in most years, with many years having no remodeled parking, but in a few years it jumps up notably -- for instance, in 2014.

```{r}
ggsave(plot=yearsumm_parkingprop_plot,"figures/prop_areaparking_year.pdf",height=7,width=10)
```

# Geographic dynamics
Now let's see where these parking parcels exist in Boston. To do this we're going to use the spatial files from BARI.
```{r}
library(leaflet)
library(RColorBrewer) # color scales
library(classInt) # color scales
library(scales)

parcels_shp <- st_read(dsn=paste0(BARIdrive_path,"Geographical Infrastructure/Data/Geographical Infrastructure 2018/Data/BostonParcels2018"),layer="Parcels2018")

```
Now that the data is combined with geographic shapefiles, let's make some simple visuals

```{r}
# add parking variable into shapefile:
parcels_shp$parking <- parcel_data$parking[match(parcels_shp$Ln_P_ID,parcel_data$Land_Parcel_ID)]


# pdf("figures/newcon_pp_ct_2017.pdf")
# par(mar=c(2,2,4,2))
plot(st_geometry(subset(parcels_shp,parking==1)),col="grey")
# title(main="Demographic change, pre-2015")
# dev.off()
```

Now lets make some better maps:

```{r message=FALSE, warning=FALSE, include=FALSE}
library(sp);library(maptools)
library(proj4)
library(rgdal)
library(ggmap)
library(classInt)


boston_bbox <- st_bbox(parcels_shp,f=0.1)
boston_bbox <- as.numeric(boston_bbox)
names(boston_bbox) <- c("left","bottom","right","top")
boston_bbox2 <- st_bbox(parcels_shp,f=0)
boston_bbox2 <- as.numeric(boston_bbox2)
names(boston_bbox2) <- c("left","bottom","right","top")

parcels_shp2 <- sf::as_Spatial(parcels_shp)
boston_bbox3 <- make_bbox(parcels_shp2@bbox[1,],parcels_shp2@bbox[2,],f=0.1)

boston_base <- get_stamenmap(bbox=boston_bbox3, zoom=12, color="color",
                       source = "stamen", maptype="toner", crop=T)

(b0 <- ggmap(boston_base, extent="device",legend="none",maprange=T))
```

```{r message=FALSE, warning=FALSE}
# b1 <- b0 + 
#  	geom_polygon(data=parcels_shp2[which(parcels_shp2$parking==1)], aes(x=lon,y=lat), colour = "red", fill = "grey") +
#  	geom_polygon(data=parcels_shp2[which(parcels_shp2$parking==0),], aes(x=lon,y=lat), col="blue",fill=NA) + 
#  	theme(legend.position = c(0.85,0.15),plot.margin = unit(c(0.1,0.1,0.1,0.1),"cm"))

(b1 <- b0 + 
  coord_sf(crs = st_crs(4326)) +
 	geom_sf(data=cts,col="darkgrey",lwd=0.1,inherit.aes=F) +
	geom_sf(data=subset(parcels_shp,parking==1),col=vturq,inherit.aes=F)
)
```
This map spatially demonstrates what the charts showed: off-street parking (plotted in turquoise on this map) is incredibly concentrated (relative to total area) in the downtown areas of Boston.


```{r message=FALSE, warning=FALSE}
ggsave(b1,filename = "figures/parking_parcels_gg.pdf",height=7,width=9)

boston_base_bg <- get_stamenmap(bbox=boston_bbox3, zoom=12, color="bw",
														source="stamen",maptype = "toner-lite",crop=T)

b0_bg <- ggmap(boston_base_bg, extent="device",legend="none",maprange=T)

(b1_bg <- b0_bg + 
  coord_sf(crs = st_crs(4326)) +
	geom_sf(data=subset(parcels_shp,parking==1),col=vturq,inherit.aes=F)
)
```

```{r}
ggsave(b1_bg,filename = "figures/parking_parcels_lite.pdf",height=7,width=9)
```


On a policy note, these summaries point out one way in which transportation resources currently exist in the City. As Boston moves towards a more climate-friendly resilience plan, leveraging the area dedicated to cars to instead prioritize non-driving forms of transportation will be important. Currently, dedicating so much of the downtown neighborhoods to parking only serves to *increase* the number of cars there, which seems counterintuitive for traffic and urban planning more broadly. If the City of Boston wanted to decrease traffic in these central neighborhoods and decrease the number of cars entering such dense neighborhoods, dedicating *less* of the area in these neighborhoods to cars would be a good first step.

More broadly, these summaries and various visuals together demonstrate the use of BARI's geographical infrastructure dataset more generally: easily being able to summarize information at various geographic levels across the City.

```{r}
# tm_shape(subset(parcels_shp,parking==1)) + 
#       tm_fill(col = vturq,title = "Parking parcels")

```