Herein:


Introduction

{previous} {toc} {next}

Housing code enforcement, a Montgomery County official told me, is a balancing act. What level of bad conditions do you accept to keep people from becoming homeless? The data I have seen do not yet clearly answer this question.

The County Code mandates that the Department of Housing and Community Affairs (DHCA) inspect rental properties yearly for compliance with regulations. Inspections are also performed in response to complaints submitted to the County’s 311 system and routed to DHCA. DHCA inspectors (currently 19) arrange for and carry out inspections, document infractions, serve notice to property owners, and, where owners fail to correct infractions, issue civil citations to appear in court. Additionally, they update databases that track infractions.

In 2016 the County Council passed Bill 19-15 that made significant changes to the code affecting inspections and enforcement of the regulations for rental housing (press release):

This project seeks to model the factors that influence the severity of housing code violations and possibly see how the inspection process could be optimized.

Data Sets

{previous} {toc} {next}

The Maryland Property Data - Parcel Points, filtered for Montgomery County

This describes all 344K properties in Montgomery County. In addition to property address, it contains key data including geographic location and census block groups that allow it to be joined to both County and U.S. Census data sets. It has interesting items such as property owner; number of dwelling units; and dates of last inspection, sale, and construction. Indicators of owner occupancy and residential property allow filtering of rental property.

Data Montgomery Housing Code Violations

This contains 470K housing code violations dating from 2013. It serves as the primary data source for this project. Although it catalogs only violations, the case number id provides an estimate of total cases including those without infractions. It describes the items in violation, the code citations, corrective actions required and the 311 service complaint. It provides dates of filing, assignment, inspection, correction and closure.

Data Montgomery Troubled Property Analysis

This apparently identifies the properties currently subject to more stringent inspections and corrective action plans. Its case number links back to the Housing Code Violation data set and may be useful to extract other cases with severe violations.

U.S. Census 2018 American Community Survey (ACS) extract for Montgomery County Block Groups

This data set provides estimates of the number of units in owner and renter occupied properties in each census block in Montgomery County. The estimates are based on data sampled between 2014 and 2018.

Other Data Montgomery data sets and text sources to be used in the project include:

Preliminary Figures and Questions

{previous} {toc} {next}

Rental Properties flagged as Residential in 2010, not owner occupied.

from the Maryland Parcel Points Data Set

totalProperties totalUnits minUnits q25 meanUnits q75 maxUnits
57818 175529 1 1 3.036 1 1121

Housing Code Inspections, 311 Service Requests, Violations Cited

from joining Housing Code Violations and 311 Service Requests

year 311referrals totalCases citedCases from311Request violationsCited
2013 8924 7392 4131 3249 31283
2014 10646 7532 4428 3913 24468
2015 10743 7251 3848 3960 28173
2016 11787 6887 3515 3916 40436
2017 10093 6771 3898 4141 52645
2018 11917 7575 4481 5151 75886
2019 11476 6899 3735 4147 216508

Questions

Methods: Cleaning, Analysis, Visualization

{previous} {toc}

Nearly all of the work on this project will be done in R in the Rstudio development and markdown environment. Given the variable download performance of the data sources, once obtained, I will upload the csv data sets to my publicly accessible GitHub site. This will be the source from which the R code will get its data for further cleaning and presentation.

Cleaning: join violations to parcel points, map the results to census block groups

I will clean the data sets primarily using libraries from tidyverse and related R packages. Additionally, I will use sf, the Simple Features geospatial library, to manipulate locations and place entries into census block groups.

Some aspects of cleaning will include:

  • Purging entries from Housing Code Violations and Parcel Points that do not involve rental properties,
  • Depending on volume, using either Google Maps or the U.S. Census TIGER LINE data set to place entries in their correct locations,
  • Matching entries in Housing Code Violations and Parcel Points, first by location, and then, using fuzzy matching by text address.


Analysis: what features explain how a rental property tips into or out of trouble?

Potential methods include:

  • Principal Component Analysis to group features into independent components that move with the severities of conditions of properties,
  • K-means or other clustering techniques to show how components move together,
  • Logistic regression to model how likely, based on weighted components, conditions of properties are to change.


Visualizations: ggplot2 and add-ons

  • Violin plots to show the occurrence and distribution of features,
  • Geospatial plots of census block group weaving properties, characteristics, severity of violations, with features obtained from the 2018 ACS.
  • K-means or other cluster plots.