Cincinnati Tax Abatement Analysis

Author

Logan

Introduction

I’m going to be performing an analysis on Tax Abatement data for the city of Cincinnati, provided by the City of Cincinnati data portal. This data shows information for various properties in the City of Cincinnati.

 library(tidyverse) 
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.5
✔ forcats   1.0.0     ✔ stringr   1.5.1
✔ ggplot2   3.5.1     ✔ tibble    3.2.1
✔ lubridate 1.9.4     ✔ tidyr     1.3.1
✔ purrr     1.0.2     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
library(skimr)    
taxdata <- read_csv("https://myxavier-my.sharepoint.com/:x:/g/personal/scalesl_xavier_edu/EcTCPv86iudIgDCGBViy5uQBcmdWYT3XZ_cznoUMsyEPJg?download=1")
Rows: 4744 Columns: 17
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (10): CRA_ID, ADDRESS, APPLICATION_DATE, CONSTRUCTION_TYPE, LEED_CLASS, ...
dbl  (6): HCA_ABATED_VALUE, ABATEMENT_QUALIFYING_TERM, INCENTIVE_AMOUNT, LON...
lgl  (1): DCED_NEIGHBORHOOD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Question

Do better LEED ratings increase tax abatement? The LEED (Leadership in Energy and Environmental Design) rating is assigned to properties to asses their energy and environmental impact. The rating is given by either Silver, Gold, or Platinum depending on the different aspects of the property such as water, gas, electric, or HAVC system within the property. My hypothesis is that a better LEED rating would give the property owner a better Tax Abatement.

Method

Answering this question would involve the Tax Abatement data from the City of Cincinnati’s website. I would need to filter the (LEED_CLASS) variable to only show “Silver, Gold, and Platinum” instead of the other categories like “Not Certified and Certified”. Then, the filtered data would be piped into a ggplot with (LEED_CLASS) on the x-axis and (HCA_ABATED_VALUE) on the y-axis. The visualization used would be a boxplot or “geom_boxplot” to show the distribution of the different LEED ratings with the tax abatement they received. I will also use “scale_y_log10” because the data has outliers that pull the visual down and format to dollars.

Results

taxdata %>%
  filter(LEED_CLASS %in% c("SILVER", "GOLD", "PLATINUM")) %>%
  ggplot(aes(x = LEED_CLASS, y = HCA_ABATED_VALUE)) +
  geom_boxplot() +
  labs(title = "Abatement Values for Different LEED Ratings",
           x = "LEED Rating",
           y = "Abatement Value ($)")
Warning: Removed 14 rows containing non-finite outside the scale range
(`stat_boxplot()`).

Interpretation

The results provided evidence to support the idea that better LEED ratings do improve the abatement values given. The platinum rating provided the highest abatement values, with Gold being the lowest and Silver being the lowest.