Table Of Contents

  1. Section 1: Introduction

    Section 1.1: COVID-19, A Brief History

    Section 1.2: Toll on Hospitals

    Section 1.3: How can the United States Increase Hospital Preparedness?

    Section 1.4: Supplemental Information

    Section 1.5: Previous Work

  2. Section 2: Exploratory Data Analysis

  3. Section 3: Methods and Results

    Section 3.1: USA Case Study

    Section 3.2: New Jersey Case Study

    Section 3.3: New York Case Study

    Section 3.3.1: New York City Case Study

    Section 3.4: Virginia Case Study

  4. Section 4: Conclusions

  5. Section 5: Limitations and Future Work

  6. Section 6: References

Section 1: Introduction

Section 1.1: COVID-19, A Brief History

Novel Coronavirus, also known as SARS-CoV-2 or COVID-19, is a global pandemic that has spread at an exponential rate since originating in Wuhan, China in late 20191. As of May 9th, 2020, there have been over 4 million confirmed cases and over 250,000 associated deaths2. Symptoms of the virus include fever, chills, tremors, myalgias, headaches, dysphagia, and dyspnea3. The virus yields a 7% fatality rate due to these symptoms eventually progressing to significant alveolar damage and eventual respiratory failure4. As of today, no antiviral treatments or preventative vaccines specifically targeting and protecting against COVID-19 exist5. As a result, both the confirmed case incidence and death rate of the virus continue to increase.

In the United States alone, the number of confirmed cases has eclipsed 1.3 million, and governments across the country have been implementing quarantine and lock down measures due to the acute threat since late March 20206. However, as time elapses, protests, led by disgruntled and frustrated citizens with the intent to coerce governments into reopening, have been increasing in frequency7. Due to this trend, several questions have arisen: Is the country ready to open again? Is social distancing even working? How long will social distancing last?

To answer these questions, it is vital to consider the incidence and prevalence of COVID-19 cases around the country. Although experts are pointing to certain locales reaching the asymptotical region of the growth curve in terms of daily cases, incidences continue to skyrocket in other locales. As a result, epidemiologists and infectious disease professionals continue to recommend contingency measures and social distancing.

Section 1.2: The Toll on Hospitals

One thing both the proponents and opponents of social distancing can agree on is the fact that hospitals around the country are still being bombarded by this high incidence of confirmed COVID-19 cases. Nurses, doctors, and social workers, among others, are working tirelessly to treat patients in critical condition, risking their own healthy status. This constant exposure, combined with consecutively working several hours at a time without a break, has taken its toll on medical professionals. Lorna M. Breen, previously a doctor in Manhattan, where the pandemic has been ruthless in its reign, committed suicide on April 29th, 2020, citing devastating scenes of the toll COVID-19 took on patients as well as her hectic schedule as reasons for her self-inflicted injuries that ultimately proved fatal8.

Additionally, hospitals around the country have been facing severe shortages of necessary protective equipment such as masks, gloves, and eye guards9. Medical equipment necessary for the treatment of the disease itself is also running out in certain hospitals, and automobile companies have even started to manufacture items such as respirators to lessen the shortages10. Despite this, equipment scarcity is a constant threat to the country’s ability to survive the pandemic with minimal damage until an antiviral treatment, preventative vaccine, or herd immunity is achieved.

Section 1.3: How can the United States Increase Hospital Preparedness?

The lack of medical supplies, citizens adhering to social distancing/quarantine, and availability of a treatment or vaccine simply exponentiates the virus’s already severe threat: Hospitals in the United States haven’t faced such a steady influx of patients since arguably the Spanish Flu pandemic of 191811. This begs the question: What region(s) of the country currently or will soon require the most emergent hospital aid due to the pandemic?

This question is important for governments to consider for several reasons. If an area is deemed more vulnerable to the pandemic, acute funding, medical equipment, and even healthcare professionals can be sent to lessen the toll. Furthermore, reallocating resources to more emergent areas can help decrease the skyrocketing incidence of COVID-19 as a whole, as domino effects can occur in which one region falls to the pandemic and exponentially spreads the virus to two or more other regions. Finally, answering this question can simply help raise the awareness of American citizens to what is occurring in areas of the country that they are not privy to. This can aid in the countrywide acceptance of vital contingency measures that can serve to bring an end to the current situation.

Section 1.4: Supplemental Information

The following is an attempt to answer the bolded question above using data visualization in the R language and analyzing locales based on the current total COVID-19 cases per staffed hospital bed. RStudio was utilized as the IDE for this study. The following libraries were also utilized:

The United States was used as the country of interest. First, the United States was analyzed by state to determine the most vulnerable states in terms of confirmed cases per staffed hospital bed. Within the United States, the states of New Jersey, New York, and Virginia were then analyzed by county to determine which counties in each of these states are most vulnerable using the same metric. New York City was also analyzed as proof-of-concept that this analysis can be extended to the narrowest locality and even be utilized to identify vulnerable boroughs of a metropolis.

Section 1.5: Previous Work

Since the current COVID-19 pandemic has only been existent since late 2019, there is relatively limited previous work on this topic in terms of region vulnerability predictions. However, plotting geospatial data using the maps and leaflet packages has been extensively performed with many applications, including population density, political voting layouts, and other census data. The methodology outlined below can be seen as an extension of this work, as it uses a distribution of staffed hospital beds, analogous to population density, as well as total COVID-19 cases, to generate a metric of vulnerability for each subregion within a certain locale.

Furthermore, data visualization simply based on total COVID-19 cases has been performed extensively as the pandemic has progressed. Institutions such as the Centers for Disease Control and Prevention as well as Johns Hopkins University have been updating their websites almost instantaneously for real-time visualization of case rate and death rate during the current pandemic.

Previous works similar to the methodology that will be outlined can be found listed below. Some of these works were used as inspiration for the methodology of this report:

Making Maps with R| Using Leaflet| Geocomputation with R| Johns Hopkins COVID data| Center for Disease Control and Prevention COVID-19 Visualization| RStudio and COVID-19| UVA COVID-19 Surveillance Dashboard| COVID-19 Epidimiology with R


Section 2: Exploratory Data Analysis

The variables that will be emphasized in this report primarily include the accumulated number of confirmed total COVID-19 cases by subregion, as of May 8th, 2020, as well as the quantity of staffed hospital beds within the subregion.

At the country level, the vulnerability is determined by states as the subregion in the case of the United States. It is important to qualitatively assess the distribution of staffed hospital beds by state when deeming a state as vulnerable to the pandemic. However, it is not feasible to map all of the hospitals in the entire country out by their coordinates, where this may be possible at the state level. Therefore, it is easier to focus on the region of the staffed hospital beds rather than the location:

##          NAME     StaffedBeds   
##  Alabama   : 1   Min.   :  835  
##  Alaska    : 1   1st Qu.: 3640  
##  Arizona   : 1   Median :10195  
##  Arkansas  : 1   Mean   :14351  
##  California: 1   3rd Qu.:16956  
##  Colorado  : 1   Max.   :74624  
##  (Other)   :46
## [1] 746275

The output above depicts summary statistics of the distribution of staffed beds across all of the states (subregions) in the United States. This will be a useful statistic in determining state vulnerability to the pandemic, as it will be the divisor to generate a metric of total COVID-19 cases as a function of hospital distribution in each state. The average number of hospital beds in each state was determined to be 14,351 staffed beds. The total number of staffed beds in the country is 746,275 beds.

## # A tibble: 6 x 2
##   state        Cases
##   <chr>        <dbl>
## 1 Alabama     207425
## 2 Alaska       12405
## 3 Arizona     219857
## 4 Arkansas     89442
## 5 California 1452020
## 6 Colorado    438960

Another country level summary statistic that is important to consider is the number of confirmed COVID-19 cases in each state. This will be the numerator, or primary factor, in assessing vulnerability of a subregion.

At the state level, the vulnerability is determined by county as the subregion. It is important to qualitatively assess the hospital distribution layout within a state, as this can generate initial expectations as to which regions should be the most vulnerable.

##                             Hospital   County StaffedBeds      lng     lat
## 1           Palisades Medical Center   Hudson         202 -74.0129 40.8009
## 2 AtlantiCare Regional Atlantic City Atlantic         540 -74.5421 39.4761
## 3             Bayonne Medical Center   Hudson         163 -74.1124 40.6682
## 4            Bayshore Medical Center Monmouth         169 -74.1917 40.4055
## 5   Bergen New Bridge Medical Center   Bergen        1020 -74.0629 40.9570
## 6       Cape Regional Medical Center Cape May         149 -74.8169 39.0875

The output above shows a sample of a data frame for the hospitals in New Jersey based on name, county, staffed beds, longitude, and latitude. This is helpful for later plotting the hospitals and assessing the staffed beds in each county within a state, or even borough within a metropolis:

## # A tibble: 6 x 2
##   County     StaffedBeds
##   <fct>            <dbl>
## 1 ATLANTIC           739
## 2 BERGEN            2942
## 3 BURLINGTON         569
## 4 CAMDEN            1984
## 5 CAPE MAY           149
## 6 CUMBERLAND         325

The output above depicts how to analyze the counties of New Jersey in terms of total staffed beds given the data frame used previously. The number of staffed beds per county will be what the vulnerability metric is determined with respect to (divisor).

In summary, vulnerability in the methodology outlined below will be assessed based on confirmed COVID-19 cases per staffed hospital bed in the subregions of interest. This will be the metric in determining which region(s) of the United States require the most emergent healthcare aid due to the pandemic.


Section 3: Methods and Results

Limitations and Future Work

Although the methodology outlined in this paper is effective, several limitations exist. Primarily, the hospitals utilized may not have been representative of the entire population of hospitals within the United States. This can be clearly seen in the Virginia case study, in which Buckingham County was found to be the most vulnerable due to the locale not possessing any healthcare centers. Similarly, this methodology would fail when applied to other states in which there are an extensive quantity of subregions which may or may not contain the metric required (in this case, hospitals). That being stated, only major hospitals were included in this study, and the outcomes may have been very different in the case that urgent care centers or small, privately owned hospitals were taken into account. Future research should take these types of smaller care centers into account, in addition to utilizing other types of care centers such as nursing homes and hospice centers which can also provide critical care during a pandemic. Another change to address this limitation is including transportation networks (roads, highways, etc.) and combining some subregions into larger regions based on distance. Taking population density into account and generating an underlying heat map to the data might also help to understand which counties can be combined to avoid the possibility of a subregion not possessing the metric of interest.

Another limitation includes the instantaneous data utilized rather than a continuous source over time since the COVID-19 outbreak in the United States first occurred. Since the total accumulation of confirmed cases was filtered to only include data of May 8th, 2020, there was no time factor taken into account. This can render the methodology inconsequential in the late stages of the pandemic, as the growth rate may be slowing down but the total amount of confirmed cases will always accumulate. Future research on the methodology should take into account the incidence rates of a recent period (e.g. one week or past five days) in the locality and incorporate this into the vulnerability rating. This would help in giving the metric a sense of urgency, and could possibly be performed by plotting the choropleth maps over time using a slider in the plotly or shiny packages.

Finally, an additional limitation of the methodology outlined above is the limited sample size of case studies. The COVID-19 pandemic is globally ravaging several countries, not just the United States. Therefore, it is important to consider the scope as the world rather than a lone country. Although this may not be entirely necessary due to the United States currently leading the world in total confirmed cases, it can help to prevent future outbreaks of the virus.

Future avenues of research additionally include extending the methodology to other contagious diseases currently devastating other regions of the world such as malaria, HIV-AIDS, Ebola, and OrthoHantavirus. Furthermore, memory can be implemented into this method to prevent future outbreaks from ravaging the same locales and not learning from history. This analysis can be extended to other applications for preventing risk such as the migration of an invasive species or identifying what locales are most in need of a certain technology.

Overall, however, the analysis should take into account more variables such as population density, transportation routes, distance of hospital from each county, and metropolitan areas.



References

  1. https://www.ncbi.nlm.nih.gov/books/NBK554776/
  2. https://ourworldindata.org/grapher/total-deaths-covid-19
  3. https://www.cdc.gov/coronavirus/2019-ncov/symptoms-testing/symptoms.html
  4. https://jamanetwork.com/journals/jamainternalmedicine/article-abstract/2763184
  5. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7117787/
  6. https://www.healthaffairs.org/doi/10.1377/hlthaff.2020.00455
  7. https://www.washingtonpost.com/politics/inside-the-conservative-networks-backing-anti-quarantine-protests/2020/04/22/da75c81e-83fe-11ea-a3eb-e9fc93160703_story.html
  8. https://www.nytimes.com/2020/04/27/nyregion/new-york-city-doctor-suicide-coronavirus.html
  9. https://www.nbcnews.com/news/uts-news/government-watchdog-hospitals-face-severe-shortages-medical-gear-confusing-guidance-n1177256
  10. https://www.cnbc.com/2020/05/06/coronavirus-ford-and-3m-begin-shipping-respirators-to-frontline-health-workers.html
  11. https://www.cdc.gov/flu/pandemic-resources/1918-pandemic-h1n1.html