This work will build on the work published in Skinner et al., 2023, which revealed that human footprint index (HFI) has the potential to predict local occurrence of multiple vector-borne diseases, in line with ecological theory. Due to data availability constraints, Skinner et al., 2023 used a machine learning approach, limiting the ability to identify causal relationships. Recently published updates to HFI data have dramatically extended the temporal coverage from just 2013 and 2019 to annual estimates from 2000–2020. This newly published data allows for a panel data regression approach that has the potential to reveal causal relationships between HFI and disease occurrence. We propose a first pass investigating HFI relationships with only dengue occurrence, with the potential to expand the analysis to other diseases. We chose dengue as our first disease of interest due to (1) its consistent presence in Brazil since 2000, (2) it was found to have the strongest relationship with HFI in Skinner et al., 2023 (Fig. 3) out of six diseases investigated, (3) its large incidence/occurrence values across a substantial range of Brazil, making it the most likely disease to power a panel regression analysis, and (4) it’s transmission is tightly associated with specific environments (urban) because of the biology of its primary vector Aedes aegypti, which breeds in urban settings and predominantly feeds on humans. Ideally, we hope that this analysis can reveal HFI tipping points that allow dengue to establish in areas that are undergoing increasing development and pressure in Brazil and beyond.
Skinner et al., 2023 identified HFI = 8 as the threshold value at which dengue reaches 50% of its maximum occurrence probability (Fig 4a). All municipalities that span the value of HFI = 8 are marked in blue. They equate to 323 of the total 5569 Brazilian municipalities (5.8%). The proportion of municipalities interacting with the threshold is comparable across threshold values of 8, 9, and 10.
Same plot as above, but subset to municipalities that span HFI = 8.
Preliminary model set up includes a simple panel regression model regressing threshold values against dengue occurrence with nine years of lags, unit (municipality) fixed effects, and yearly fixed effects. Standard errors clustered at the unit level.
Summary of various model options I have tried:
The general pattern coming out of all model specifications shows HFI having little, no, or negative impact on dengue occurrence until lags 5-9 where impact becomes positive, sometimes significantly. The pattern comes out most strongly when subsetting to just the municipalities that span the HFI threshold value. Surprisingly, demanding HFI staying above or below the threshold for 3 years to change between 1 and 0 produced the most insignificant results. A few examples are included below to show range of possible results depending on modeling choices (changes from prior example bolded in description). Next steps listed below.
All municipalities, threshold dummy 1 or 0 depending on being above or below 8, occurrence outcome 1 when above 10, 0 when below (remains 1 for rest of sample).
Only municipalities that span HFI threshold, threshold dummy 1 or 0 depending on being above or below 8, occurrence outcome 1 when above 10, 0 when below (remains 1 for rest of sample).
Only municipalities that span HFI threshold, threshold dummy 1 or 0 depending on being above or below 8 for at least three years, occurrence outcome 1 when above 10, 0 when below (remains 1 for rest of sample).
Only municipalities that span HFI threshold, threshold dummy 1 or 0 depending on being above or below 8, occurrence outcome 1 when incidence above 300 per 100k, 0 when below (remains 1 for rest of sample).
Only municipalities that span HFI threshold, HFI median (continuous), occurrence outcome 1 when above 10, 0 when below (remains 1 for rest of sample).