Pork Barrel Politics of The Towns Fund Model Extension

Study Preregistration form: [https://rpubs.com/DDowd1/Preregistraion_Form]

Information about this replication project

YAML settings

output:
  html_document:
    code_download: true
    toc: true
    toc_depth: 2
    toc_float:
      collapsed: false
      smooth_scroll: true

Global settings of R chunks

# Global options (namespaced so the chunk runs even before knitr is attached)
knitr::opts_chunk$set(echo = TRUE,
                      cache = TRUE,
                      comment = NA,
                      message = FALSE,
                      warning = FALSE)

Libraries

library(rmarkdown)
library(knitr)
library(tidyverse)
library(lme4)
library(sjPlot)

Versions of used packages

$rmarkdown
[1] '2.21'

$knitr
[1] '1.42'

My environment

[1] "R version 4.2.1 (2022-06-23 ucrt)"

1. Introduction

This paper extends Table 1, Model 4 of the paper “The Pork Barrel Politics of the Towns Fund” (Hanretty 2021). The model assesses data on the Towns Fund created by the Conservative Government in 2019. It found that towns located in parliamentary seats which the Conservatives narrowly won or lost were more likely to receive funding. This led Hanretty to speculate that the Conservatives directed funding towards these towns to gain an advantage in future electoral cycles, a process known as pork barrel politics. These findings matter because spending on local government takes up almost a quarter of public spending, so even small adjustments to what local authorities receive can have a large effect on the services they can provide to citizens (Ward, 1999). Regional spending like the Towns Fund often comes in the form of regional policies which address social and economic distress in areas where the government of the day has decided that inequality has grown too high (Hoare, 1985).

An issue with Hanretty’s original model is that it fails to account for the strong spatial effect associated with pork barrel politics (Hoare, 1983). Pork barrel politics is defined as the process of politicians creating geographically targeted local projects for their own constituencies (Lancaster 1986). This gives constituents an incentive to re-elect their representatives, as they receive direct rewards for their support. Since pork is not a phenomenon that appears uniformly across a whole population, spatial elements should be included in the analysis to see how effects vary across geographic areas. Milligan and Smart (2005) found evidence of a relationship between vote margins and the odds of receiving regional grants, using data from the Canadian government. Unlike Hanretty, they performed their analysis at the local level rather than across the whole population. The analysis for this extension will do the same, using the Region variable found in the Hanretty data as a level 2 variable. It is expected that a model which measures effects at the local level will fit the data better than a single level model that does not account for spatial differences.

There are four different methods one can use to check the reproducibility of scientific results (Freese and Peterson 2017). The original replication check, performed on Hanretty’s model as part of this project, used the verification method, which applies the same data and the same methods as the original to see if the results are consistent. That replication did not repeat the results, as Hanretty did not make the code for the model available. This paper uses a different method of reproducibility checking called robustness (Freese and Peterson 2017), which applies an alternative method of analysis to the original data to see if the results hold. In his paper, Hanretty (2021) performed sensitivity analysis on his model and claims that for his results to be incorrect there would have to be a lurking variable with thirty times the association found between the rank and success variables. This suggests that an extended version of the model should yield results similar to the original.

2. Data and methods

2.1. Data

The data for the original model were supplied by Hanretty in the Harvard Dataverse replication package. The original sources of the data could not be supplied, as the supplier did not give permission for the resources to be redistributed. The RDS file supplied by Hanretty is the only available source for this data.

dat <- readRDS("Data/selection_data.rds")

Since the data set supplied by Hanretty is not a raw version of the data, very little data transformation was needed for this test. However, two transformations were still performed to create the model.

  1. A numerical dummy variable was made from a categorical variable in the data set. This is the dependent variable in the model, measuring the success of towns in receiving funding from the government.
dat$outcome <- as.numeric(dat$Funded == "Yes")
  2. The categorical variable used in the model is then made. This is the independent variable of interest. It is a five-band scale of the Conservative margin, ranging from seats where the Conservatives trailed the winning party by more than 5% of the vote to seats they won by more than 10% of the vote.
# Drop towns with no recorded Conservative margin
dat2 <- subset(dat, !is.na(ConMaj.allm))

# Band the margin, starting from the filtered data so the NA filter is kept
dat2 <- dat2 %>%
  mutate(ConMaj.allm.categorical =
           # lowest band: every margin below -5
           case_when(ConMaj.allm < -5 ~ "-10% to -5%",
                     ConMaj.allm > -5 & ConMaj.allm < 0 ~ "-5 to 0",
                     ConMaj.allm > 0 & ConMaj.allm < 5 ~ " 0 to 5",
                     ConMaj.allm > 5 & ConMaj.allm < 10 ~ "5 to 10",
                     ConMaj.allm > 10 ~ "greater than 10"))
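The case_when() call uses strict inequalities on both sides of every band, so a margin of exactly -5, 0, 5 or 10 matches no condition and becomes NA. A minimal sketch of one alternative, using made-up margins rather than the Hanretty data: base R's cut() with right-closed intervals assigns every value to a band. The band labels here are illustrative assumptions, and adopting this banding could change the model estimates reported below.

```r
# Hypothetical margins (percentage points), including the exact boundary values
margin <- c(-12, -5, -2.5, 0, 3, 5, 7, 10, 25)

# Right-closed bands: (-Inf,-5], (-5,0], (0,5], (5,10], (10,Inf)
band <- cut(margin,
            breaks = c(-Inf, -5, 0, 5, 10, Inf),
            labels = c("less than -5", "-5 to 0", "0 to 5",
                       "5 to 10", "greater than 10"))

# Every margin now falls in a band, so no NA category appears
table(band, useNA = "ifany")
```

Because cut() returns a factor with the levels in the order given, the lowest band would also be the reference category in a regression without any further releveling.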

2.2 Exploratory Analysis

Figure 1 shows the number of towns located in each region of England. The colours of the bars indicate how many of those towns are located in Conservative-held seats. The chart indicates that the Conservatives win a much lower proportion of the seats in the North of England than they do in the South and Midlands.

# Bar chart showing the number of towns located in each region
ggplot(data = dat2) +
  aes(x = Region,
      fill = ConWinner1.allm) +
  geom_bar() +
  labs(title = "How many Towns are in each Region?",
       x = "Region of England",
       y = "Number of Towns",
       fill = "Is this town located in a conservative seat?") +
  theme(legend.position = "top") +
  # Colours are assigned in level order: red3 for FALSE, blue4 for TRUE
  scale_fill_manual(values = c("red3", "blue4"))

Figure 2 shows how the success of the Tory party in a town’s parliamentary seat relates to its likelihood of receiving funding from the Towns Fund. The relationship varies widely by region. In Northern regions, a larger Tory margin increases the chance of receiving funding. In Southern regions the effect is the opposite, with towns located in seats which voted strongly for the Tories being less likely to receive funding.

# Relationship between Conservative margins and odds of success
ggplot(data = dat2) +
  aes(x = ConMaj.allm,
      y = outcome,
      colour = Region) +
  geom_point() +
  # glm with its default gaussian family draws one straight line per region;
  # se = FALSE hides the confidence band
  geom_smooth(method = "glm",
              se = FALSE) +
  scale_colour_manual(values = c("East Midlands" = "mediumpurple4",
                                 "East of England" = "mediumpurple2",
                                 "North East" = "firebrick4",
                                 "North West" = "red3",
                                 "South East" = "steelblue1",
                                 "South West" = "dodgerblue4",
                                 "West Midlands" = "blueviolet",
                                 "Yorkshire and the Humber" = "firebrick1")) +
  theme_bw()
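The smoother in Figure 2 calls glm without a family argument, so each regional line is a straight linear fit to a 0/1 outcome rather than a logistic curve. A minimal sketch of the difference, using simulated data (the margins, coefficients and sample size below are invented for illustration and are not taken from the Hanretty data):

```r
set.seed(1)

# Simulated margins and funding outcomes; not the Hanretty data
margin <- runif(200, min = -40, max = 40)
funded <- rbinom(200, size = 1, prob = plogis(-1 + 0.05 * margin))

# Same formula, two families
linear   <- glm(funded ~ margin)                     # gaussian family (the default)
logistic <- glm(funded ~ margin, family = binomial)  # logistic regression

# Linear fitted values are not constrained to the [0, 1] interval
range(fitted(linear))

# Logistic fitted values are probabilities, strictly between 0 and 1
range(fitted(logistic))
```

In ggplot2 the logistic version of the smoother can be requested with geom_smooth(method = "glm", method.args = list(family = binomial)).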

2.3. Method for final analysis

Hanretty performed the original analysis using a single level regression model. A categorical variable capturing the gap between the Tories and their main rival for the seat was used to show that Conservative marginal seats were targeted by the Tories in their Towns Fund scheme to boost support in future electoral cycles. The issue with this model is that it fails to show how this effect can vary at the regional level. The theory of pork barrel politics is inherently geographic, as it concerns the redirection of government money to specific areas in which the government believes that spending will benefit its political goals (Lancaster 1986). Therefore, it makes sense to let the analysis vary by region, as spending will yield greater political gains in some regions than it would in others. In this extension, a random slopes model will be created using Region as the level two variable. The slopes will vary by the size of the Conservative vote compared to its biggest rival for the seat. The other control variables found in the original model will also feature in the extension as level one controls. The results of the replicated original model and the extension model will then be compared. The size and significance of effects will be compared to see if the change in design yields a significant change in results. A BIC comparison will also be used to show which model fits the data set best.
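For reference, the random slopes model described above can be written out as follows. This is a sketch inferred from the model description, not notation taken from Hanretty (2021); all symbols are introduced here for illustration.

```latex
\begin{aligned}
\operatorname{logit}\Pr(y_{ij} = 1)
  &= \beta_0 + \sum_{k=1}^{4} \beta_k D_{kij}
     + \boldsymbol{\gamma}^{\top}\mathbf{x}_{ij}
     + u_{0j} + \sum_{k=1}^{4} u_{kj} D_{kij}, \\
(u_{0j}, u_{1j}, \dots, u_{4j})^{\top}
  &\sim \mathcal{N}(\mathbf{0}, \boldsymbol{\Sigma}).
\end{aligned}
```

Here y_ij equals 1 if town i in region j received funding, D_kij indicates the k-th non-reference band of the Conservative margin, x_ij holds the level one controls, u_0j is the random intercept for region j, and u_kj is the regional deviation from the average effect of band k. Setting every u to zero gives a single level logistic regression.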

3. Results

The random effects introduced in the extension model create no significant change in the effects found by the original. The odds ratios of the categorical variable are slightly lower than in the original model, but the change is not large enough to alter which levels are statistically significant. Both models find that towns in constituencies where the Conservatives won by 5 to 10% of the vote over their closest rival had the highest odds of receiving funding (odds ratios of 15.79 in the original model and 11.73 in the extension). Being in a marginal seat still had a significant effect on the odds of receiving funding. In the extension model, relative to towns in seats the Conservatives lost by more than 5%, towns in seats the Conservatives had marginally lost had 6.71 times the odds of receiving funding, and towns in seats they had marginally won had 4.03 times the odds. Being located in a seat with a Conservative majority greater than 10% did not have a significant effect on the odds of funding.

The Region variable explained little of the variance in the data. The variance of the regional random intercept was only 0.06, indicating that Region had no substantial effect when used as the level two grouping variable. The random slope variances for the categorical variable were mixed: the 0 to 5% and greater than 10% levels (0.31 and 0.32) varied much more across regions than the other two levels (0.16 and 0.13).

The effects of the control variables did not change in a statistically meaningful way from the original model, apart from the Qualification and Income Dependency variables. The Qualification variable, with its small odds ratio of 0.27, sat on the threshold of significance in the original model (p = 0.050); once the level 2 region effects were introduced its odds ratio was 0.30 and it fell just short of significance (p = 0.055). The odds ratio of Income Dependency grew from 7.03 to 10.18, the second largest effect found in the extension model. This indicates that spending could have been targeted towards towns with higher levels of income inequality. This should be an expected outcome, as schemes like the Towns Fund are a method by which the government of the day can address regional inequalities (Hoare, 1985).
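An odds ratio multiplies the odds rather than the probability, so how far a ratio such as 6.71 or 4.03 moves the chance of funding depends on the starting point. A minimal sketch of the conversion, assuming hypothetical baseline probabilities that are not estimated from the data; the function name is introduced here for illustration:

```r
# Convert a baseline probability and an odds ratio into the implied probability
apply_odds_ratio <- function(p_baseline, odds_ratio) {
  odds <- p_baseline / (1 - p_baseline) * odds_ratio
  odds / (1 + odds)
}

# Hypothetical baseline funding probabilities (illustrative only)
baseline <- c(0.05, 0.10, 0.30)

# Implied probabilities under the two marginal-seat odds ratios
# reported for the extension model
data.frame(baseline = baseline,
           or_6.71  = apply_odds_ratio(baseline, 6.71),
           or_4.03  = apply_odds_ratio(baseline, 4.03))
```

With a 10% baseline, for example, an odds ratio of 6.71 implies a probability of about 43%, which is roughly 4.3 times the baseline rather than 6.71 times.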

# Original model replication: single level logistic regression
Original <-
  glm(outcome ~ ConMaj.allm.categorical +
        log(Pop) +
        Rank +
        ConWinner1.allm +
        Region +
        Quals +
        IncomeDep +
        Brexit +
        Productivity +
        Shocks +
        Investment +
        Alignment,
      family = binomial,
      data = dat2)

# Extension model: random intercepts and random slopes for the
# Conservative majority bands, grouped by Region
Extension <-
  glmer(outcome ~ ConMaj.allm.categorical +
          log(Pop) +
          Rank +
          ConWinner1.allm +
          Quals +
          IncomeDep +
          Brexit +
          Productivity +
          Shocks +
          Investment +
          Alignment +
          (1 + ConMaj.allm.categorical | Region),
        family = binomial,
        data = dat2)

# Odds ratios for both models side by side
tab_model(Original, Extension)
                                           outcome (Original)                    outcome (Extension)
Predictors                                 Odds Ratios  CI              p        Odds Ratios  CI              p
(Intercept)                                0.00         0.00 – 0.03     <0.001   0.00         0.00 – 0.01     <0.001
ConMaj.allm.categorical [-5 to 0]          7.71         2.04 – 28.40    0.002    6.71         1.52 – 29.55    0.012
ConMaj.allm.categorical [0 to 5]           5.88         1.58 – 22.67    0.009    4.03         1.04 – 15.60    0.044
ConMaj.allm.categorical [5 to 10]          15.79        3.79 – 70.75    <0.001   11.73        2.75 – 50.09    0.001
ConMaj.allm.categorical [greater than 10]  1.81         0.57 – 5.95     0.317    1.35         0.44 – 4.14     0.603
Pop [log]                                  2.96         2.01 – 4.51     <0.001   2.88         1.94 – 4.28     <0.001
Rank                                       0.98         0.95 – 1.00     0.026    0.98         0.96 – 1.00     0.032
ConWinner1.allm [TRUE]                     1.60         0.59 – 4.33     0.352    1.86         0.71 – 4.87     0.207
Region [East of England]                   0.82         0.18 – 3.69     0.799
Region [North East]                        0.58         0.13 – 2.41     0.463
Region [North West]                        0.75         0.22 – 2.56     0.648
Region [South East]                        0.45         0.12 – 1.56     0.210
Region [South West]                        0.44         0.13 – 1.53     0.202
Region [West Midlands]                     0.81         0.27 – 2.45     0.713
Region [Yorkshire and the Humber]          1.95         0.61 – 6.34     0.262
Quals                                      0.27         0.07 – 0.98     0.050    0.30         0.09 – 1.03     0.055
IncomeDep                                  7.03         1.33 – 38.81    0.023    10.18        2.23 – 46.44    0.003
Brexit                                     1.11         0.32 – 3.81     0.868    1.18         0.38 – 3.67     0.779
Productivity                               1.62         0.43 – 6.16     0.475    1.91         0.64 – 5.76     0.248
Shocks                                     1.87         0.61 – 5.62     0.269    1.82         0.66 – 5.05     0.247
Investment                                 0.37         0.01 – 10.81    0.586    0.44         0.01 – 15.43    0.653
Alignment                                  6.90         0.24 – 428.59   0.291    7.24         0.21 – 248.58   0.272

Random Effects
σ2                                                                               3.29
τ00 Region                                                                       0.06
τ11 Region.ConMaj.allm.categorical-5 to 0                                        0.16
τ11 Region.ConMaj.allm.categorical 0 to 5                                        0.31
τ11 Region.ConMaj.allm.categorical5 to 10                                        0.13
τ11 Region.ConMaj.allm.categoricalgreater than 10                                0.32
ρ01                                                                              -1.00, 1.00, -1.00, -1.00
ICC                                                                              0.03
N Region                                                                         8

Observations                               539                                   539
R2 Tjur                                    0.354                                 0.494 / 0.512

The BIC comparison shows that the replicated original model fits the data better than the extension model created for this project: the original has the lower BIC (475.63 against 532.03). This makes it unlikely that the regional effects used in the extension have a significant effect on whether or not towns received funding from the Government. It follows that the effect of the Conservative majority categorical variable on the dependent variable does not differ significantly across regions.

BIC(Original,Extension)
          df      BIC
Original  22 475.6313
Extension 30 532.0290
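BIC adds a penalty of log(n) per estimated parameter to minus twice the log-likelihood, so the gap in the output above can be split into a complexity part and a fit part. A rough sketch of that arithmetic, assuming both BIC values were computed with the 539 observations reported for the models and the df shown in the output; since glmer approximates its likelihood, the split is indicative only.

```r
# Values copied from the BIC() output above
bic <- c(Original = 475.6313, Extension = 532.0290)
df  <- c(Original = 22, Extension = 30)
n   <- 539  # assumed to be the n used by BIC() for both models

# BIC = -2 * logLik + df * log(n)
bic_gap      <- unname(bic["Extension"] - bic["Original"])
penalty_gap  <- unname((df["Extension"] - df["Original"]) * log(n))
deviance_gap <- bic_gap - penalty_gap

round(c(bic_gap = bic_gap,
        penalty_gap = penalty_gap,
        deviance_gap = deviance_gap), 2)
```

Under these assumptions, roughly 50 of the 56 BIC points separating the models come from the penalty on the eight extra parameters, with the remainder coming from the fit term.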

4. Conclusions

Chris Hanretty’s model did not replicate in the original verification check performed in this project. However, the results found in this robustness check were similar, suggesting that the model does yield robust results. The BIC comparison of the two models found that the original single level model fit the data better than the extension model with random slopes. The hypothesis stated in the pre-registration form, that the model with regional random slopes would be superior, can therefore be rejected. This means that there is no significant regional variance in the relationship between the Conservative vote and the odds of receiving funding. The relationship discovered in “The Pork Barrel Politics of the Towns Fund” (Hanretty, 2021) is likely to be an effect found across the data, not just in certain geographic locations. The regional effects found by Milligan and Smart (2005) could not be replicated with this model.

References

Freese, J., & Peterson, D. (2017). Replication in social science. Annual Review of Sociology, 43, 147-165, doi: 10.1146.

Hanretty, C. (2021). The pork barrel politics of the Towns Fund. The Political Quarterly, 92(1), 7-13.

Hoare, A. G. (1983). Pork-barrelling in Britain: a review. Environment and Planning C: Government and Policy, 1(4), 413-438.

Hoare, A. G. (1985). Dividing the pork barrel: Britain’s enterprise zone experience. Political Geography Quarterly, 4(1), 29-46.

Lancaster, T. D. (1986). Electoral structures and pork barrel politics. International Political Science Review, 7(1), 67-81.

Milligan, K. S., & Smart, M. (2005). Regional grants as pork barrel politics. Available at SSRN 710903.

Appendix

Appendix 1. My environment (full information)

# Detailed information about my environment
sessionInfo()
R version 4.2.1 (2022-06-23 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 19044)

Matrix products: default

locale:
[1] LC_COLLATE=English_United Kingdom.utf8 
[2] LC_CTYPE=English_United Kingdom.utf8   
[3] LC_MONETARY=English_United Kingdom.utf8
[4] LC_NUMERIC=C                           
[5] LC_TIME=English_United Kingdom.utf8    

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] sjPlot_2.8.14   lme4_1.1-33     Matrix_1.4-1    lubridate_1.9.2
 [5] forcats_1.0.0   stringr_1.5.0   dplyr_1.1.2     purrr_1.0.1    
 [9] readr_2.1.4     tidyr_1.3.0     tibble_3.2.1    ggplot2_3.4.2  
[13] tidyverse_2.0.0 knitr_1.42      rmarkdown_2.21 

loaded via a namespace (and not attached):
 [1] Rcpp_1.0.10        mvtnorm_1.1-3      lattice_0.20-45    digest_0.6.31     
 [5] utf8_1.2.3         R6_2.5.1           backports_1.4.1    evaluate_0.20     
 [9] pillar_1.9.0       rlang_1.1.1        rstudioapi_0.14    minqa_1.2.5       
[13] performance_0.10.3 nloptr_2.0.3       jquerylib_0.1.4    effectsize_0.8.3  
[17] ggeffects_1.2.2    splines_4.2.1      munsell_0.5.0      broom_1.0.4       
[21] modelr_0.1.11      compiler_4.2.1     xfun_0.39          parameters_0.21.1 
[25] pkgconfig_2.0.3    htmltools_0.5.5    insight_0.19.2     tidyselect_1.2.0  
[29] codetools_0.2-18   fansi_1.0.4        tzdb_0.4.0         withr_2.5.0       
[33] MASS_7.3-57        sjmisc_2.8.9       grid_4.2.1         nlme_3.1-157      
[37] jsonlite_1.8.4     gtable_0.3.3       lifecycle_1.0.3    magrittr_2.0.3    
[41] bayestestR_0.13.1  scales_1.2.1       datawizard_0.7.1   estimability_1.4.1
[45] cli_3.6.1          stringi_1.7.12     cachem_1.0.8       bslib_0.4.2       
[49] generics_0.1.3     vctrs_0.6.2        boot_1.3-28        sjlabelled_1.2.0  
[53] tools_4.2.1        glue_1.6.2         sjstats_0.18.2     hms_1.1.3         
[57] emmeans_1.8.6      fastmap_1.1.1      yaml_2.3.7         timechange_0.2.0  
[61] colorspace_2.1-0   sass_0.4.6        

Appendix 2. Entire R code used in the project

# Opening key libraries first
library(rmarkdown)
library(knitr)
# Global options
opts_chunk$set(echo=TRUE,
                 cache=TRUE,
               comment=NA,
               message=FALSE,
               warning=FALSE)
library(rmarkdown)
library(knitr)
library(tidyverse)
library(lme4)
library(sjPlot)
# Versions of used packages
packages <- c("rmarkdown", "knitr")
names(packages) <- packages
lapply(packages, packageVersion)
# What is my R version?
version[['version.string']]
dat <- readRDS("Data/selection_data.rds")
dat$outcome <- as.numeric(dat$Funded == "Yes")
# Drop towns with no recorded Conservative margin
dat2 <- subset(dat, !is.na(ConMaj.allm))

# Band the margin, starting from the filtered data so the NA filter is kept
dat2 <- dat2 %>%
  mutate(ConMaj.allm.categorical =
           case_when(ConMaj.allm < -5 ~ "-10% to -5%",
                     ConMaj.allm > -5 & ConMaj.allm < 0 ~ "-5 to 0",
                     ConMaj.allm > 0 & ConMaj.allm < 5 ~ " 0 to 5",
                     ConMaj.allm > 5 & ConMaj.allm < 10 ~ "5 to 10",
                     ConMaj.allm > 10 ~ "greater than 10"))
# Bar chart showing the number of towns located in each region
ggplot(data=dat2)+
  aes(x=Region,
      fill=ConWinner1.allm)+
  geom_bar()+  
  labs(title = "How many Towns are in each Region?",
    x="Region of England",
       y="Number of Towns",
       fill= "Is this town located in a conservative seat?")+
  theme(legend.position="top")+
  scale_fill_manual(values=c("red3",
                                   "blue4" ))
# Relationship between Conservative margins and odds of success
ggplot(data=dat2)+
  aes(x=ConMaj.allm,
      y=outcome,
      colour=Region)+
  geom_point()+
  geom_smooth(method="glm",
              se=FALSE)+
   scale_colour_manual(values=c("East Midlands"="mediumpurple4",
                             "East of England"="mediumpurple2",
                             "North East"="firebrick4",
                             "North West"="red3",
                             "South East"="steelblue1",
                             "South West"="dodgerblue4",
                             "West Midlands"="blueviolet",
                             "Yorkshire and the Humber"="firebrick1"))+
  theme_bw()
#Original Model Replication
Original <- 
  glm(outcome ~ ConMaj.allm.categorical+
                 log(Pop) + 
                 Rank + 
                 ConWinner1.allm + 
                 Region+
                 Quals+ 
                 IncomeDep+
                 Brexit+
                 Productivity+
                 Shocks+
                 Investment+
                 Alignment,
               family = binomial,
               data = dat2)


#Extension Model
Extension <- glmer(data=dat2, outcome~ ConMaj.allm.categorical+
                       log(Pop)+ 
                       Rank+
                       ConWinner1.allm+ 
                       Quals+ 
                       IncomeDep+
                       Brexit+
                       Productivity+
                       Shocks+
                       Investment+
                       Alignment+( 1+ ConMaj.allm.categorical |Region),
                       family=binomial)

tab_model(Original,Extension)
BIC(Original,Extension)
# Detailed information about my environment
sessionInfo()
