This project investigates the data surrounding California’s
Proposition 61 which appeared on the 2016 Ballot. Proposition 61 would
prohibit the state from buying prescription drugs above the lowest price
offered for the Department of Veteran Affairs. This particular
proposition is interesting to investigate because pharmaceutical
companies spent a whopping $109 million dollars on advertising against
the measure. The Proposition was not passed, by a 54% vote for No. Did
the advertising from Big Pharma target veteran and rural populations, or
are there more subtle factors that influenced the outcome of this
proposition? To begin, we will briefly compare and contrast differences
in census statistics across counties that voted yes and no.
To create these scatter plots, specific census data was
pulled and filtered by county. For the Veteran Proportion, the number of
veterans was devided by county population. Below is an example code
variables <- c(
veterans = "B21001_002E",
total_pop = "B01001_001E"
)
county_data <- get_acs(
geography = "county",
variables = variables,
state = "CA",
year = 2016,
survey = "acs5",
output = "wide"
)
filtered_data <- county_data[county_data$NAME %in% paste(counties, "County, California"), ]
filtered_data$Veteran_Proportion <- filtered_data$veterans / filtered_data$total_pop
ggplot(filtered_data, aes(x = reorder(NAME, -Veteran_Proportion), y = Veteran_Proportion)) +
geom_bar(stat = "identity", fill = "darkorange") +
coord_flip() +
labs(
title = "Veteran Population by County that voted No for Prop 61",
x = "County",
y = "Proportion of Veterans"
) +
scale_y_continuous(labels = scales::percent) +
theme_minimal() +
theme(
axis.text.x = element_text(angle = 45, hjust = 1),
plot.title = element_text(hjust = 0.5, size = 16)
Proposition 61 was significant to veteran populations and was a
large focus point pharmaceutical companies advertised. Theoretically, if
Medi-Cal was getting the same low prices as Veteran Affairs, nothing
would stop pharmaceutical companies from raising the prices for
everyone. Therefore, it was relevant to analyze veteran status with a
vote for Proposition 61. These graphs indicate a difference. Counties
that voted no, on average, had a much greater veteran proportion of the
county population. However, some counties had a similar veteran
population. The following graphs will compare counties with similar
veteran populations by differentiating factors.
These graphs show similar average ages. Age is an important
factor to consider because older individuals will qualify for Medi-Cal,
and the proposition would then be advantageous. However, in counties
with similar veteran populations, there was not a significant difference
in median age.
This data shows a sharper contrast. Counties that voted Yes had
a significantly higher average income. This is counter intuitive, as you
would expect counties with lower come, with more people who qualify for
Medi-Cal, to support Proposition 61. However, high income most likely
indicates that these individuals are more likely to be privately
insured. This disconnect could have allowed voters to be less persuaded
by the advertisements surrounding Proposition 61.
Counties that voted Yes on Prop 61 had smaller households. This
could be correlated with a number of factors. Larger households using a
wider variety of healthcare services, differences in ages requiring a
greater range of coverage, and eligibility.
We will examine Alameda County to preform precinct level
analysis. Alameda County voted yes and has a lower veteran population.
Therefore, will the most obvious factor for a no vote excluded, we can
examine more closely how income, age, and household size correlate with
a Yes or No vote.
To approach this code, I first overlayed the precinct with a
Yes Vote percentage with data from the statewide data base. I mereged
these colums by their precinct identifier.
join <- left_join(shapefile_data, vote61, join_by(PRECINCT == svprec))
## Reading layer `mprec_001_g16_v01' from data source
## `/private/var/folders/c6/wd3l8r_d0pv0df44zx1m3fq40000gn/T/RtmpWlfVRw/mprec_001_g16_v01.shp'
## using driver `ESRI Shapefile'
## Simple feature collection with 1664 features and 1 field
## Geometry type: MULTIPOLYGON
## Dimension: XY
## Bounding box: xmin: -122.3738 ymin: 37.45419 xmax: -121.4692 ymax: 37.90669
## CRS: NA
This is the precincts overlapped with their percentage of a Yes
vote. Green indicates a precinct that voted over 50% Yes and red
indicates a precinct that voted over 50% No. This is our baseline for
comparison
I then preformed a spatial merge between the
precinct shapefile and a cnesus block shapefile. I encountered
challanged here, attempting to make the block fit the shapefile because
the boundaries are different. I overcame this by using st_transform.
tracts_with_data <- tracts %>%
left_join(census_data, by = "GEOID")
st_crs(shapefile_data) <- invisible(st_crs(tracts))
shapefile_data <- invisible(st_transform(shapefile_data, crs = st_crs(tracts)))
invisible(st_is_valid(shapefile_data))
invisible(st_is_valid(tracts))
precincts_with_data <- suppressMessages(st_join(shapefile_data, tracts_with_data, left = TRUE))
precinct_summary <- precincts_with_data %>%
group_by(PRECINCT) %>%
summarize(
avg_income = mean(estimate, na.rm = TRUE)
)
ggplot(data = precinct_summary) +
geom_sf(aes(fill = avg_income)) +
scale_fill_gradient(
low = "red",
high = "green",
) +
theme_minimal() +
labs(
title = "Average Household Income by Precinct in 2016 Alameda County",
fill = "Income"
)
Here we see a positive correlation. Low income is associated
with a No vote. This suggests that Advertising was effective in
spreading the message that this Proposition could impact drug prices
overall, which would harm low income precincts much more than high
income precincts. However, this differentiates when we reach the analyze
the precincts with the lowest income, which is correlated with a Yes
vote. This is visible in the upper left region of the map. Could this
region be differentiated by other census factors?
If you look at the dark red cluster in the upper left region of
the map, this region was correlated with a Yes vote. The area had a much
higher average age, indicating a significant amount of individuals on
Medi-Cal. However, the other, larger, dark red region of the map is
correlated with a mixed vote. Younger individuals along the line of
small precincts also voted Yes on average. These results confirm what
the county level analysis that age is not a strong correlating
factor
With this analysis, we see more clarity surrounding the upper
left larger precincts. This area has a smaller household size correlated
with a yes vote
Significant Precincts Upper Left Region (Larger Precincts): This area is correlated with a Yes vote. The larger precincts on the most left side consist of predominately low-income, middle-aged, individuals with small households. Left Cluster (Significant Elderly Population): The area is correlated with a Yes Vote: The precincts consist of high-income, older individuals with a small average household size. Left Region (Smaller Precincts Resembling a Diagonal Line):This area is more mixed in regards to a yes or no vote. These precincts consist of predominately low-income, younger individuals with a range of household sizes. Right Cluster of Smaller Precincts: This region is correlated with a No vote.These precincts consist of low income, middle-aged, individuals in smaller households.
This project explored the voting patterns of California’s 2016 Proposition 61, which aimed to regulate prescription drug pricing by aligning state purchases with the lowest prices offered to the Department of Veteran Affairs. Using county-level and precinct-level data, the analysis examined census data, like veteran status, income, age, and household size to determine their relationship with a Vote for Proposition 61.
At the county level, a clear correlation emerged between veteran populations and voting preferences. Counties with a higher proportion of veterans were more likely to vote “No,” possibly due to concerns that lowering drug prices for Medi-Cal would affect veterans’ access to medications. Conversely, counties with higher average incomes tended to vote Yes, suggesting that higher-income voters, possibly with private insurance, were less influenced by pharmaceutical advertisements.
Further analysis at the precinct level in Alameda County provided more nuanced insights. While low-income areas generally showed a No vote, there was an exception in some of the poorest precincts, which voted Yes. This was evident in the larger precincts in the upper left region of the map, which were predominantly low-income, middle-aged, and had small households. This region’s support for Proposition 61 may have been influenced by the significant number of older individuals, presumably on Medi-Cal, who stood to benefit from lower drug prices.
The map visualizations also revealed patterns in household size, with smaller households correlating with a Yes vote in certain precincts. Additionally, precincts with a higher average age, possibly indicating more Medi-Cal recipients, also leaned towards a Yes vote. These findings suggest that household size and age, in conjunction with income levels, played a significant role in determining support for Proposition 61.
In conclusion, the data analysis and visualizations illustrated the complex relationship between demographic factors and voting behavior. While income and household size were key predictors, age and veteran status also contributed to the variability in support for the proposition. The mixed results across precincts underscore the challenges pharmaceutical companies faced in swaying public opinion through advertising.
Future studies could focus on low-income areas where misinformation about the proposition may have had a larger influence. Additionally, a more granular examination of precincts with mixed voting patterns could help identify other demographic or social factors that were not accounted for in this analysis. Expanding this analysis to include a broader range of propositions related to healthcare policy could provide further insights into the intersection of healthcare, income inequality, and voting behavior.