Introduction

We just finished another presidential election year, full of surprises and upsets, or maybe upsets and surprises. Either way, the voting part is over, the counting part is ongoing, and unlike 2020, the vanquished had a sense of dignity and sanity. All sorts of experts have crawled out from under their rocks to either claim credit or assign blame. I’m not interested in that. I’m interested in the story data can tell.

I asked myself what Presidential elections looked like from a “pure” numbers perspective, so I collected some data. The voting-eligible population (VEP) came from the University of Florida Election Lab. The vote counts came from either the Wikipedia pages for each election, 1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, and 2020 or the Election Atlas state pages for 2024. The 2024 numbers are all preliminary and should be interpreted with caution. I also collected the number of Electoral College votes each state had in each election to use as weights for modeling. This was looked up from several sources, including Wikipedia.

The data consisted of election year, state names, and counts of VEP and votes for the Democrat, Republican, and Other candidates. I subtracted the sum of the votes from the VEP to get Nonvoting counts. This was all done in Excel, where I also formatted the data into “long” format. I loaded the clean data. You can get it, yourself in this Google sheet.

Before I go any further, I will be using R for this work, so I need to load a bunch of R packages.

library(googlesheets4)
Warning: package ‘googlesheets4’ was built under R version 4.4.2
library(glmmTMB)
library(performance)
library(car)
Loading required package: carData
library(performance)
library(MuMIn)
library(emmeans)
Welcome to emmeans.
Caution: You lose important information if you filter this package's results.
See '? untidy'
library(effectsize)
library(tidyverse)
── Attaching core tidyverse packages ──────────────────────────────────────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.5
✔ forcats   1.0.0     ✔ stringr   1.5.1
✔ ggplot2   3.5.1     ✔ tibble    3.2.1
✔ lubridate 1.9.3     ✔ tidyr     1.3.1
✔ purrr     1.0.2     
── Conflicts ────────────────────────────────────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
✖ dplyr::recode() masks car::recode()
✖ purrr::some()   masks car::some()
ℹ Use the ]8;;http://conflicted.r-lib.org/conflicted package]8;; to force all conflicts to become errors
library(reshape2)
Warning: package ‘reshape2’ was built under R version 4.4.2

Attaching package: ‘reshape2’

The following object is masked from ‘package:tidyr’:

    smiths
Vote.dat <- read_sheet("https://docs.google.com/spreadsheets/d/1oySzPkmaVMBHHwAib4kLuca_7XQrXg5x67hQmLKUpTA/edit")
The googlesheets4 package is requesting access to your Google account.
Enter '1' to start a new auth process or select a pre-authorized account.
1: Send me to the browser for a new auth process.
2: maloneywriting18@gmail.com
1
Waiting for authentication in browser...
Press Esc/Ctrl + C to abort
Authentication complete.
✔ Reading from Vote.sheet.
✔ Range Sheet1.
Vote.dat$State <- factor(Vote.dat$State)
Vote.dat$Selection <- factor(Vote.dat$Selection)

Feel free to use the above link to do a manual import if you’d prefer that.

So, let’s take a look at the raw data.

messy.plot <- ggplot(Vote.dat, aes(x = jitter(Year), y = Votes/VEP, group = Selection, color = Selection)) + 
  scale_color_manual(values = c("#4477AA", "#228833", "#CCBB44", "#EE6677"),
                     limits = c("Democrat", "Other", "Nonvoting", "Republican")) +
  geom_point(size = 3) +
  theme(panel.background = element_rect(fill = "white"),
        axis.line = element_line(size = 1, color = "black")) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(breaks = c(1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, 2020, 2024)) +
  ylab("Frequency (per VEP)") + xlab("Year")

messy.plot

Summarizing

That’s a real mess. Let’s try generating a summary.

# Some background functions
weighted.average <- function(x, w){
## Sum of the weights
sum.w <- sum(w, na.rm = T)
## Sum of the weighted $x_i$
xw <- sum(w*x, na.rm = T)
## Return the weighted average
return(xw/sum.w)
}

weighted.se.mean <- function(x, w, na.rm = T){
## Remove NAs
if (na.rm) {
i <- !is.na(x)
w <- w[i]
x <- x[i]
}
## Calculate effective N and correction factor
n_eff <- (sum(w))^2/(sum(w^2))
correction = n_eff/(n_eff-1)
## Get weighted variance
numerator = sum(w*(x-weighted.average(x,w))^2)
denominator = sum(w)
## get weighted standard error of the mean
se_x = sqrt((correction * (numerator/denominator))/n_eff)
return(se_x)
}

# Making the weighted summary
Vote.sum <- Vote.dat %>%
  mutate(pasteup = paste(Year, Selection)) %>%
  mutate(ratio = Votes/VEP) %>%
  group_by(pasteup) %>%
  summarize(weighted_means = weighted.average(ratio, Electoral.Votes)) %>%
  cbind(Vote.dat %>%
          mutate(pasteup = paste(Year, Selection)) %>%
          mutate(ratio = Votes/VEP) %>%
          group_by(pasteup) %>%
          summarize(weighted_CL = weighted.se.mean(ratio, Electoral.Votes) * qt(0.975, 50)))
Vote.sum <- Vote.sum[, c(1, 2, 4)]
Vote.sum <- cbind(colsplit(Vote.sum$pasteup, " ", c("Year", "Selection")), Vote.sum[,2:3])
Vote.sum$Selection <- factor(Vote.sum$Selection)
Vote.sum

What I did was take weighted means by year of all the states for votes cast for the Democratic or Republican candidate or any other candidate, along with the remainder (non-voting VEP), divided by the VEP. The means were weighted by the Electoral College votes that state had in that election year. I then calculated the weighted standard errors of the mean, using a function written by Alex Stephenson. I converted these to 95% confidence limits by multiplying by the appropriate quantile of t at 0.975 and 50 degrees of freedom. Why did I weight the popular votes? It’s a concession to reality. People might hand-wring and whine all they like about how horrid the Electoral College might be, but it’s not going away any time soon. Understanding US presidential politics requires taking it into account.

So, I have a table, I don’t find tables useful. What are useful are charts.

summary.plot <- ggplot(Vote.sum, aes(x = Year, y = weighted_means, group = Selection, color = Selection)) + 
  geom_line(linewidth = 1) +
  scale_color_manual(values = c("#4477AA", "#228833", "#CCBB44", "#EE6677"),
                     limits = c("Democrat", "Other", "Nonvoting", "Republican")) +
  geom_point(size = 3) +
  geom_errorbar((aes(ymin=weighted_means - weighted_CL, ymax = weighted_means + weighted_CL))) +
  theme(panel.background = element_rect(fill = "white"),
        axis.line = element_line(size = 1, color = "black"),
        legend.position = c(0.8, 0.3)) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(breaks = c(1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, 2020, 2024)) +
  ylab("Frequency (per VEP)")

summary.plot

I didn’t just pull the colors out of my cute patootie. They’re from Paul Tol’s site that describes many wonderful (and color-blind safe) color schemes, arranged with the now-traditional coloring of blue for Democrats and red for Republicans.

A little non-statistical analysis

We can analyze the above chart without even considering statistics. I’m going to be mostly just descriptive. The first thing to note is that the most popular “candidate”, by far, for the last 44 years has been nobody. Since at least 1980, there has never been a president elected by the majority of Americans eligible to vote. Let us wring hands. Hand wringing time is over. While it’s true that “get out of my face” has usually been the most popular selection, it’s equally true that this option has been chosen less and less as time has gone on. Whatever is wrong with America, more Americans have been getting out to vote. True enough, 2024 was another election of apathy, but it’s hard to call the future based on a single-point deflection. The overall trend has been for non-voting to drop by a little less than 1 percentage point every election. The Democrats and Republicans are basically in a dead heat in dividing this gain up between themselves. The overall Democrat gain since 1980 has been greater than the overall Republican gain, but currently they are indistinguishable. On the other hand, the slow increase in voting doesn’t seem to have been distributed to “Other” options.

Another thing to note is that the Republicans and Democrats seem to have switched performances from 1980 to the present. From 1980 to 2004, Republican performance fluctuated wildly from year to year. The Democrats, on the other hand, were experiencing very slow but very regular gains until 2000. From these inflection points to the present day, this has switched. The Democrats now have wild swings while the Republicans’ oscillations have calmed down (although with more variation than the early-period Democrats showed). How to explain this or what it means for the future I leave to more knowledgeable people. What I can say is that it looks like both parties experienced a significant sea change roughly 2000-2004, and it is having long-lasting effects on each of them. What these changes were, I can’t even speculate. Feel free.

Calling the Horse Races

Let’s dive deeper and look at each election in turn. Reagan’s tenure was marked by moving from strong to stronger. Unusually for incumbents, he gained percentage in his re-election bid. However, this was not at the expense of Mondale, who also got a (very tiny) gain vs. Carter’s performance. Instead, Reagan may have attracted some non-voters and a large portion of people who had voted for Anderson (who?). Poppy Bush’s election was an exercise in coat-tail riding. The nonvoters bounced back up, and Dukakis got another (very tiny) gain over the Democrats’ share of the previous election,

Indeed, from 1980 to 2000 “a very tiny gain” could accurately describe the Democratic party’s entire presidential performance, even in years they took the White House. In every election, it was a very small improvement in attracting a proportion of the voting-eligible population, while the Republicans see-sawed up and down in popularity. For example, Clinton took office in a tidal wave of spoilage. Ross Perot’s leap seems to have been fueled by some Republican-leaning voters and bigger chunk of people who otherwise wouldn’t have bothered to vote at all. But Perot was a gadfly candidate. Bush was the anointed leader of the Republican party, so there was a squabble on that side of the political spectrum. Clinton rode in on this disunity.

Perot’s huge-for-a-third-dude showing cannot be explained only by attracting nonvoters. In 1998, there was a giant leap in apathy, which probably hurt Dole more than it hurt Clinton. However, Clinton didn’t experience a spike in popularity. He managed to do a bit better than last time. This set the stage for 2000.

This was an extremely close election. What is interesting is that there was very little apparent motivation of the nonvoters. It ended up in a mess. Bush’s re-election was more definite and it was accompanied by the Democratic party starting to improve more quickly its slug-like crawl upwards since 1980. However, Bush was able to attract more of the nonvoters (and possibly Other voters) than did Kerry. What is interesting is that this election saw a big drop in nonvoters, which goes against the general trend of an increase in apathy or at very best a small decrease during a re-election attempt. However, these voters didn’t turn out for Kerry.

In 2008, Obama ended up continuing a straight-line trend started by the prior two failed Democrat attempts. There was a small tend downwards in nonvoters, but the Republicans lost share as well, indicating some possible aisle-crossing by voters. Obama’s second turn was a typical successful re-election, with more people choosing to sit it out, but not enough to force a loss. The Republicans were still experiencing the Bush 2-inspired decline.

2016 was another close election. It’s well-documented that Trump did not achieve the total popular majority of votes cast. The weighted means still reflect this. This is not anomalous. It is because weighted means can “distribute” proportional Electoral outcomes within each state as well as fractions. What we do see is that Gary Johnson had a bit of a spike, which may have been acquired from a mixture of nonvoters and both Democrat and Republican leaning voters.

The Biden election is illustrative of what might be an important trend. Once again, the number of nonvoters fell (to its lowest level), and the primary beneficiary seems to have been Biden. It was the first time in the period I looked at that an actual candidate managed to get more popular votes than “this election isn’t worth voting in”!

Finally, Iit should be noted that the 2024 results are based in part on projections and estimates and ought not be over-interpreted. However, the biggest winner is “Nobody is worth my vote.” Apathy and disengagement won again, with one of its biggest spikes since 1996. However, this time, the Republicans weren’t riding a massive bottom-out trend. If the projections are to be believed, Trump also lost share from his previous attempt. He merely lost less than did Harris vs. Biden’s performance. Trump managed to get less of the voting-eligible public than he did in 2020. Unfortunately, Harris managed to do even worse. I can’t explain why. I won’t try to explain why. I can only show the what. The why requires more in-depth knowledge than I have.

Modeling

We could stop right there, but I looked at the chart and wondered if there were any possibly verifiable trends in what I think I saw. That’s when I pulled out the formal modeling tools.

The data was repeated measurements (by state and District of Columbia) of 12 presidential elections. When I see repeated measures, I like to use a mixed-level model. If you want to know more, you can start here. The data is count data. Yes, I presented it as frequencies, but a frequency is a count divided by a reference count. The preferred way to analyze count data is with a Poisson distribtion. However, the Poisson distribution presumes that the mean and the variance of the data are the same, or dispersion = 1, to use jargon. A dispersion over 1 is overdispersed, and under 1 is underdispersed. Since I did not yet know if the data were over or under-dispersed, I chose R’s glmmTMB package, since it has options that can handle either eventuality. I started with a first pass model that used the Poisson family.

options(contrasts = rep("contr.Sum",2))
start.mod <- glmmTMB(Votes ~ Selection * Year + (1 | State) + offset(log(VEP)), weights = Electoral.Votes, family = poisson, data = Vote.dat)
Warning in (function (start, objective, gradient = NULL, hessian = NULL,  :
  NA/NaN function evaluation

I got a warning. It’s often due to differences in variable scale. The offending variable is probably Year, which can be fixed.

Vote.dat$Year.s <- scale(Vote.dat$Year)
start.mod <- glmmTMB(Votes ~ Selection * Year.s + (1 | State) + offset(log(VEP)), weights = Electoral.Votes, family = poisson, data = Vote.dat)

And I had to check the dispersion.

check_overdispersion(start.mod)
# Overdispersion test

       dispersion ratio =    1112634.483
  Pearson's Chi-Squared = 2713715504.977
                p-value =        < 0.001
Overdispersion detected.

Yes, definitely overdispersed. There are fancy ways to handle this, like the Conway-Maxwell-Poisson family, but the current implementation of this family can be slower than a geologic age to run. Fortunately, the reliable old negative binomial family will handle overdispersion. Since the sample had over 2000 entries, I felt comfortable exploring different random effects vis the states. I compared models that included random intercepts and random slopes for both Selection and Year, with and without correlated intercepts.

nb.mod1 <- glmmTMB(Votes ~ Selection * Year.s + (1 | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod2 <- glmmTMB(Votes ~ Selection * Year.s + (Selection | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
Warning in finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
  Model convergence problem; singular convergence (7). See vignette('troubleshooting'), help('diagnose')
nb.mod3 <- glmmTMB(Votes ~ Selection * Year.s + (Year.s | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod4 <- glmmTMB(Votes ~ Selection * Year.s + (Selection + Year.s | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
Warning in finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
  Model convergence problem; non-positive-definite Hessian matrix. See vignette('troubleshooting')
Warning in finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
  Model convergence problem; singular convergence (7). See vignette('troubleshooting'), help('diagnose')
nb.mod5 <- glmmTMB(Votes ~ Selection * Year.s + (Selection * Year.s | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
Warning in finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
  Model convergence problem; non-positive-definite Hessian matrix. See vignette('troubleshooting')
Warning in finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
  Model convergence problem; singular convergence (7). See vignette('troubleshooting'), help('diagnose')
nb.mod6 <- glmmTMB(Votes ~ Selection * Year.s + (Selection || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod7 <- glmmTMB(Votes ~ Selection * Year.s + (Year.s || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod8 <- glmmTMB(Votes ~ Selection * Year.s + (Selection + Year.s || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
Warning in finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
  Model convergence problem; false convergence (8). See vignette('troubleshooting'), help('diagnose')
nb.mod9 <- glmmTMB(Votes ~ Selection * Year.s + (Selection * Year.s || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)

I compared models that included random intercepts and random slopes for both Selection and Year, with and without correlated intercepts. Different models were compared by second-order Akaike information criterion . I admit, doing this is not rigorous. There’s controversy over whether AICc can be calculated at all for a mixed model, but I’m not being paid for this, so I wasn’t going to take the time to do more in-depth model comparisons.

Alist <- c(AICc(nb.mod1),
           AICc(nb.mod2),
           AICc(nb.mod3),
           AICc(nb.mod4),
           AICc(nb.mod5),
           AICc(nb.mod6),
           AICc(nb.mod7),
           AICc(nb.mod8),
           AICc(nb.mod9))
Alist
[1] 742668.5 739833.8 742668.2       NA       NA 740009.8 742666.3 739999.4 739731.6

The model with the lowest AICc is the 9th model, so that’s what I adopted. You may have noticed a few lines back that I set the contrasts to “contr.Sum”. These are simple sum-to-zero contrasts, aka effect coding. R’s default is dummy coding or treatment contrasts, which are fine if you’re not worrying about analyzing interactions. We have an interaction, so I’m using effect coding.

Vote.model <- nb.mod9
summary(Vote.model)
 Family: nbinom2  ( log )
Formula:          Votes ~ Selection * Year.s + (Selection * Year.s || State) +      offset(log(VEP))
Data: Vote.dat
Weights: Electoral.Votes

      AIC       BIC    logLik  deviance  df.resid 
 739731.4  739830.0 -369848.7  739697.4      2431 

Random effects:

Conditional model:
 Groups Name                          Variance  Std.Dev. Corr                               
 State  (Intercept)                   0.0095931 0.09794                                     
        Selection[S.Democrat]         0.0457551 0.21390  0.00                               
        Selection[S.Nonvoting]        0.0474525 0.21784  0.00 0.00                          
        Selection[S.Other]            0.0721619 0.26863  0.00 0.00 0.00                     
        Year.s                        0.0005149 0.02269  0.00 0.00 0.00 0.00                
        Selection[S.Democrat]:Year.s  0.0059620 0.07721  0.00 0.00 0.00 0.00 0.00           
        Selection[S.Nonvoting]:Year.s 0.0036982 0.06081  0.00 0.00 0.00 0.00 0.00 0.00      
        Selection[S.Other]:Year.s     0.0063977 0.07999  0.00 0.00 0.00 0.00 0.00 0.00 0.00 
Number of obs: 2448, groups:  State, 51

Dispersion parameter for nbinom2 family (): 3.74 

Conditional model:
                               Estimate Std. Error z value Pr(>|z|)    
(Intercept)                   -1.782538   0.014308 -124.58   <2e-16 ***
Selection[S.Democrat]          0.450019   0.030783   14.62   <2e-16 ***
Selection[S.Nonvoting]         0.872299   0.031316   27.85   <2e-16 ***
Selection[S.Other]            -1.838467   0.038281  -48.03   <2e-16 ***
Year.s                        -0.079030   0.004922  -16.06   <2e-16 ***
Selection[S.Democrat]:Year.s   0.184270   0.012791   14.41   <2e-16 ***
Selection[S.Nonvoting]:Year.s -0.001572   0.010808   -0.15    0.884    
Selection[S.Other]:Year.s     -0.288923   0.013224  -21.85   <2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
r2(Vote.model)
# R2 for Mixed Models

  Conditional R2: 0.453
     Marginal R2: 0.417
Anova(Vote.model, type = "III")
Analysis of Deviance Table (Type III Wald chisquare tests)

Response: Votes
                    Chisq Df Pr(>Chisq)    
(Intercept)      15520.80  1  < 2.2e-16 ***
Selection         3253.50  3  < 2.2e-16 ***
Year.s             257.86  1  < 2.2e-16 ***
Selection:Year.s   630.28  3  < 2.2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Don’t spend much time trying to interpret coefficients. I’m going to estimate marginal means and trends, which are much easier to interpret. Marginal means (trends for slopes) estimate based on the model and take into account the effects of all parts of the model on a response. They summares what the model is saying in a way that is more intuitive than attempting to puzzle out multiple coefficients. If you want to really go down a rabbit hole, there’s a very complete description you can read elsewhere.

If you look at the R2 (Nakagawa’s R2), this model isn’t too bad a fit. Marginal R2 estimates the total contribution of the fixed effects, conditional includes the random effects (individual state effects).

I’m not interested in comparing outcomes for a single given election. That’s not very informative from this high up in the clouds. Comparing the trends, on the other hand, interests me.

Vote.emt <- emtrends(Vote.model, ~Selection:Year.s, "Year.s")
Vote.emt
 Selection  Year.s Year.s.trend     SE  df asymp.LCL asymp.UCL
 Democrat        0       0.1052 0.0137 Inf    0.0784    0.1321
 Nonvoting       0      -0.0806 0.0119 Inf   -0.1038   -0.0574
 Other           0      -0.3680 0.0142 Inf   -0.3958   -0.3401
 Republican      0       0.0272 0.0196 Inf   -0.0113    0.0656

Confidence level used: 0.95 

Visualizing

What does this tell us? We can see that this trend is increasing over the 1980-2024 period for Democrats, decreasing for non-voting, and votes for “Other” candidates, and weakly increasing over time for Republicans. What is interesting is that the confidence limits for the Republican trend are negative and positive, suggesting that the overall trend may not significantly differ from zero for this period. But words say so much less than a picture. First, we have to generate some points from the model.

Vote.emm <- emmeans(Vote.model, ~Selection:Year.s, type = "response", offset = log(1), at = 
                      list(Year.s = as.numeric(levels(factor(Vote.dat$Year.s)))))
Vote.preds <- data.frame(Vote.emm) %>%
  mutate(Year = (Year.s * attr(Vote.dat$Year.s, "scaled:scale") + attr(Vote.dat$Year.s, "scaled:center")))
Vote.preds
model.plot <- ggplot(Vote.preds, aes(x = Year, y = response, group = Selection, color = Selection)) + 
  geom_line(linewidth = 1) +
  scale_color_manual(values = c("#4477AA", "#228833", "#CCBB44", "#EE6677"),
                     limits = c("Democrat", "Other", "Nonvoting", "Republican")) +
  geom_ribbon(aes(ymin = asymp.LCL, ymax = asymp.UCL), alpha = 0.05) +
  theme(panel.background = element_rect(fill = "white"),
        axis.line = element_line(size = 1, color = "black"),
        legend.position = c(0.8, 0.3)) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(breaks = c(1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, 2020, 2024)) +
  ylab("Frequency (per VEP)")

model.plot

The ribbons around each line are the 95% confidence intervals. The trends look like they differ from each other, but is that real?

pairs(Vote.emt)
 contrast                               estimate     SE  df z.ratio p.value
 Democrat Year.s0 - Nonvoting Year.s0      0.186 0.0176 Inf  10.573  <.0001
 Democrat Year.s0 - Other Year.s0          0.473 0.0193 Inf  24.541  <.0001
 Democrat Year.s0 - Republican Year.s0     0.078 0.0281 Inf   2.774  0.0283
 Nonvoting Year.s0 - Other Year.s0         0.287 0.0180 Inf  15.997  <.0001
 Nonvoting Year.s0 - Republican Year.s0   -0.108 0.0256 Inf  -4.217  0.0001
 Other Year.s0 - Republican Year.s0       -0.395 0.0287 Inf -13.775  <.0001

P value adjustment: tukey method for comparing a family of 4 estimates 

If we look at the adjusted p values for every pairwise comparison of trends, yes. Even adjusting for all multiple comparisons, every trend differs from every other trend with a p of < 0.05. How can this be interpreted? Typically, people say “significant” or “not significant” and call it a day. There’s another interpretation of the p value, and that’s the probability of the observed difference occuring by random chance if the difference did not actually exist. For example, if there really were no difference between Republican and Democrat performance, according to this model, we would expect to still see the difference we estimated about 2.8% of the time.

So, we can generally infer that the differences we think we see are probably real. However, are they meaningful? The p value can’t tell us how large a difference is. It can only tell us how likely it is. Fortunately, standardized effect sizes exist to explore this question.

Vote.eff <- eff_size(Vote.emt, sigma = sigma(Vote.model), edf = df.residual(Vote.model))
Vote.eff
 contrast                               effect.size      SE  df asymp.LCL asymp.UCL
 Democrat Year.s0 - Nonvoting Year.s0        0.0497 0.00470 Inf   0.04049    0.0589
 Democrat Year.s0 - Other Year.s0            0.1266 0.00525 Inf   0.11625    0.1368
 Democrat Year.s0 - Republican Year.s0       0.0209 0.00753 Inf   0.00611    0.0356
 Nonvoting Year.s0 - Other Year.s0           0.0768 0.00493 Inf   0.06719    0.0865
 Nonvoting Year.s0 - Republican Year.s0     -0.0288 0.00684 Inf  -0.04223   -0.0154
 Other Year.s0 - Republican Year.s0         -0.1057 0.00778 Inf  -0.12093   -0.0904

sigma used for effect sizes: 3.739 
Confidence level used: 0.95 

These effect sizes are Cohen’s d, which boils down to how far away the trends are from each other, adjusted by how uncertain the estimates of the trends are. The numbers aren’t very informative by themselves. Fortunately has been extensive work to define meaningful categories of effect sizes for political science. I think that the categories proposed by Erik Ghneer Larsen are reasonable. However, he presented them in relation to Pearson’s r. Fortunately, it is possible to convert between the two. Unfortunately, the standard conversion of d to r is biased to produce low outcomes, because it converts to the point-biserial r and presumes that d was estimated with a binomial predictor. Year.s is a continuous predictor. Therefore, I converted d to the probability of superiority common language effect size and then converted this to r using the formula found in Dunlap, 1994.

Vote.d_to_r <- data.frame(summary(Vote.eff)$contrast, summary(Vote.eff)$effect.size,
                          d_to_p_superiority(summary(Vote.eff)$effect.size),
                          sin(((d_to_p_superiority(summary(Vote.eff)$effect.size)) -.5) * pi))
colnames(Vote.d_to_r) <- c("Contrast", "d", "CLES", "r")


Vote.d_to_r <- Vote.d_to_r %>% mutate(Larsen = case_when(
  abs(r) < 0.05 ~ "Less than Small Effect",
  abs(r) >= 0.05 & abs(r) < 0.15 ~ "Small Effect",
  abs(r) >= 0.15 & abs(r) < 0.30 ~ "Medium Effect",
  abs(r) >= 0.30 ~ "Large Effect"
))

Vote.d_to_r

CLES is the “Common Language Effect Size” or “probability of superiority”. This is an estimate of the probability (between 0 and 1) that, for a given trait, a member of one group will be larger than a member of the other group. A CLES of 0.5 means the groups are identical. As you can see, none of the differences are large.

What do we take home?

If you’ve made it this far, you really like chopping through the weeds, but what have we learned. In addition to the race-by-race non-statistical analysis, we can say with some certainty that, since 1980, the Democratic party has been steadily gaining among the voting-eligible population, while Republican gains have been slower. This difference is likely to be real, but it’s not at all large. What is also interesting is that the proportion of people just sitting it out has been slowly decreasing since 1980. Americans have been becoming more engaged.

What about the “Other”? I’ll freely admit that this category fluctuated too wildly to make any deep conclusions. Ross Perot was a major influence, but the Perot spike has never been repeated.

The Future

Short-term, there is no way to predict 2028, certainly not right now. Farther into the future, if the trends I mapped continue, although there is no way to say they must, the Republican party’s gains would be outstripped by the Democrats as more Americans choose to participate in presidential elections. I’m goint to speculate that I’m not the first person to do an analysis like this. I’ve no doubt that professional strategists have already been looking at these trends. This could explain the willingness to adopt severe measures that seems to have driven the Republican party in recent years. If the current paradigm is not destroyed, the Republicans may become irrelevant.

---
title: "American Presidential Voting Habits, Spots and Trends"
output: html_notebook
author: "Bryan J. Maloney"
date: "13 November 2024"
---

## Introduction
We just finished another presidential election year, full of surprises and upsets, or maybe upsets and surprises. Either way, the voting part is over, the counting part is ongoing, and unlike 2020, the vanquished had a sense of dignity and sanity. All sorts of experts have crawled out from under their rocks to either claim credit or assign blame. I'm not interested in that. I'm interested in the story data can tell.

I asked myself what Presidential elections looked like from a "pure" numbers perspective, so I collected some data. The voting-eligible population (VEP) came from the [University of Florida Election Lab](https://election.lab.ufl.edu/voter-turnout/). The vote counts came from either the Wikipedia pages for each election, [1980](https://en.wikipedia.org/wiki/1980_United_States_presidential_election#Results_by_state), [1984](https://en.wikipedia.org/wiki/1984_United_States_presidential_election#Results_by_state), [1988](https://en.wikipedia.org/wiki/1988_United_States_presidential_election#Results_by_state), [1992](https://en.wikipedia.org/wiki/1992_United_States_presidential_election#Results_by_state), [1996](https://en.wikipedia.org/wiki/1996_United_States_presidential_election#Results_by_state), [2000](https://en.wikipedia.org/wiki/2000_United_States_presidential_election#Results_by_state), [2004](https://en.wikipedia.org/wiki/2004_United_States_presidential_election#Results_by_state), [2008](https://en.wikipedia.org/wiki/2008_United_States_presidential_election#Results_by_state), [2012](https://en.wikipedia.org/wiki/2012_United_States_presidential_election#Results_by_state), [2016](https://en.wikipedia.org/wiki/2016_United_States_presidential_election#Results_by_state), and [2020](https://en.wikipedia.org/wiki/2020_United_States_presidential_election#Results_by_state) or the [Election Atlas state pages for 2024](https://uselectionatlas.org/RESULTS/). The 2024 numbers are all preliminary and should be interpreted with caution. I also collected the number of Electoral College votes each state had in each election to use as weights for modeling. This was looked up from several sources, [including Wikipedia](https://en.wikipedia.org/wiki/United_States_Electoral_College#Chronological_table).

The data consisted of election year, state names, and counts of VEP and votes for the Democrat, Republican, and Other candidates. I subtracted the sum of the votes from the VEP to get Nonvoting counts. This was all done in Excel, where I also formatted the data into “long” format. I loaded the clean data. You can get it, yourself [in this Google sheet](https://docs.google.com/spreadsheets/d/1oySzPkmaVMBHHwAib4kLuca_7XQrXg5x67hQmLKUpTA/edit?usp=sharing").

Before I go any further, I will be using R for this work, so I need to load a bunch of R packages.

```{r}
library(googlesheets4, quietly = TRUE)
library(glmmTMB, quietly = TRUE)
library(performance, quietly = TRUE)
library(car, quietly = TRUE)
library(performance, quietly = TRUE)
library(MuMIn, quietly = TRUE)
library(emmeans, quietly = TRUE)
library(effectsize, quietly = TRUE)
library(tidyverse, quietly = TRUE)
library(reshape2, quietly = TRUE)
```
```{r}
Vote.dat <- read_sheet("https://docs.google.com/spreadsheets/d/1oySzPkmaVMBHHwAib4kLuca_7XQrXg5x67hQmLKUpTA/edit")
Vote.dat$State <- factor(Vote.dat$State)
Vote.dat$Selection <- factor(Vote.dat$Selection)
```

Feel free to use the above link to do a manual import if you'd prefer that.

So, let's take a look at the raw data.

```{r}
messy.plot <- ggplot(Vote.dat, aes(x = jitter(Year), y = Votes/VEP, group = Selection, color = Selection)) + 
  scale_color_manual(values = c("#4477AA", "#228833", "#CCBB44", "#EE6677"),
                     limits = c("Democrat", "Other", "Nonvoting", "Republican")) +
  geom_point(size = 3) +
  theme(panel.background = element_rect(fill = "white"),
        axis.line = element_line(size = 1, color = "black")) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(breaks = c(1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, 2020, 2024)) +
  ylab("Frequency (per VEP)") + xlab("Year")

messy.plot
```

## Summarizing

That's a real mess. Let's try generating a summary.
```{r}
# Some background functions
weighted.average <- function(x, w){
## Sum of the weights
sum.w <- sum(w, na.rm = T)
## Sum of the weighted $x_i$
xw <- sum(w*x, na.rm = T)
## Return the weighted average
return(xw/sum.w)
}

weighted.se.mean <- function(x, w, na.rm = T){
## Remove NAs
if (na.rm) {
i <- !is.na(x)
w <- w[i]
x <- x[i]
}
## Calculate effective N and correction factor
n_eff <- (sum(w))^2/(sum(w^2))
correction = n_eff/(n_eff-1)
## Get weighted variance
numerator = sum(w*(x-weighted.average(x,w))^2)
denominator = sum(w)
## get weighted standard error of the mean
se_x = sqrt((correction * (numerator/denominator))/n_eff)
return(se_x)
}

# Making the weighted summary
Vote.sum <- Vote.dat %>%
  mutate(pasteup = paste(Year, Selection)) %>%
  mutate(ratio = Votes/VEP) %>%
  group_by(pasteup) %>%
  summarize(weighted_means = weighted.average(ratio, Electoral.Votes)) %>%
  cbind(Vote.dat %>%
          mutate(pasteup = paste(Year, Selection)) %>%
          mutate(ratio = Votes/VEP) %>%
          group_by(pasteup) %>%
          summarize(weighted_CL = weighted.se.mean(ratio, Electoral.Votes) * qt(0.975, 50)))
Vote.sum <- Vote.sum[, c(1, 2, 4)]
Vote.sum <- cbind(colsplit(Vote.sum$pasteup, " ", c("Year", "Selection")), Vote.sum[,2:3])
Vote.sum$Selection <- factor(Vote.sum$Selection)
Vote.sum
```

What I did was take weighted means by year of all the states for votes cast for the Democratic or Republican candidate or any other candidate, along with the remainder (non-voting VEP), divided by the VEP. The means were weighted by the Electoral College votes that state had in that election year. I then calculated the weighted standard errors of the mean, using a function written by [Alex Stephenson](https://www.alexstephenson.me/post/2022-04-02-weighted-variance-in-r/). I converted these to 95% confidence limits by multiplying by the appropriate quantile of t at 0.975 and 50 degrees of freedom. Why did I weight the popular votes? It's a concession to reality. People might hand-wring and whine all they like about how horrid the Electoral College might be, but it's not going away any time soon. Understanding US presidential politics requires taking it into account.

So, I have a table, I don't find tables useful. What are useful are charts.
```{r}
summary.plot <- ggplot(Vote.sum, aes(x = Year, y = weighted_means, group = Selection, color = Selection)) + 
  geom_line(linewidth = 1) +
  scale_color_manual(values = c("#4477AA", "#228833", "#CCBB44", "#EE6677"),
                     limits = c("Democrat", "Other", "Nonvoting", "Republican")) +
  geom_point(size = 3) +
  geom_errorbar((aes(ymin=weighted_means - weighted_CL, ymax = weighted_means + weighted_CL))) +
  theme(panel.background = element_rect(fill = "white"),
        axis.line = element_line(size = 1, color = "black"),
        legend.position = c(0.8, 0.3)) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(breaks = c(1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, 2020, 2024)) +
  ylab("Frequency (per VEP)")

summary.plot
```

I didn't just pull the colors out of my cute patootie. They're from Paul Tol's site that describes many wonderful (and color-blind safe) [color schemes](https://personal.sron.nl/~pault/), arranged with the now-traditional coloring of blue for Democrats and red for Republicans.

## A little non-statistical analysis
We can analyze the above chart without even considering statistics. I'm going to be mostly just descriptive. The first thing to note is that the most popular “candidate”, by far, for the last 44 years has been nobody. Since at least 1980, there has never been a president elected by the majority of Americans eligible to vote. Let us wring hands. Hand wringing time is over. While it’s true that “get out of my face” has usually been the most popular selection, it’s equally true that this option has been chosen less and less as time has gone on. Whatever is wrong with America, more Americans have been getting out to vote. True enough, 2024 was another election of apathy, but it’s hard to call the future based on a single-point deflection. The overall trend has been for non-voting to drop by a little less than 1 percentage point every election. The Democrats and Republicans are basically in a dead heat in dividing this gain up between themselves. The overall Democrat gain since 1980 has been greater than the overall Republican gain, but currently they are indistinguishable. On the other hand, the slow increase in voting doesn’t seem to have been distributed to “Other” options.

Another thing to note is that the Republicans and Democrats seem to have switched performances from 1980 to the present. From 1980 to 2004, Republican performance fluctuated wildly from year to year. The Democrats, on the other hand, were experiencing very slow but very regular gains until 2000. From these inflection points to the present day, this has switched. The Democrats now have wild swings while the Republicans' oscillations have calmed down (although with more variation than the early-period Democrats showed). How to explain this or what it means for the future I leave to more knowledgeable people. What I can say is that it looks like both parties experienced a significant sea change roughly 2000-2004, and it is having long-lasting effects on each of them. What these changes were, I can’t even speculate. Feel free.

### Calling the Horse Races
Let's dive deeper and look at each election in turn. Reagan’s tenure was marked by moving from strong to stronger. Unusually for  incumbents, he gained percentage in his re-election bid. However, this was not at the expense of Mondale, who also got a (very tiny) gain vs. Carter's performance. Instead, Reagan may have attracted some non-voters and a large portion of people who had voted for Anderson (who?). Poppy Bush’s election was an exercise in coat-tail riding. The nonvoters bounced back up, and Dukakis got another (very tiny) gain over the Democrats’ share of the previous election,

Indeed, from 1980 to 2000 “a very tiny gain” could accurately describe the Democratic party’s entire presidential performance, even in years they took the White House. In every election, it was a very small improvement in attracting a proportion of the voting-eligible population, while the Republicans see-sawed up and down in popularity. For example, Clinton took office in a tidal wave of spoilage. Ross Perot’s leap seems to have been fueled by some Republican-leaning voters and bigger chunk of people who otherwise wouldn’t have bothered to vote at all. But Perot was a gadfly candidate. Bush was the anointed leader of the Republican party, so there was a squabble on that side of the political spectrum. Clinton rode in on this disunity.

Perot’s huge-for-a-third-dude showing cannot be explained only by attracting nonvoters. In 1998, there was a giant leap in apathy, which probably hurt Dole more than it hurt Clinton. However, Clinton didn’t experience a spike in popularity. He managed to do a bit better than last time. This set the stage for 2000.

This was an extremely close election. What is interesting is that there was very little apparent motivation of the nonvoters. It ended up in a mess. Bush’s re-election was more definite and it was accompanied by the Democratic party starting to improve more quickly its slug-like crawl upwards since 1980. However, Bush was able to attract more of the nonvoters (and possibly Other voters) than did Kerry. What is interesting is that this election saw a big drop in nonvoters, which goes against the general trend of an increase in apathy or at very best a small decrease during a re-election attempt. However, these voters didn’t turn out for Kerry.

In 2008, Obama ended up continuing a straight-line trend started by the prior two failed Democrat attempts. There was a small tend downwards in nonvoters, but the Republicans lost share as well, indicating some possible aisle-crossing by voters. Obama’s second turn was a typical successful re-election, with more people choosing to sit it out, but not enough to force a loss. The Republicans were still experiencing the Bush 2-inspired decline.

2016 was another close election. It’s well-documented that Trump did not achieve the total popular majority of votes cast. The weighted means still reflect this. This is not anomalous. It is because weighted means can "distribute" proportional Electoral outcomes within each state as well as fractions. What we do see is that Gary Johnson had a bit of a spike, which may have been acquired from a mixture of nonvoters and both Democrat and Republican leaning voters.

The Biden election is illustrative of what might be an important trend. Once again, the number of nonvoters fell (*to its lowest level*), and the primary beneficiary seems to have been Biden. It was the first time in the period I looked at that an actual candidate managed to get more popular votes than “this election isn’t worth voting in”!

Finally, Iit should be noted that the 2024 results are based in part on projections and estimates and ought not be over-interpreted. However, the biggest winner is “Nobody is worth my vote.” Apathy and disengagement won again, with one of its biggest spikes since 1996. However, this time, the Republicans weren’t riding a massive bottom-out trend. If the projections are to be believed, Trump also lost share from his previous attempt. He merely lost less than did Harris vs. Biden's performance. Trump managed to get less of the voting-eligible public than he did in 2020. Unfortunately, Harris managed to do even worse. I can’t explain why. I won’t try to explain why. I can only show the what. The why requires more in-depth knowledge than I have.


## Modeling

We could stop right there, but I looked at the chart and wondered if there were any possibly verifiable trends in what I think I saw. That's when I pulled out the formal modeling tools.

The data was repeated measurements (by state and District of Columbia) of 12 presidential elections. When I see repeated measures, I like to use a mixed-level model. If you want to know more, [you can start here](https://vsni.co.uk/blogs/mixed-models-for-repeated-measures-and-longitudinal-data/). The data is count data. Yes, I presented it as frequencies, but a frequency is a count divided by a reference count. The preferred way to analyze count data is with a Poisson distribtion. However, the Poisson distribution presumes that the mean and the variance of the data are the same, or dispersion = 1, to use jargon. A dispersion over 1 is overdispersed, and under 1 is underdispersed. Since I did not yet know if the data were over or under-dispersed, I chose R's glmmTMB package, since it has options that can handle either eventuality. I started with a first pass model that used the Poisson family.

```{r}
options(contrasts = rep("contr.Sum",2))
start.mod <- glmmTMB(Votes ~ Selection * Year + (1 | State) + offset(log(VEP)), weights = Electoral.Votes, family = poisson, data = Vote.dat)
```
I got a warning. It's often due to differences in variable scale. The offending variable is probably Year, which can be fixed.
```{r}
Vote.dat$Year.s <- scale(Vote.dat$Year)
start.mod <- glmmTMB(Votes ~ Selection * Year.s + (1 | State) + offset(log(VEP)), weights = Electoral.Votes, family = poisson, data = Vote.dat)
```

And I had to check the dispersion.

```{r}
check_overdispersion(start.mod)
```

Yes, definitely overdispersed. There are fancy ways to handle this, like the Conway-Maxwell-Poisson family, but the current implementation of this family can be slower than a geologic age to run. Fortunately, the reliable old negative binomial family will handle overdispersion. Since the sample had over 2000 entries, I felt comfortable exploring different random effects *vis* the states. I compared models that included random intercepts and random slopes for both Selection and Year, with and without correlated intercepts.

```{r}
nb.mod1 <- glmmTMB(Votes ~ Selection * Year.s + (1 | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod2 <- glmmTMB(Votes ~ Selection * Year.s + (Selection | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod3 <- glmmTMB(Votes ~ Selection * Year.s + (Year.s | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod4 <- glmmTMB(Votes ~ Selection * Year.s + (Selection + Year.s | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod5 <- glmmTMB(Votes ~ Selection * Year.s + (Selection * Year.s | State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod6 <- glmmTMB(Votes ~ Selection * Year.s + (Selection || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod7 <- glmmTMB(Votes ~ Selection * Year.s + (Year.s || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod8 <- glmmTMB(Votes ~ Selection * Year.s + (Selection + Year.s || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
nb.mod9 <- glmmTMB(Votes ~ Selection * Year.s + (Selection * Year.s || State) + offset(log(VEP)), weights = Electoral.Votes, family = nbinom2, data = Vote.dat)
```

I compared models that included random intercepts and random slopes for both Selection and Year, with and without correlated intercepts. Different models were compared by second-order Akaike information criterion . I admit, doing this is not rigorous. There's controversy over whether AICc can be calculated at all for a mixed model, but I’m not being paid for this, so I wasn’t going to take the time to do more in-depth model comparisons.

```{r}
Alist <- c(AICc(nb.mod1),
           AICc(nb.mod2),
           AICc(nb.mod3),
           AICc(nb.mod4),
           AICc(nb.mod5),
           AICc(nb.mod6),
           AICc(nb.mod7),
           AICc(nb.mod8),
           AICc(nb.mod9))
Alist
```

The model with the lowest AICc is the 9th model, so that's what I adopted. You may have noticed a few lines back that I set the contrasts to "contr.Sum". These are simple sum-to-zero contrasts, aka effect coding. R's default is dummy coding or treatment contrasts, which are fine if you're not worrying about analyzing interactions. We have an interaction, so I'm using effect coding.

```{r}
Vote.model <- nb.mod9
summary(Vote.model)
r2(Vote.model)
Anova(Vote.model, type = "III")
```

Don't spend much time trying to interpret coefficients. I'm going to  estimate marginal means and trends, which are much easier to interpret. Marginal means (trends for slopes) estimate based on the model and  take into account the  effects of all parts of the model on a response. They summares what the model is saying in a way that is more intuitive than attempting to puzzle out multiple coefficients. If you want to really go down a rabbit hole, there's a [very complete description](https://www.andrewheiss.com/blog/2022/05/20/marginalia/) you can read elsewhere.

If you look at the *R*^2^ (Nakagawa's *R*^2^), this model isn't too bad a fit. Marginal *R*^2^ estimates the total contribution of the fixed effects, conditional includes the random effects (individual state effects).

I'm not interested in comparing outcomes for a single given election. That's not very informative from this high up in the clouds. Comparing the *trends*, on the other hand, interests me.

```{r}
Vote.emt <- emtrends(Vote.model, ~Selection:Year.s, "Year.s")
Vote.emt
```
### Visualizing
What does this tell us? We can see that this trend is increasing over the 1980-2024 period for Democrats, decreasing for non-voting, and votes for "Other" candidates, and weakly increasing over time for Republicans. What is interesting is that the confidence limits for the Republican trend are negative and positive, suggesting that the overall trend may not significantly differ from zero for this period. But words say so much less than a picture. First, we have to generate some points from the model.

```{r}
Vote.emm <- emmeans(Vote.model, ~Selection:Year.s, type = "response", offset = log(1), at = 
                      list(Year.s = as.numeric(levels(factor(Vote.dat$Year.s)))))
Vote.preds <- data.frame(Vote.emm) %>%
  mutate(Year = (Year.s * attr(Vote.dat$Year.s, "scaled:scale") + attr(Vote.dat$Year.s, "scaled:center")))
Vote.preds
```
```{r}
model.plot <- ggplot(Vote.preds, aes(x = Year, y = response, group = Selection, color = Selection)) + 
  geom_line(linewidth = 1) +
  scale_color_manual(values = c("#4477AA", "#228833", "#CCBB44", "#EE6677"),
                     limits = c("Democrat", "Other", "Nonvoting", "Republican")) +
  geom_ribbon(aes(ymin = asymp.LCL, ymax = asymp.UCL), alpha = 0.05) +
  theme(panel.background = element_rect(fill = "white"),
        axis.line = element_line(size = 1, color = "black"),
        legend.position = c(0.8, 0.3)) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(breaks = c(1980, 1984, 1988, 1992, 1996, 2000, 2004, 2008, 2012, 2016, 2020, 2024)) +
  ylab("Frequency (per VEP)")

model.plot
```

The ribbons around each line are the 95% confidence intervals. The trends look like they differ from each other, but is that real?

```{r}
pairs(Vote.emt)
```
If we look at the adjusted p values for every pairwise comparison of trends, yes. Even adjusting for all multiple comparisons, every trend differs from every other trend with a p of < 0.05. How can this be interpreted? Typically, people say "significant" or "not significant" and call it a day. There's another interpretation of the p value, and that's the probability of the observed difference occuring by random chance *if the difference did not actually exist*. For example, if there really were no difference between Republican and Democrat performance, according to this model, we would expect to still see the difference we estimated about 2.8% of the time.

So, we can generally infer that the differences we think we see are probably real. However, are they meaningful? The p value can't tell us how *large* a difference is. It can only tell us how *likely* it is. Fortunately, standardized effect sizes exist to explore this question.

```{r}
Vote.eff <- eff_size(Vote.emt, sigma = sigma(Vote.model), edf = df.residual(Vote.model))
Vote.eff
```

These effect sizes are Cohen's *d*, which boils down to how far away the trends are from each other, adjusted by how uncertain the estimates of the trends are. The numbers aren't very informative by themselves. Fortunately has been extensive work to define meaningful categories of effect sizes for political science. I think that the categories proposed by [Erik Ghneer Larsen](https://erikgahner.dk/2022/effect-sizes-in-political-science/) are reasonable. However, he presented them in relation to Pearson's r. Fortunately, it is possible to convert between the two. Unfortunately, the standard conversion of d to r is biased to produce low outcomes, because it converts to the point-biserial r and presumes that d was estimated with a binomial predictor. Year.s is a continuous predictor. Therefore, I converted d to the probability of superiority common language effect size and then converted this to r using the formula found in Dunlap, 1994.

```{r}
Vote.d_to_r <- data.frame(summary(Vote.eff)$contrast, summary(Vote.eff)$effect.size,
                          d_to_p_superiority(summary(Vote.eff)$effect.size),
                          sin(((d_to_p_superiority(summary(Vote.eff)$effect.size)) -.5) * pi))
colnames(Vote.d_to_r) <- c("Contrast", "d", "CLES", "r")


Vote.d_to_r <- Vote.d_to_r %>% mutate(Larsen = case_when(
  abs(r) < 0.05 ~ "Less than Small Effect",
  abs(r) >= 0.05 & abs(r) < 0.15 ~ "Small Effect",
  abs(r) >= 0.15 & abs(r) < 0.30 ~ "Medium Effect",
  abs(r) >= 0.30 ~ "Large Effect"
))

Vote.d_to_r
```
CLES is the "Common Language Effect Size" or "probability of superiority". This is an estimate of the probability (between 0 and 1) that, for a given trait, a member of one group will be larger than a member of the other group. A CLES of 0.5 means the groups are identical. As you can see, none of the differences are large.

## What do we take home?
If you've made it this far, you really like chopping through the weeds, but what have we learned. In addition to the race-by-race non-statistical analysis, we can say with some certainty that, since 1980, the Democratic party has been steadily gaining among the voting-eligible population, while Republican gains have been slower. This difference is likely to be real, but it's not at all large. What is also interesting is that the proportion of people just sitting it out has been slowly decreasing since 1980. Americans have been becoming more engaged.

What about the "Other"? I'll freely admit that this category fluctuated too wildly to make any deep conclusions. Ross Perot was a major influence, but the Perot spike has never been repeated.

## The Future
Short-term, there is no way to predict 2028, certainly not right now. Farther into the future, if the trends I mapped continue, although there is no way to say they must, the Republican party's gains would be outstripped by the Democrats as more Americans choose to participate in presidential elections. I'm goint to speculate that I'm not the first person to do an analysis like this. I've no doubt that professional strategists have already been looking at these trends. This could explain the willingness to adopt severe measures that seems to have driven the Republican party in recent years. If the current paradigm is not destroyed, the Republicans may become irrelevant.
