Motivation and Background

The election landscape has shifted rapidly over the past few election cycles. One element of this is the increasing role that the Hispanic population plays in American politics. According to the Pew Research center, Hispanics have become “the largest racial or ethnic minority group in the electorate,” for the 2020 presidential election, putting them ahead of African and Asian Americans (Cilluffo and Fry, 2019). This has large implications for the future of the US political climate, with Hispanics yielding a substantial amount of power over the election cycle (Bell, 2016). It is therefore important to know how Hispanics tend to vote, and whether or not the voting behavior expressed by one generation of Hispanic voters is the same as their children and grandchildren. For this project, I would like to analyze the voter registration files of Miami-Dade county, Florida. I have chosen Miami-Dade county as it has a large Hispanic population (65%), the largest subset of which is Cubans (34.3%), (Census Bureau, 2010). Cubans are an interesting case, as unlike other Hispanic ethnic groups, they tend to register as Republican. First generation Cuban immigrants who came to the US in the 50s and 60s may have been fleeing Fidel Castro’s rise to power and the subsequent communist takeover in their county, which has influenced their right-wing voter choices (Bishin and Klofstad, 2011). As Cubans make up such a large portion of the Miami-Dade voter file, their turnout can have an impact on the outcome of election cycles. What would be interesting then to analyze is how strong the transfer of partisanship is between different generations of Cuban Americans. Young people as a whole are registering in increasing numbers an independents or democrats. If this trend holds up in the Cuban community as well, this can lead to major implications in political atmosphere of Miami-Dade county

Objectives

  1. To determine if there is a relationship between country of origin and voting behavior for Cuban Americans, and if so, is this partisanship transferable to newer generations who have grown up in the United States
  2. Determine if this transfer of partisanship is stronger for Cubans than other Hispanic Ethnic groups (for the purposes of this project, I will be comparing them to Puerto Ricans)
  3. Determine if this transfer of partisanship is stronger for Cuban households than the rest of Miami-Dade County
  4. Determine if one political party has a higher transfer of partisanship than others, comparing these rates with country of origin

Hypothesis

  1. Cuban households will have an overall stronger rate of partisan transfer than the rest of Miami-Dade (higher homogeneity)
  2. The Latino households I will be studying, Cuban, Puerto Rican (PR), and Cuban-Puerto Rican (PRCuba), will have a stronger transfer of partisanship than the rest of Miami Dade (“Other”)
  3. For Cuban households, the rate of transfer for the Repubican Party may be lower than expected due to younger voters moving towards Non-Party Affliation and the Democratic Party

Data Source

Methodology

Test 1 - Homogeneous vs. Heterogeneous Households

To begin, open Tidyverse and read in my files from excel (there are 4 total)

library(tidyverse)
## -- Attaching packages -------------------------------------------------------------------------------- tidyverse 1.2.1 --
## v ggplot2 3.2.1     v purrr   0.3.2
## v tibble  2.1.3     v dplyr   0.8.3
## v tidyr   0.8.3     v stringr 1.4.0
## v readr   1.3.1     v forcats 0.4.0
## -- Conflicts ----------------------------------------------------------------------------------- tidyverse_conflicts() --
## x dplyr::filter() masks stats::filter()
## x dplyr::lag()    masks stats::lag()
library(readxl)
DAD_Cuba <- read_excel("DAD_Other.xlsx")
## Warning in read_fun(path = enc2native(normalizePath(path)), sheet_i =
## sheet, : Expecting numeric in R19485 / R19485C18: got 'k'
library(readxl)
DAD_PR <- read_excel("DAD_PR.xlsx")
library(readxl)
DAD_PRCuba <- read_excel("DAD_PRCuba.xlsx")
library(readxl)
DAD_Other <- read_excel("DAD_Other.xlsx")
## Warning in read_fun(path = enc2native(normalizePath(path)), sheet_i =
## sheet, : Expecting numeric in R19485 / R19485C18: got 'k'

Next, I used the “count” to get a count of the total 0’s and 1’s I’ve assigned to each household for each country of origin. * 0’s are homogeneous households, meaning everyone in the household is registered to the same party * 1’s are households in which at least one member is not registered to the same party * This test is used to determine the level of homogeneity among the households of each country of origin for comparison purposes

count(DAD_Cuba, Cat1)
## # A tibble: 3 x 2
##    Cat1     n
##   <dbl> <int>
## 1     0 11692
## 2     1 13946
## 3    NA 57566
count(DAD_Other, Cat1)
## # A tibble: 3 x 2
##    Cat1     n
##   <dbl> <int>
## 1     0 11692
## 2     1 13946
## 3    NA 57566
count(DAD_PR, Cat1)
## # A tibble: 3 x 2
##    Cat1     n
##   <dbl> <int>
## 1     0   296
## 2     1   481
## 3    NA  1743
count(DAD_PRCuba, Cat1)
## # A tibble: 3 x 2
##    Cat1     n
##   <dbl> <int>
## 1     0    77
## 2     1    95
## 3    NA   384

Stray Note: I assigned these numbers in excel. Since I only assigned one number per household, the NA’s come from the other two or more members that didn’t have a number assigned to them. I’m going by household, not by person. So, for a three person homogeneous household, they would have a 0, followed by two NA’s.

Continuing on - I calculated the proportions of 0’s and 1’s and wrote them in excel before reading the file into R.

library(readxl)
DAD_Cat1 <- read_excel("DAD_Cat1.xlsx")

Here’s the table:

*

Graph:

ggplot(data=DAD_Cat1, aes(x = Country_of_Origin, y = Proportion, fill = Cat1)) + geom_col(position = "dodge")

Takeaways

  • All households (Other, Cuban, Puerto Rican, Cuban-Puerto Rican) have a higher proportion of heterogeneous households than homogeneous
  • Puerto Rican households have the highest difference in proportions – 23.8% more heterogeneous households than homogeneous
  • The others have the lowest difference in proportions – 8.8%
  • Cuban and Cuban-Puerto Rican proportions are very similar – 11.4% and 10.4%, respectively
  • Conclusion: Cuban households are more homogeneous than Puerto Rican households, but less so for Cuban-Puerto Rican households and others

Chi-Squared Test

cat1cs <- matrix(c(11691,13947, 3935,4948, 296,481, 77,95), byrow=F, ncol=4)
colnames(cat1cs) <- c("Other","Cuba","PR","PRCuba")
rownames(cat1cs) <- c("Homogeneous", "Heterogeneous")
cat1cs
##               Other Cuba  PR PRCuba
## Homogeneous   11691 3935 296     77
## Heterogeneous 13947 4948 481     95
chisq.test(cat1cs)
## 
##  Pearson's Chi-squared test
## 
## data:  cat1cs
## X-squared = 20.303, df = 3, p-value = 0.0001469

The P-value suggests that there is an extremely low probability that the relationship between country of origin and level of homogeneity is by chance

Test 2 - Full vs. Partial vs. No Partisan Transfer

Table:

Graph

Takeaways:

  • All households (Other, Cuban, Puerto Rican, Cuban-Puerto Rican) have a higher proportion of full transfer than partial or no transfer
  • Cuban-Puerto Rican households have the highest proportion of no transfers, at 37.6%
  • The others have the highest proportion of full transfers at 55.2%, followed by Cuba at 51.8%
  • The others have the highest discrepancy between categories –there are 32.7% more full transfers than partial transfers, and 32.9% more full transfers than no transfers

Chi-Squared Test

cat3cs <- matrix(c(11691,4765,4723,3934,1488,2172,296,157,187,78,25,62), byrow=F, ncol=4)
colnames(cat3cs) <- c("Other","Cuba","PR","PRCuba")
rownames(cat3cs) <- c("Full_Transfer","Partial_Transfer","No_Transfer")
cat3cs
##                  Other Cuba  PR PRCuba
## Full_Transfer    11691 3934 296     78
## Partial_Transfer  4765 1488 157     25
## No_Transfer       4723 2172 187     62
chisq.test(cat3cs)
## 
##  Pearson's Chi-squared test
## 
## data:  cat3cs
## X-squared = 159.53, df = 6, p-value < 2.2e-16

The P-value suggests that there is an extremely low probability that the relationship between country of origin and level of partisan transfer is by chance

Test 3 - Partisan Transfer by Party

To start, I looked at Miami-Dade county by country of origin and by age to get an overview of the demographic breakdown of the parties (Note: these graphs aren’t representative of the entirety of Miami-Dade, only the dataset I am working with - meaning this is Miami-Dade when only looking at 3-7 person households consisting of people with the same last name).

Graph: Miami-Dade by Country of Origin

Graph: Miami-Dade by Age

I also broke down country of origin by age:

Graph: Cuba by Age

Graph: Puerto Rico by Age

Graph: Cuba-Puerto Rico by Age

Graph: Others by Age

Takeaways:

  • Unsurprisingly, Cuban households have a large amount of Republicans, though it seems to be much more prominent among older voters. Cuban-Puerto Ricans are showing similar results. Puerto Ricans seem to be more spread apart. Others lean heavily democratic. For all graphs, NPA is much more prominent among younger voters.
  • Note: There was a 5th age category which covered people under 18 and over 105. I took that out as there were very few people in this category. Also, I took out everyone who identified with a party that was not DEM, REP, or NPA. I categorized “IND” the same as NPA.

Household- Party Combinations

  • All Disclaimers I have stated in test two above apply here. This is only considering three person households.
  • Code Key:
  1. All REP household
  2. All DEM household
  3. All NPA household
  4. REP parents, at least one DEM child
  5. REP parents, at least one NPA child
  6. REP parents, children are DEM and NPA
  7. DEM parents, at least one REP child
  8. DEM parents, at least one NPA child
  9. DEM parents, children are REP and NPA
  10. NPA parents, at least one REP child
  11. NPA parents, at least one DEM child
  12. NPA parents, children are DEM and REP
  13. Parents differ, all children are the party of at least one of their parents
  14. Parents differ, at least one child is of a different party
  15. Un-categorized

Graph: Cuban Household Combinations

Graph: Puerto Rican Household Combinations

Graph: Cuban-Puerto Rican Household Combinations

Graph: Others Household Combinations

Stray Notes:

  • I adjusted the y-axis of the Puerto Rican Households to match the other three.
  • Cuban Households most common combinations: 1 and 5
  • Puerto Rican Households most common combinations: 2 and 10
  • Cuban- Puerto Rican Households most common combinations: 1 and 5
  • Others Households most common combinations: 1 and 2

Proportion of Party Transfer

Table:

Take Cuban Republicans as an example. Of all of the Cuban Households where the parents are identified as Repubican, roughly 61.8% of these households have children that also identify as Republican.

Graph: Proportion of Party Transfer

Stray Notes:

  • This test only accounts for Full Transfer households.
  • Cubans have the highest rate of Republican transfer. Others have the highest rate of democratic transfer. Puerto Ricans have the highest rate of NPA transfer.
  • Interestingly, DEM, REP, and NPA transfer in Cuban households seems to be pretty close in proportion, indicating that REP households aren’t as dominant as one would think.

Conclusion

Works Cited

Bell, Aaron. “The Role of the Latino Vote in the 2016 Elections.” SSRN Electronic Journal, vol. 13, May 2016.

Bishin, Benjamin G., and Casey A. Klofstad. “The Political Incorporation of Cuban Americans.” Political Research Quarterly, vol. 65, no. 3, 2011, pp. 586–599.

Cilluffo, Anthony, and Richard Fry. “An Early Look at the 2020 Electorate.” Pew Research Center’s Social & Demographic Trends Project, 30 Jan. 2019, https://www.pewsocialtrends.org/essay/an-early-look-at-the-2020-electorate/.

Data Access and Dissemination Systems (DADS). “American FactFinder - Results.” United States Census Bureau, 5 Oct. 2010, https://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?src=CF.

Escaleras, Monica, et al. “You Are Who You Think You Are: Linked Fate and Vote Choices among Latino Voters.” Wiley Online Library, John Wiley & Sons, Ltd (10.1111), 30 Aug. 2019, https://onlinelibrary.wiley.com/doi/abs/10.1111/polp.12329?af=R.

Data Collected by Daniel A. Smith