Overview

Recent news reports dailykos and forbes are using polling data from April’s Emerson poll to claim that Sanders supporters are choosing Donald trump over other Democratic candidates. In particular, these articles are focusing on Sanders voters voting for Trump over Elizabeth Warren (26% of Sanders voters prefer Trump to Warren)

The poll identifies who you support in the Primary(first choice), it then asks who you would vote for in a Trump versus every other candidate scenario. 26% of Sanders voters voting against Warren, is a very strange polling outcome. Warren and Sanders are likely the two closest to each other in terms of ideologyadd link. Such data could be indicative of sexism, or some sort of Bernie bro phenomenon. However I felt as though this polling data needed some exploration before making such claims. Emerson has conducted several polls asking similar questions, and I have read in the polls for Feb,March and April. My plan is to find out how the distributions of these Trump versus other candidates vary across these surveys and across different first choices.

Is the data trustworthy

  • Here’s my first issue with the polling data. Below I print a df of all 356 people identified in the April survey as democratic primary voters. All 356 gave answers to every single question on the survey.
Candidate Vs biden Vs beto Vs bernie Vs kamala Vs Warren
3 Beto O’Rourke 29 29 29 29 29
4 Pete Buttigieg 32 32 32 32 32
5 Bernie Sanders 103 103 103 103 103
6 Elizabeth Warren 24 24 24 24 24
7 Joe Biden 85 85 85 85 85
8 Kamala Harris 27 27 27 27 27
9 Amy Klobuchar 4 4 4 4 4
10 Mike Gravel 3 3 3 3 3
11 Kirsten Gillibrand 0 0 0 0 0
12 Tulsi Gabbard 3 3 3 3 3
13 Cory Booker 6 6 6 6 6
14 Andrew Yang 10 10 10 10 10
15 John Delaney 2 2 2 2 2
16 Jay Inslee 1 1 1 1 1
17 John Hickenlooper 1 1 1 1 1
18 Marianne Williamson NA NA NA NA NA
19 Julian Castro 12 12 12 12 12
20 Tim Ryan 2 2 2 2 2
21 Wayne Messam 1 1 1 1 1
22 Eric Swalwell 4 4 4 4 4
23 Someone else 7 7 7 7 7
24 Total 356 356 356 356 356
  • This seems very strange to me.
    • The goal of the survey is clearly to get nation polling averages. I would think that the company would not discard incomplete survey data which at least showed national polling average choices, but it would appear they have discarded all incomplete observations. I have not looked through their methodology as of yet
    • Assuming no data misentry or manipulation, accepting only surveys in which individuals answer every question, throws some bias into the survey data. Most average voters, will not complete an entire survey on the election like this. Voters giving this much attention to all of these questions are not representative of average voters. However, Let’s proceed with the analysis assuming the data is clean

Get distributions of votes for Trump against democratic Opponents

  • Below is bread and butter of what we are looking for from the April survey data
candidate Trump % vs biden Trump % vs beto Trump % vs Bernie Trump % vs Harris Trump % vs Warren Trump % vs Butt
5 Joe Biden 10.3 10.4 13.8 12.6 15.5 13.8
3 Bernie Sanders 15.1 18.3 4.9 16.6 26.1 21.4
1 Beto O’Rourke 2.1 9.3 9.3 14.4 11.6 8.3
2 Pete Buttigieg 4.0 2.6 0.0 8.1 9.3 1.2
6 Kamala Harris 2.2 2.2 2.2 2.2 2.2 2.2
4 Elizabeth Warren 11.9 5.5 5.5 3.7 11.9 3.7

Weird behavior in data

Democratic party voters are choosing Trump over their preferred democratic candidate

  • 10.3% of Biden voters would vote for Trump against Biden
  • 5% of Bernie voters would vote for Trump against Bernie
  • 9.3% of Beto voters would vote Trump versus Beto
  • 2.2% of Kamala voters would vote for Trump versus Kamala
  • 11.9% of Warren voters would vote for Trump against Warren

This is really weird. Biden warren and Beto voters who identify as having those candidates as their first choice, choose Trump over those candidates. Having not looked at other polls, this behavior could be within norms, but none the less its strange and should be highlighted

  • Overall 273 respondents said they would vote for Trump in one on one mathcups, out of a total of 1780, which comes out to 15.3%

  • Overall 84 Bernie supporters said they would vote for Trump in one on one mathcups, out of a total of 520 matchups, which comes out to 16.1%
  • Overall 54 Biden supporters said they would vote for Trump in one on one mathcups, out of a total of 425 matchups, which comes out to 12.7%
  • Overall 9 Warren supporters said they would vote for Trump in one on one mathcups, out of a total of 120 matchups, which comes out to 7.5%, which is odd given 12% warren voters said they would vote against Warren versus Trump.
  • Overall 14 Beto supporters said they would vote for Trump in one on one mathcups, out of a total of 145 matchups, which comes out to 9.6%
  • Overall 5 Kamela supporters said they would vote for Trump in one on one mathcups, out of a total of 140 matchups, which comes out to 3.7 %

What does this Mean

Looking at this data, it appears that overall, Bernie supporters are only slightly more likely to Vote for Trump, than the average of the dataset. However, Bernie supporters do make up twenty seven percent of the dataset. Emerson has released two prior polls which March and FEB. This data need to be cleaned and explored. I am hopeful that doing so, will give us a bigger picture on how much sample size and other factors played into these distributions and outcomes. Part 2 will explore these 2 datasets, and part 3 will attempt to bridge the gap and explore what we can learn from combinging our results form both parts together.