Overview
In part 1 of my analysis on the April Emerson data here I attempted to look into why on average 16% of Bernie Sanders supporters seemed to be unwilling to support other democratic party candidates, in particular 26 % voted for Trump over Elizabeth Warren. I concluded that blog post ackowledging that more data would be needed to investigate this trend. At the start of this blog post we are going to analyze the May poll data that Emerson has just released here. After we examine these trends and compare them to this months data, I will start to look at interesting demographic conclusions from the May polling data
As a quick note. In my last blog post, I was weery over a lack of disclosure over polling methodology. This month Emerson seems to have addressed these concerns with an addendum addressing their methodology here. This sets up an oppurtunity to test for differences in results between landline and online polling percantages. The sample size may turn out to be too small, but perhaps someone wants to take that project on.
Trump Vote given matchups
- Rows represent how often someone who picks the candidate on the left, will vote for trump in other matchups
Joe Biden |
2.7 |
5.4 |
4.0 |
7.0 |
6.8 |
8.7 |
Bernie Sanders |
9.1 |
13.6 |
4.3 |
10.2 |
11.0 |
11.3 |
Kamala Harris |
11.4 |
11.4 |
11.4 |
17.2 |
17.2 |
11.4 |
Pete Buttigieg |
7.9 |
4.6 |
8.6 |
4.6 |
6.0 |
2.2 |
Beto O’Rourke |
9.4 |
9.4 |
9.4 |
23.8 |
23.8 |
23.8 |
Elizabeth Warren |
5.2 |
6.8 |
6.8 |
5.2 |
6.8 |
6.8 |
Average Trump Vote From democrats by first choice candidate:
Kamala Harris |
13.78 |
Pete Buttigieg |
6.25 |
Bernie Sanders |
9.91 |
Elizabeth Warren |
6.19 |
Joe Biden |
5.29 |
Beto O’Rourke |
13.85 |
Michael Bennet |
100.00 |
Andrew Yang |
56.00 |
Mike Gravel |
25.00 |
Cory Booker |
33.33 |
- Overall 246 respondents said they would vote for Trump in one on one mathcups, out of a total of 1780, which comes out to 13.8%
Comparing to April
Joe Biden |
Joe Biden |
10.3 |
10.4 |
13.8 |
12.6 |
15.5 |
13.8 |
Bernie Sanders |
Bernie Sanders |
15.1 |
18.3 |
4.9 |
16.6 |
26.1 |
21.4 |
Beto O’Rourke |
Beto O’Rourke |
2.1 |
9.3 |
9.3 |
14.4 |
11.6 |
8.3 |
Pete Buttigieg |
Pete Buttigieg |
4.0 |
2.6 |
0.0 |
8.1 |
9.3 |
1.2 |
Kamala Harris |
Kamala Harris |
2.2 |
2.2 |
2.2 |
2.2 |
2.2 |
2.2 |
Elizabeth Warren |
Elizabeth Warren |
11.9 |
5.5 |
5.5 |
3.7 |
11.9 |
3.7 |
dealing with full dataset for future reproducibility
Analyzing 2016 Primary voters 2020 preferences
- Grab 2016 primary voters in our Survey Data
- 49% voted for hilary
- 37% voted for bernie
- 39.8% of 2016 primary Bernie voters, choose Bernie as their primary choice in 2020
- Joe biden(21%), Elizabeth Warren( 11% ), Buttigieg(7.1%), Kamala(4.8%)
- 39.7% of Clinton voters are voting for Joe Biden
- Kamala Harris(14.9%),Bernie Sanders (14.9%), Warren (9.1%), Buttigieg(7.8%)
I find it fascniating that nearly the same percantage of voters from 2016 have split off towards Biden and Bernie. Is this an indication of how many voters voted for Bernie out of lack of options? Does this reflect the level of democratic voters which are ideologically grounded into their own camps (I.E establishment candidate versus anti-establishment). Was this an intentional demographic move by the survey analysts? This seems like a big hit to Bernie, and a huge boost to Biden. But this really depends on the causality, which is nearly impossible to discern. I will make sure to focus on this category in future analysis on Emerson Polls.
Clinton |
14.9 |
7.8 |
14.9 |
9.1 |
39.7 |
3.5 |
Sanders |
4.8 |
7.1 |
39.8 |
11.3 |
21.3 |
2.5 |
Total |
10.4 |
7.5 |
24.9 |
9.7 |
32.6 |
2.9 |
Clinton |
0.0 |
0.7 |
1.3 |
1.0 |
0.7 |
2.6 |
Sanders |
1.9 |
1.3 |
0.0 |
0.0 |
0.8 |
0.0 |
Total |
0.7 |
1.1 |
0.9 |
0.6 |
0.6 |
1.8 |
## [1] 73
## [1] 101
Clinton |
6.82 |
Sanders |
12.78 |
Interesting demographics of 2016 primary voters
- Of each candidates voters, incomes over 100k make up 18.4% of bernie voters and 17.1 % of hilary voters
- considering the populist economic message that seems strange
- I will look into how income effects 2020 preferences soon
- Strangely a higher percentage of 2016 Hilary voters would vote for Bernie over Trump(94.9%) than Bernie 2016 voters would vote for Bernie over Trump (91.2%)
- of the 27 voters who voted for someone else in democratic primaries in 2016, 14 are voting for Biden in 2020. Only 1 said they would vote for Bernie(a sign of dislike of Hilary?)
- of the 31 that did not vote in 2016 11/31 said they will vote for bernie in 2020(is this a function of age?)
Analyzing age demographics
- age demogrphics show a battle of Biden versus bernie
- As seen below Bernie(41.2%) has a huge lead in 18-29 with Biden(11.1%) being second
- 30-49 Bernie(29.1%) retains a small lead over Biden(26.4%)
- 50-64 Biden takes a big lead( Biden-42.4%, Bernie 19%)
- 65+ Bernie(10.3%) is no longer 2nd favorite Kamala(12.1%) is with Biden (52.4%) maintaining a commanding lead
18-29 years |
9.2 |
9.0 |
41.2 |
8.7 |
11.1 |
2.8 |
30-49 years |
6.0 |
9.0 |
29.1 |
9.1 |
26.4 |
5.6 |
50-64 years |
14.3 |
4.3 |
19.0 |
11.7 |
42.4 |
1.4 |
65 or more years |
12.1 |
10.3 |
6.6 |
7.3 |
52.4 |
1.4 |
Total |
10.4 |
7.5 |
24.9 |
9.7 |
32.6 |
2.9 |
These demographics seem to make plenty of sense. Bernie’s rise is tied to energizing young voters over a party platform that would produce tremendous change in american politics. Older generations wouldn’t stand to benefit from the massive investments. The implication is obvious. Polls like this have been generated demographic assumptions by age. Below I print out the demographics from this survey
18-29 years |
84 |
21.43 |
30-49 years |
118 |
30.10 |
50-64 years |
129 |
32.91 |
65 or more years |
61 |
15.56 |
Total |
392 |
100.00 |
I beleive this is a fiar representation of the electorate by age. Turnout by age groups will clearly be the key to determining who wins the nomination
Breakdown by Poltiical ideology
Very Liberal |
4.9 |
11.2 |
43.7 |
14.5 |
19.2 |
0.9 |
Somewhat Liberal |
7.4 |
7.3 |
22.2 |
11.0 |
34.8 |
5.0 |
Moderate |
17.3 |
4.6 |
19.4 |
6.2 |
37.8 |
1.7 |
Somewhat Conservative |
23.1 |
5.4 |
5.1 |
3.8 |
36.6 |
0.0 |
Very Conservative |
14.1 |
9.1 |
0.0 |
0.0 |
47.6 |
6.1 |
- Clear ideological divide but strangely enough Joe Biden still captures very liberal(19.2%) and leads in all other ideological categories.
## [1] 87
## [1] 154
## [1] 525
## [1] 892
## [1] 699
Very Liberal |
13.28 |
Somewhat Liberal |
13.22 |
Moderate |
41.02 |
Somewhat Conservative |
80.72 |
Very Conservative |
87.92 |
- we can see that nearly 40% of moderate democratic voters, vote for Trump
- this has some large implications for the general election.
- People will constantly highlight how democrats need to reach out to the moderates. What this shows is that 40% of moderate democratic voters will end up voting for Trump.
- nearly 87% of both somewhat and very liberal voters will vote for a democratic candidate.
- Winning over conservative voters seems rather pointless. total volume of voters matters here
Very Liberal |
98 |
25.19 |
Somewhat Liberal |
144 |
37.02 |
Moderate |
98 |
25.19 |
Somewhat Conservative |
25 |
6.43 |
Very Conservative |
24 |
6.17 |
- So given this dataset, over 60% of voters are in the camp which are able to be won over. Perhaps focus should be on turn out in these swaths of the electorate
landline versus survey data
- The demographics for the landline versus Online data, are vastly different.
- I am unsure about this, but I believe Emerson used the online data to fill in their overall demographics to meet a standard that matches demographics from the 2016 election.
- If this is in fact the case, it shows the inherent problem with landline polling. Below you can see the age demographics for landline and online portions of the data.
- Nearly 80% of the 400 online respondents were under the age of 50.
- Nearly 75% of the landline respondents were over the age of 50
- These samples are terribly biased towards each demographic. While the survey may in fact reach a reasonable representation of the population through combining both datasets, this highlights issues on the landline side.
- How do young voters that are landline polled differ from the young voter population? Why did Emerson have to conduct online polling with such a skewed demographic? Are they possibly admitting that we can’t comfortably obtain landline polling for young voters?
- While Emerson has done a better job at talking about their assumptions, perhaps they need to go even further in the next polling set.
18-29 years |
30.10 |
6.46 |
30-49 years |
51.00 |
16.89 |
50-64 years |
16.92 |
47.68 |
65 or more years |
2.24 |
28.97 |
Total |
100.00 |
100.00 |
Conclusion
As usual ther are plenty of other interesting things to lok into. I still want to look into breakdowns by telephone versus by internet polling conducted by the third party source in this poll. I would like to look more into income and how it effects voting behavior. This survey contained Issue importance ratings, cross tabulating issues with different demographics would be very interesting.