WordClouds

Trump’s Twitter Word Cloud

Debate Word Cloud

Biden’s Twitter Word Cloud

Intro

Trump’s Support Rate

Comparisons between different approval-ratings polls aren’t always apples to apples. Some polls are surveying adults regardless of their voting status; others are polling registered voters. So three pairs of graphs were created with respect to “All polls”, “Adults” and Voters.

As we can see the general trends of the three categories of people are similar. 2017 is the “jumping down year” for Trump. Maybe many people who selected him got disappointed. But the approving rate went back (though not as high as when he got elected) and didn’t fluctuate a lot during the years (but never surpassed 50%). In the beginning of 2020 it even reached a little peak, but soon got beaten down (Maybe by Covid?). However, even with Covid, his approving rate didn’t fall as below as 2017. Is possible that some people who were first disappointed by him later realized that he was “the best among the worst”?

Also note that Trump won 46.9% of total votes in 2020. This rate is a bit higher than what is estimated in here. This implies a slight bias of the surveys. Maybe some Trump supporters would really lie. As another Democratic candidate Andrew Yang said, “We are not gonna beat Trump. We are gonna beat the problems that got Trump elected.” As the data predict, Trump’s supporting rate has a tendency of going up in the future. There are many problems for Biden to solve.

Trump’s supporting rate over time

Map Visualization

2020 Poll Map


We got poll data from https://www.electoral-vote.com/evp2020/Info/data.html. It has 1025 poll results for each state by different date. In map visualization, we used average poll result for each state, i.e., used average supporting rate for Trump or Biden in each state.We made average supporting rate for Trump or Biden in each state. For this data, “Demo” is supporting rate for Democratic party and “GOP” means supporting rate for Republican party. “Demo-GOP” means \(\frac{Demo-GOP}{Demo+GOP}\). We use “Demo-GOP” for map visualization.We modified our data with US map information in order to make map visualization.In the map, size of each state represents its population. So the map is different from true map of the USA. For example, California, New York, Texas, and Florida are much bigger in size than their real size in our map. The color represents strength of supporting rate for two parties. The deeper the blue color, the Democratic party is overwhelming. Similarly, the deeper the red color, the Republican party is overwhelming. Colors near white represent that both parties have similar supporting rate, which means states with nearly white colors are swing-states. The advantage of this map is that one can see the strength of each parties supporting rate and population size in each state, figuring out which states are important in the election. For example, Texas, Florida, Georgia, North Carolina, and Ohio are nearly white, which means they are swing-states. Also, their sizes are bigger than the other states, which means they have lots of population. So one can conclude that these four states are very important states in the 2020 election. Also, one can see that supporting rates for Democratic party are mostly higher than that of Republican party. Next, we will see the difference between election result and poll result.

2020 Election Results


  1. 2020 Election We imported 2020 election data. Now we do the same job as we did for the poll data. The election result is quite different from the poll result. Although Biden won the election, overall, Republican party got more vote than expected by the poll result. Especially, In Florida, the vote result is entirely differnt from the poll result : Trump won in the Florida. We now compare poll result and election result for 2020 in one map.

Election vs Poll


  1. 2020 : Poll vs Election We calculated the difference between election result and mean poll result. Now we will make a map with it. In the data, one can see that Biden got less vote than expected in the polls. Above map is the comparing poll result and election result in 2020. Overall, although Trump lost the election, he got much more vote in almost all states than polls expected. This means that many people are not willing to be honest in the poll. Now we will compare the election result for 2016 and 2020.

2016 election results


  1. 2016 Election We imported 2016 election result. Now we will make map from it as we did before. Note that Dark color in West Virginia is due to its extraordinary high supporting rate for Trump. One can see that Trump lost his support from many states in 2020 comparing to 2016. Now we compare election result for 2020 and 2016 in one map.

2016 vs 2020


  1. Election : 2020 vs 2016 We calculated the difference between election result in 2020 and that in 2016. One can see that Democratic party gianed more vote in 2020 than 2016. One can see that in most states, Democratic party gained more vote in 2020 than 2016.

Correspondence analysis for Poll by Demographics

Race & Gender vs. Canidate


Now we use correspondance analysis for poll by demographics. Correspondence analysis is an effective multivariate analysis method to visualize \(I \times J\) contingency table. We got our poll by demographics data from https://www.pewresearch.org/politics/2020/10/09/the-trump-biden-presidential-contest/. One can see that there are three categories : Trump or Lean Trump, Biden or Lean Biden, and Others. We will do correspondence analysis with this data. In correspondence analysis, Dimension 1 is the dominant dimension. If Dimension 1 is much more dominant than Dimension 2, which is our data, one can say that when \(i\)th row is \(j\)th column is close in Dimension 1, they occur together very much. This correspondence analysis is for race & gender vs supporting candidate. One can see that White people are more likely to support Trump but the other races are more likely to support Biden. Also, Womens are more likely to support Biden than men but it is less important than race factor.

Party vs. Canidate


Supporting party vs supporting candidate: One can see that republicans are more likely to support Trump and Democratics are more likely to support Biden. Also, moderate people are less supportive for candidate in their party.

Marriage & gender vs. Canidate


Marriage & gender vs supporting candidate: One can see that married people are more liklely to support Trump than Biden. Also, Women are more likely to support Biden than Trump. Marriage is more dominant than gender for choosing supporting candidate.

Age vs. Canidate


Age vs supporting candidate: As people gets older, they are more likely to support Trump. But young people (18-29) have little bit more tendency to choose no candidates comparing the other age groups.

Region & gender vs. Canidate


Region & gender vs supporting candidate: One can see that people living in more urban area are more likely to support Biden. Also, Women are more likely to support Biden. Region is more dominant than geneder for choosing supporting candidnate.

Education level vs. Canidate


Education level vs supporting candidate: One can see that as education level gets higher, people are more likely to support Biden. Especially, people with postgrad education are much more likely to support Biden.

Income vs. Canidate


Income vs supporting candidate:

Religion vs. Canidate


Religion vs supporting candidate: