Github link: https://github.com/Zackwin73/SURVMETH-727-Final-Project
HTML link (HTML format is how original paper was formatted and includes the interactive plots.) :https://rpubs.com/ZWin1234/1379033
Trust in politicians has been declining steadily throughout the 2000s, with the United States experiencing record-low levels of public confidence in elected officials to act in citizens’ best interest. Despite this decline, the 2016, 2020, and 2024 general elections saw unusually high voter turnout in the state of Florida. This raises a central question: if overall trust in politicians is consistently falling, why are citizens still turning out to vote at high rates?
This period of time is especially important to examine because it spans several major national events things includes the years immediately before the COVID-19 pandemic, the pandemic itself, and the subsequent recovery. One might expect that declining trust during such crises would lead individuals to disengage from politics rather than participate. Yet turnout increased. This paper investigates what factors may motivate citizens to vote even when trust in political leaders is low and explores what might explain the persistence of electoral participation in the face of widespread distrust.
When you turn on the TV, radio, or social media in the fall, what do you expect to see. Some people watch the return of football, some want to see the best ways to prepare for the incoming cold weather, and others listen to what is happening in the news. In between what you want to be watching, you will see advertisements for politicians trying to win their elections. He or she may grandstands about how they will fight for you and your community’s voice in the nation or state capital and proclaim that they will be your voice. The question that naturally comes next is how much people actually trust their politicians to have them in mind when they finally do get elected.
PEW Research explored this question in their work on voters trust in politicians. Their findings saw a decline in how much confidence the public has in their elected officials. They found that the average level of trust in politicians has lowered over time. Specifically, from 2000 to 2024, their study shows a drop from an average of about 40 percent trust in the early 2000s to around 20 percent on average from 2010 to today. This research combines results from different public polls that ask about trust in politicians and aggregates the ratings. The graph below shows what the PEW data found.
This research would suggest that the average voter has little trust in a politician to have their self-interest in mind. The act of voting has been described in research by political scientists such as V. O. Key as a cost–benefit analysis, requiring people to weigh their reasons for voting against the small gain they receive from actually doing it. There is a process to registering, casting their ballot either as an absentee ballot(mail), voting early, or making time on Election Day to vote, especially when there are no universal laws guaranteeing those access points or paid time off to vote. With that reasoning, it would be reasonable to assume that voting would likely decrease if voters have no trust and the cost of voting is high enough that they would need a strong reason to go out of their way to cast a ballot.
Recent years have shown this is not the case in Florida. When visiting the Florida Board of Elections website, what is shown is that the total vote percentage in Florida has increased in each presidential general election. Voter turnout went from 74.5 to 77.2 to 78.9 percent. I then went further and examined whether these high vote percentages were stable across all counties or if they were concentrated in certain areas. After scraping the Florida Board of Elections website to gather the percent of votes cast in each county, I used that county-level data to examine how this movement was distributed. What is shown below is how that scraped county-level data portrays this change.
(To add context, I include an interactive plot where each point represents a county to show exactly where all 67 counties in Florida fall on the plot. You can double click a county name in the legend to isolate a single county and see the vote percentage it had in each of the three presidential general election years.)
What this graph of all the counties movement shows is that voting percent seems to have a positive trend throughout most counties in Florida. When isolating the movement of each county’s vote percentage, what is found is that fifty-nine counties saw a consistent increase from 2016 to 2024, while seven counties showed a decrease and two remained the same. This suggests that the overall increase in the total number of registered voters casting a vote has pushed statewide turnout into the high seventies over these general elections. This then brings up the question that if voters have low trust in politicians to legislate on their behalf, what is motivating them to go out and vote.
This paper seeks to explore some of the reasons why a voter will go out of their way to cast a vote. I address this by examining three different data sets. First, I explore census data. At the county level, I examine which aspects, such as population size, household income, or other demographic measures, can help explain voting. Second, I look at the number of COVID cases in each county to see whether the largest public health and financial turmoil in recent history had any effect on the rate of change in voting within each county. Finally, I explore media coverage, specifically the type of news the New York Times released about Florida politicians. I examine the frequency of words used and the sentiment behind the articles to see how the media could influence voters perspective on politicians.
First, when examining what could be explanatory of voting, it is important to look at county-level demographics and statistics that could help explain voting patterns. This is because county-level data allows us to better examine trends, since the demographics and lifestyles of voters can differ widely from one county to another. To accomplish this, I used data from the Census to gather county-level information in Florida. Using the Census API, I collected the 5-year ACS data to compare demographic measures and merged them with the data I scraped from the Florida election website. The reason for choosing the 5-year ACS data is that the 2020 and 2024 1-year ACS datasets are not suitable for representative county-level analysis. The 2024 1-year ACS is not accessible, and the 2020 1-year ACS is considered an experimental survey that is not comparable to other 1-year estimates. To address this, I relied on the 5-year ACS and averaged the vote percentages from the Florida election data for the years that fall within that ACS time frame, which are 2016 and 2020. This allows me to examine which demographic factors could influence the increasing vote percentages across counties. This data set focuses on what might motivate a person to vote even when the public, on average, shows low trust in politicians, helping explain why distrust does not necessarily prevent individuals from voting.
To examine what census-level information might help explain why votes are being cast, I ran two multiple regression models to look at what observational effects these demographics can have on voting levels for each county. The reason for running two regression models is to examine two different levels of voting. First, I look at an average vote count level, which uses average turnout as the response, to examine what causes a single vote increase to occur. Next, I look at what would change the overall vote percentage by using average vote percent as the response variable. This examines what factors would increase the total share of registered voters in a county who actually turn out to vote.
These two regressions are used to examine what explains voting at a individual count level and at a percent change. This allows a perceptive at what causes a single vote increase and what causes the total percent increase to go up by one.
| Avg Turnout | Avg Vote Percent | |
|---|---|---|
| (Intercept) | 37764.196247+ | 72.129873*** |
| (18995.935202) | (8.383755) | |
| pop | 1.163236* | -0.000192 |
| (0.549203) | (0.000242) | |
| male | -2.096016** | 0.000063 |
| (0.675225) | (0.000298) | |
| median_age | 255.824689 | 0.281172** |
| (198.326978) | (0.087531) | |
| white | 0.539308+ | 0.000151 |
| (0.278044) | (0.000123) | |
| black | 0.453645+ | 0.000143 |
| (0.269170) | (0.000119) | |
| asian | 0.573134 | 0.000343 |
| (0.583757) | (0.000258) | |
| hispanic | -0.084610+ | 0.000029 |
| (0.049222) | (0.000022) | |
| hh_income | -0.054398 | 0.000251** |
| (0.198230) | (0.000087) | |
| income_pc | -0.552530 | -0.000406+ |
| (0.462000) | (0.000204) | |
| poverty_count | -0.648775*** | -0.000015 |
| (0.161297) | (0.000071) | |
| housing_units | -0.078769 | 0.000037 |
| (0.080567) | (0.000036) | |
| median_home_value | 0.044600 | 0.000052** |
| (0.034737) | (0.000015) | |
| median_rent | -5.513564 | -0.015738** |
| (10.601729) | (0.004679) | |
| edu | -804.739734** | -0.128675 |
| (265.892235) | (0.117350) | |
| Num.Obs. | 66 | 66 |
| R2 | 0.999 | 0.619 |
| R2 Adj. | 0.999 | 0.515 |
| AIC | 1360.4 | 340.6 |
| BIC | 1395.4 | 375.6 |
| Log.Lik. | -664.181 | -154.286 |
| F | 5552.765 | 5.926 |
| RMSE | 5678.36 | 2.51 |
| + p < 0.1, * p < 0.05, ** p < 0.01, *** p < 0.001 | ||
What these regression models suggest is that there are certain demographics that help explain changes in voter turnout both by individual increases and by overall percentage. What is shown is that for the average vote turnout, factors that showed significance in the model were the total population of the county, the number of males, the poverty count, and the amount of education people had in the county. For percent vote, factors such as median home value, median rent cost, household income, and age were all significant predictors of the change in vote percent.
What is shown from these regressions is that the demographics that are significant predictors of voter turnout and the percent change in voter turnout come from different areas. For the individual turnout amount, the predictors are more based on demographics that are harder to control. Gender, total population, poverty, and education levels are very individual level factors. For the predictors of vote percent change, the results show that factors such as how much the average rent costs in a county or the household income are more predictive of how the percent of voter turnout will rise or fall.
What the main takeaway here is that the factors that are predictive of the individual turnout amount and the factors that predict the shift in the percent of all registered voters are different. To see higher increases in voting by a unit of one, it can depend on demographics such as education, poverty, population, and gender. However, to see shifts in vote percent based on the entire pool of registered voters, the factors that explain this are more about the economics of the county. If rent is cheaper, if houses are more expensive, or what the average income is, those factors help explain how shifts in voting by percent will occur. In conclusion, what we find is that the motivation for a single increase in voting can be different from what causes the overall percent of voters in a county to show a substantive shift toward more registered voters actually voting. This helps suggest that voting is more complicated depending on how you measure what is predictive. Here, the levels are individual voter count versus vote turnout percent.
Next, what I take a look at is how COVID affected the amount of voters between 2016, 2020, and 2024 for the presidential general elections. The reason to explore this is connected to the central ideas of why government and public service exist. COVID was one of the largest and most devastating public health crises in recent history. It is important to examine how the percent of voting changed across these three elections. This can give insight into whether voters felt that during a moment of crisis they would be more committed or motivated to vote in order to get politicians into office who could help with the crisis.
I examine this by using the COVID case data available from the University of South Florida website and merging it with the voter turnout data. Specifically, I created the percent change that occurs over the three elections in question. To properly examine this change, I include an interactive plot that shows the total COVID cases a county had in 2020 and the change in its voter turnout percent across those years. In this interactive plot, you can single out a county by using the legend and double clicking the county you want to explore.
What is found in this plot is that more counties had a positive increase in voter percent. For COVID cases, there does not appear to be a trend where larger numbers of COVID cases in a county lead to the largest percent increase in voting. The counties with the highest change in turnout were actually small counties in Florida that had fewer than three thousand COVID cases. These smaller, more rural counties could have been affected harder by COVID because of limited access to hospitals, which could explain some of the effects we see. Another thing that stands out is that Miami Dade had two hundred fifty thousand COVID cases, yet it showed no turnout change.
What this data suggests is that there is not a systematic connection between increasing voter turnout and the effect of COVID cases. This suggests that the increase in voting in counties in Florida was not directly motivated by the amount of COVID cases. It also suggests that there is another motivating factor that leads people to get out and vote. This means that trust in politicians to have the interests of voters may not be fully explanatory because we don’t see a trend of increased voting connected with large amounts of COVID cases in the height on the pandemic. Even during the worst public health crisis seen in recent history, there does not seem to be a systematic relationship between COVID cases and the increasing vote percent in each county. This can suggest that something else was motivating Florida counties to see a sustained increase in the number of people voting.
Next, what I look into is how the media may affect voters. There is not a direct way to examine how this media might influence their likelihood to vote, but what can be examined is how the framing of news from a large news company may shape the information that voters consume. To explore this, I used the New York Times API to grab and then examine news articles that included a Florida politician in the title. To do this, I included all well known Florida politicians from the 2016 and 2020 cycles in those election years in my API search. These election years were chosen because the study of voter motivation is most meaningful when the focus is on the period leading into an election. Voters tend to pay attention to what is right in front of them, so to measure that short lead up into an election cycle, I examined the year of news beginning in January and continuing through the end of the year when the elections are held. The years 2016 and 2020 were the only ones available because the 2024 cycle in the New York Times API has not yet been posted.
First, I examine what words show up the most in the articles. This is done by taking the article headlines and introductory statements and measuring the frequency of the words that appear. This is shown in the graph below.
What these frequencies show is that in three hundred plus articles names like Marco Rubio, who was a senator from Florida at the time, Donald Trump, who was the president in 2016 and the Republican candidate in 2020, and Rick Scott all appear hundreds of times in the titles and descriptions of the New York Times articles. Other key words that show up hundreds of times are words like campaign, republican, and senator. What this tells us is that news coverage that talks about Florida politicians is focused mostly on prominent figures such as senators or the presidential candidate who lives in the state. Another point is that because Republicans controlled many of the major political positions in the state during these years, much of the information in a national newspaper puts Republicans at the center of the coverage. This can also be explained by the fact that the best known Florida politicians during this period were from that party.
Next, I look at the change in words from 2016 to 2020 to see what is shifting in the information being portrayed. What is seen is that news related to the pandemic, such as death and dies, saw an increase in articles about politicians, while the focus on politicians who were not in the election cycle, like Marco Rubio, Rick Scott, or Ted Cruz, decreased. What this shows is that news media changes heavily from year to year, and the stories are dependent on what people are most interested in at that time.
Finally I examine the sentiment each headline and introductory statement tells us. What is analyzed is how strong the language in these articles are After examining the frequency of the words, we see that there is a focus on Republican politicians and on the pandemic and what is happening. What I examine next is the meaning behind the words that appear in the articles The sentiment is measured on a scale from negative one to one to show how strong the words being used are. This analysis helps show how powerful the language in these headlines is. The goal is to see whether the information being presented is something that tries to get people to stop and pay attention to the news. This could affect voters by creating strong emotional responses or strong sentimental reactions that might motivate them if that level of sentiment is present in the articles.
What is shown in this plot is that the highest density of articles contains words that score at the strongest ends of the sentiment analysis scale. There are very few articles that have extremely strong sentiment scores. Another thing that stands out is that the peaks for both 2020 and 2016 are at both ends of the scale. This suggests that the highest density of articles uses strong language in their headlines.
In conclusion for the New York Times data, what is shown is that what is portrayed tends to be what is popular, not necessarily balanced or two sided stories about politics. This is shown by the fact that Republicans are the only group with high word frequency in the New York Times headlines. Second, what I found is that the sentiment behind the articles is shaped by more powerful language and themes rather than mild or strictly informative headlines. This suggests that even though the New York Times is a fairly traditional and respected news source, voters are still receiving information that is emotionally loaded, with strong words and strong themes used to get them to read the articles. This suggests that trust might not be the only explanation for voting. Emotions and feelings beyond what a politician can do for someone politically, including cultural or emotional reactions, may motivate voters who feel strongly enough to act. This data does not measure direct effects, but the way the New York Times frames its headlines and chooses what to talk about can be explanatory for how people consume media. If the New York Times is portraying articles with this level of emotional weight, it is reasonable to assume that other forms of media, such as independent creators, streaming platforms like Twitch that discuss political stories, and other popular media sources today, may also play a role in riling up voters.
What this paper set out to explore is what can explain the increase in voters even when public trust in politicians is at a low. The data explored in this paper showed that this question is more complicated than it first appears. What is shown is that public trust in politicians to act in the public interest does not fully capture the complexity of voting and why people choose to participate.
What was found is that the demographics of a county can explain two different levels of voting. The individual statistics affect how many individual votes will be cast, while the economics of a county, such as overall income, the housing market, and the cost of rent, are predictive of changes in the overall voting percentage. This shows that voting can be predicted by different layers. One layer is based on who lives in an area. People who are poor or who have less education are going to vote less when looking at individual level turnout. Another layer is the economic well being of the county as a whole, which will influence the percentage point shifts in total voters. This suggests that even if individuals do not fully trust politicians to act in their self interest, they may still be motivated to vote if their economic goals align with what they believe politicians can deliver. For example, increases in housing values, property markets, local business growth, or job creation may motivate voters even if their trust in politicians remains low.
Second, it is found that COVID cases themselves are not directly connected to voting behavior in a simple way. This again shows that the idea that people vote only because they trust politicians to act in their interest is too simple. The data shows that places with the largest number of cases did not see major shifts in turnout, while some rural counties with smaller numbers of cases did see changes. Access to hospitals and economic disruption should be explored further to understand whether smaller counties faced more difficulty than larger cities. However, in the case of counties with more than five thousand COVID cases, the distribution of increased voting appears more even, with higher voting happening regardless of case counts.
Finally, the exploration of media data found that the portrayal of politics in the news is emotionally charged and focused on major figures rather than evenly covering different sides. What was found in the national media using the New York Times articles is that these outlets focus on well known politicians, which can result in coverage that is lopsided and centered on the political party that is in control of the state. These stories focus on who is important and they use emotionally charged language. The stories were found to have a high density of negative and positive sentiment, with fewer stories showing moderate tone. This tells us that news media such as the New York Times presents charged stories about people that readers are likely to know or recognize. This can create a connection or leave an impression that could motivate voting behavior. For people who feel connected to a political party or a cause, seeing emotionally charged stories may motivate them to vote even if their trust in politicians is low.
To wrap this up, this paper found that trust in politicians may not show the full story of why people vote. The vote returns in Florida point to another explanation, which is that there are many reasons that motivate a person to vote beyond what they receive directly from a politician. The environment around them and the media they consume may explain more about how to predict voting than trust alone.