Network & Viewer Analysis (Chi-Square Test)

Rationale

Agenda-setting theory holds that the media influence which issues audiences consider most important. When a news network emphasizes certain topics, viewers are more likely to treat those topics as top priorities, even if their underlying views or beliefs do not change. Because networks prioritize different issues, their audiences can differ in what they view as important.

Viewers of a particular news network may be more likely to consider specific issues (such as immigration) as a top concern if that network consistently emphasizes it. Therefore, the network someone watches could be associated with whether they view immigration as a top national issue.

Hypothesis

The proportion of participants who identify immigration as a top issue will differ depending on whether they viewed CNN or Fox News.

Variables & method

A total of 600 participants (300 per network) watched their assigned network, CNN or Fox News, and then stated whether they believed immigration was a “top issue” or “not a top issue.”

The dependent variable was a categorical measure of whether participants judged immigration to be a top issue. The independent variable was a categorical measure indicating whether participants had watched CNN or Fox News.

A chi-square test of independence was conducted to examine whether the association between news network and perception of immigration as a top issue was statistically significant.
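
For reference, the test compares each observed cell count with the count expected if network and issue rating were independent. The notation below (O, E, R, C, N) is introduced here for illustration and does not appear in the original analysis. The continuity-corrected form is shown because R's chisq.test() applies Yates' correction by default to 2×2 tables.

```latex
E_{ij} = \frac{R_i \, C_j}{N}, \qquad
\chi^2_{\text{Yates}} = \sum_{i=1}^{2} \sum_{j=1}^{2}
  \frac{\left( \lvert O_{ij} - E_{ij} \rvert - 0.5 \right)^2}{E_{ij}}, \qquad
df = (2 - 1)(2 - 1) = 1
```

Here O_ij is the observed count in row i and column j, E_ij is the corresponding expected count, R_i and C_j are the row and column totals, and N is the total sample size.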

Results & discussion

The graph and crosstabulation table below summarize the relationship between the dependent and independent variables. The chi-square results are also shown.

Crosstabulation of DV by IV
Counts and (Column Percentages)
	CNN	Fox
1 Top issue	35 (11.7%)	115 (38.3%)
2 Not top issue	265 (88.3%)	185 (61.7%)

Chi-squared Test Results
Test of Independence between DV and IV
Test	Chi-squared Statistic	Degrees of Freedom	p-value
Chi-squared Test of Independence	55.476	1	< 0.001

The results supported the hypothesis. A larger share of Fox News viewers (38.3%) than CNN viewers (11.7%) considered immigration a top issue; correspondingly, 88.3% of CNN viewers and 61.7% of Fox News viewers said it was not a top issue. The chi-square test showed that this association was statistically significant, χ²(1, N = 600) = 55.48, p < .001.
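
The reported statistic can be checked directly from the four cell counts in the crosstabulation, without the raw data file. The sketch below is an illustration rather than part of the original script; the object names observed, corrected, and uncorrected are introduced here. It assumes the published statistic came from a default chisq.test() call, which applies Yates' continuity correction to 2×2 tables.

```r
# Observed counts copied from the crosstabulation
# (matrix() fills column by column: CNN first, then Fox)
observed <- matrix(
  c(35, 265, 115, 185),
  nrow = 2,
  dimnames = list(
    Immigration = c("Top issue", "Not top issue"),
    Network     = c("CNN", "Fox")
  )
)

# Default call: Yates' continuity correction is applied to 2x2 tables
corrected <- chisq.test(observed)

# Same test without the correction, for comparison
uncorrected <- chisq.test(observed, correct = FALSE)

# Expected counts under independence: 75 per network for "Top issue",
# 225 per network for "Not top issue"
corrected$expected

round(unname(corrected$statistic), 3)    # 55.476
round(unname(uncorrected$statistic), 3)  # 56.889
corrected$p.value < 0.001                # TRUE
```

The corrected value matches the statistic in the results table, which is consistent with the default correction having been used. All expected counts are well above 5, so the usual chi-square approximation is reasonable here.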

Code:

# ------------------------------
# Setup: Install and load packages
# ------------------------------
if (!require("tidyverse")) install.packages("tidyverse")   # Data wrangling & plotting
if (!require("gmodels")) install.packages("gmodels")       # Crosstabs
if (!require("gt")) install.packages("gt")                 # Table formatting

library(tidyverse)
library(gmodels)
library(gt)

# ------------------------------
# Load the data
# ------------------------------
# Replace "TopIssue.csv" with the name of your dataset file
mydata <- read.csv("TopIssue.csv") #Edit

# ------------------------------
# Define Dependent (DV) and Independent (IV) variables
# ------------------------------
# Replace Immigration and PreferredNetwork with the actual column names in your data
mydata$DV <- mydata$Immigration #Edit
mydata$IV <- mydata$PreferredNetwork #Edit

# ------------------------------
# Visualization: Stacked bar chart of DV by IV
# ------------------------------
graph <- ggplot(mydata, aes(x = IV, fill = DV)) +
  geom_bar(colour = "black") +
  scale_fill_brewer(palette = "Paired") +
  labs(
    title = "Distribution of DV by IV",
    x = "Independent Variable",
    y = "Count",
    fill = "Dependent Variable"
  )

#Show the graph
graph

# ------------------------------
# Crosstabulation of DV by IV (DV = rows, IV = columns)
# ------------------------------

crosstab <- mydata %>%
  count(DV, IV) %>%
  # Group by IV so each percentage is a share of its column (network) total
  group_by(IV) %>%
  mutate(ColPct = 100 * n / sum(n)) %>%
  ungroup() %>%
  mutate(Cell = paste0(n, "\n(", round(ColPct, 1), "%)")) %>%
  select(DV, IV, Cell) %>%
  pivot_wider(names_from = IV, values_from = Cell)

# Format into gt table
crosstab_table <- crosstab %>%
  gt(rowname_col = "DV") %>%
  tab_header(
    title = "Crosstabulation of DV by IV",
    subtitle = "Counts and (Column Percentages)"
  ) %>%
  # DV is the stub (row-name) column, so it is labelled via the stubhead
  tab_stubhead(label = "Dependent Variable")

# Show the polished crosstab table
crosstab_table

# ------------------------------
# Chi-squared test of independence
# ------------------------------
options(scipen = 999)  # Prevents scientific notation
# Note: for a 2x2 table, chisq.test() applies Yates' continuity correction by default
chitestresults <- chisq.test(mydata$DV, mydata$IV)

# ------------------------------
# Format Chi-squared test results into a table
# ------------------------------
chitest_summary <- tibble(
  Test   = "Chi-squared Test of Independence",
  Chi_sq = chitestresults$statistic,
  df     = chitestresults$parameter,
  p      = chitestresults$p.value
)

chitest_table <- chitest_summary %>%
  gt() %>%
  # Round χ² and p-value to 3 decimals, df to integer
  fmt_number(columns = c(Chi_sq, p), decimals = 3) %>%
  fmt_number(columns = df, decimals = 0) %>%
  tab_header(
    title = "Chi-squared Test Results",
    subtitle = "Test of Independence between DV and IV"
  ) %>%
  cols_label(
    Test   = "Test",
    Chi_sq = "Chi-squared Statistic",
    df     = "Degrees of Freedom",
    p      = "p-value"
  )

# Show the formatted results table
chitest_table
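
Rounding the p-value to three decimals prints very small values as 0.000, which can be misread as exactly zero. One possible way to report such values as "< 0.001" is sketched below. format_p is a hypothetical helper introduced here for illustration; it is not part of the original script or of gt.

```r
# Hypothetical helper: format p-values to a fixed number of decimals,
# reporting anything below the smallest printable value as "< 0.001"
format_p <- function(p, digits = 3) {
  threshold <- 10^(-digits)
  ifelse(
    p < threshold,
    paste0("< ", formatC(threshold, format = "f", digits = digits)),
    formatC(p, format = "f", digits = digits)
  )
}

format_p(c(0.0000000000001, 0.0312))
# "< 0.001" "0.031"
```

If this approach is used with the script above, the p column would be converted with format_p() before calling gt() and left out of the fmt_number() call, since it would then be a character column.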