This is the file to calculate the frecuencies of content types
Just the first time, we run this code to add the validated categories to the RTE_Events file:
Calculating the type of content most frequently retweeted.
## # A tibble: 6 × 4
## category Frequency Percentage Median_Retweets
## <fct> <int> <dbl> <dbl>
## 1 Interesting 199 52.5 14
## 2 Resonance 77 20.3 16
## 3 Emotional Impact 61 16.1 22
## 4 Novelty 35 9.2 17
## 5 Call to action 6 1.6 37
## 6 Humor 1 0.3 47
Null Hypothesis: The null hypothesis states that there is no significant difference in frequencies among the categories.
##
## Chi-squared test for given probabilities
##
## data: category_summary$Frequency
## X-squared = 420.68, df = 5, p-value < 2.2e-16
## The result indicates that we reject the null hypothesis
## The overall variables are independent and signitificative different
##
## Chi-squared test for given probabilities
##
## data: specific_counts
## X-squared = 1.8551, df = 1, p-value = 0.1732
## The result indicates that we cannot reject the null hypothesis.
## Resonance and Emotional Impact are not signitificative different