Objective

This file identifies the number of retweets (RTs) in our sample. We found that about 95% of our sample had 8 or fewer RTs, which leaves 390 RTEs at or above that threshold.

Loading data

First, we load and combine two data frames containing the retweet counts for the two countries:

  • da_dataframe (Argentina)

  • dc2_dataframe (Canada)
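
The combining step could look roughly like the sketch below. It assumes the dplyr package and uses small made-up stand-ins for the two data frames; the `retweet_count` column in the inputs and the added `country` label are assumptions for illustration, not the actual loading code.

```r
library(dplyr)

# Hypothetical stand-ins for the two country data frames (toy values only)
da_dataframe  <- tibble(retweet_count = c(0, 1, 3))   # Argentina
dc2_dataframe <- tibble(retweet_count = c(0, 2, 12))  # Canada

# Stack the two data frames and record which country each row came from
combined <- bind_rows(
  Argentina = da_dataframe,
  Canada    = dc2_dataframe,
  .id = "country"
)

print(combined)
```

Keeping a country label on each row makes it possible to split the combined sample again later if needed.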

Sampling

We follow Helms 2017 and 2016. We found that about 95% of our RTEs have 8 or fewer retweets, as the output below shows.

## [1] "The threshold retweet count at 95% is: 8"
## [1] "The number of cases at this threshold is: 390"
## # A tibble: 92 × 5
##    retweet_count     n cumulative_cases total_cases cumulative_percentage
##            <dbl> <dbl>            <dbl>       <dbl>                 <dbl>
##  1             8    33             6888        7245                  95.1
##  2             9    30             6918        7245                  95.5
##  3            10    25             6943        7245                  95.8
##  4            11    28             6971        7245                  96.2
##  5            12    19             6990        7245                  96.5
##  6            13    20             7010        7245                  96.8
##  7            14    13             7023        7245                  96.9
##  8            15    19             7042        7245                  97.2
##  9            16    12             7054        7245                  97.4
## 10            17     9             7063        7245                  97.5
## # ℹ 82 more rows
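
A table with the same columns as the one above can be built roughly as follows. This is a minimal sketch on made-up toy data, assuming dplyr and one row per case. The object names `combined`, `cumulative_table`, `threshold` and `cases_at_threshold` are hypothetical, and counting cases at or above the threshold is an interpretation that matches the figures reported above, not necessarily the original code.

```r
library(dplyr)

# Toy data (hypothetical): one row per case, 20 cases in total
combined <- tibble(
  retweet_count = c(rep(0, 10), rep(1, 5), rep(2, 3), 8, 50)
)

# Count cases per retweet count, then accumulate from the lowest count upward
cumulative_table <- combined %>%
  count(retweet_count) %>%
  arrange(retweet_count) %>%
  mutate(
    cumulative_cases      = cumsum(n),
    total_cases           = sum(n),
    cumulative_percentage = 100 * cumulative_cases / total_cases
  )

# Threshold: the smallest retweet count whose cumulative share reaches 95%
threshold <- cumulative_table %>%
  filter(cumulative_percentage >= 95) %>%
  slice(1) %>%
  pull(retweet_count)

# Cases at or above the threshold
cases_at_threshold <- sum(combined$retweet_count >= threshold)

print(paste("The threshold retweet count at 95% is:", threshold))
print(paste("The number of cases at this threshold is:", cases_at_threshold))
```

The reported figures are consistent with this reading: 7245 total cases minus 6888 cumulative cases at a retweet count of 8, plus the 33 cases exactly at 8, gives 390.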