Reading in the data - r/AnarchoCapitalism

Following code reads in the data:

## Loading required package: MASS
## Loading required package: survival
## Warning: Missing column names filled in: 'X1' [1]
## 
## ── Column specification ────────────────────────────────────────────────────────
## cols(
##   X1 = col_double(),
##   `News Outlet` = col_character(),
##   `Jan-Mar 2015` = col_double(),
##   `Jan-Mar 2016` = col_double(),
##   `Jan-Mar 2017` = col_double(),
##   `Jan-Mar 2018` = col_double(),
##   `Jan-Mar 2019` = col_double(),
##   V7 = col_character(),
##   `2015 Slant` = col_double(),
##   `2016 Slant` = col_double(),
##   `2017 Slant` = col_double(),
##   `2018 Slant` = col_double(),
##   `2019 Slant` = col_double()
## )

The data read is of the r/Anarcho_Capitalism subreddit for years 2015-2019, Jan - March. Each year’s computed slant score is fitted with a distribution (given the Cullen & Gray Graph). For this subreddit, following is the simple plot of the 2015 slant score data along with the Cullen & Gray graph:

## summary statistics
## ------
## min:  2.594774e-05   max:  0.9999741 
## median:  0.5250006 
## mean:  0.52505 
## estimated sd:  0.07185334 
## estimated skewness:  -0.2277766 
## estimated kurtosis:  30.02687

2015 Slant Scores:

Clearly, the candidates for distribution fitting are: beta, gamma & lognormal. We use: Weibull, beta & gamma. Note that since MLE is used to fit the data, we scale it such that all values are positive. First, the 2015 slant scores is fitted as follows:

##       Min.    1st Qu.     Median       Mean    3rd Qu.       Max. 
## -20.232000   0.000000   0.000000   0.001903   0.000000  18.305000
##     25%     50%     75%     90%     99% 
## 0.00000 0.00000 0.00000 0.27350 9.80175

2016 Slant scores:

Here are the 2016 slant scores plotted:

## summary statistics
## ------
## min:  2.371185e-05   max:  0.9999763 
## median:  0.338629 
## mean:  0.3448791 
## estimated sd:  0.1064356 
## estimated skewness:  2.710351 
## estimated kurtosis:  17.64149
##     Min.  1st Qu.   Median     Mean  3rd Qu.     Max. 
## -14.2800   0.0000   0.0000   0.2636   0.0000  27.8910
##      25%      50%      75%      90%      99% 
##  0.00000  0.00000  0.00000  1.23400 18.79535

2017 Slant scores:

Here are the 2017 slant scores plotted:

## summary statistics
## ------
## min:  2.614242e-05   max:  0.9999739 
## median:  0.4555317 
## mean:  0.4655701 
## estimated sd:  0.09595385 
## estimated skewness:  1.708692 
## estimated kurtosis:  15.86868
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
## -17.424   0.000   0.000   0.384   0.000  20.826
##     25%     50%     75%     90%     99% 
##  0.0000  0.0000  0.0000  0.4530 15.8871

2018 Slant scores:

Here are the 2018 slant scores plotted:

## summary statistics
## ------
## min:  5.053568e-05   max:  0.9999495 
## median:  0.3567819 
## mean:  0.3622465 
## estimated sd:  0.0712898 
## estimated skewness:  4.657313 
## estimated kurtosis:  49.07384
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
## -7.0590  0.0000  0.0000  0.1081  0.0000 12.7270
##     25%     50%     75%     90%     99% 
## 0.00000 0.00000 0.00000 0.00000 5.75425

2019 Slant scores:

Here are the 2019 slant scores plotted:

##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max. 
## -11.40300   0.00000   0.00000  -0.08756   0.00000   3.26200
##    25%    50%    75%    90%    99% 
## 0.0000 0.0000 0.0000 0.0000 0.1668