Following code reads in the data:
## Loading required package: MASS
## Loading required package: survival
## Warning: Missing column names filled in: 'X1' [1]
##
## ── Column specification ────────────────────────────────────────────────────────
## cols(
## X1 = col_double(),
## `News Outlet` = col_character(),
## `Jan-Mar 2015` = col_double(),
## `Jan-Mar 2016` = col_double(),
## `Jan-Mar 2017` = col_double(),
## `Jan-Mar 2018` = col_double(),
## `Jan-Mar 2019` = col_double(),
## V7 = col_character(),
## `2015 Slant` = col_double(),
## `2016 Slant` = col_double(),
## `2017 Slant` = col_double(),
## `2018 Slant` = col_double(),
## `2019 Slant` = col_double()
## )
The data read is of the r/Anarcho_Capitalism subreddit for years 2015-2019, Jan - March. Each year’s computed slant score is fitted with a distribution (given the Cullen & Gray Graph). For this subreddit, following is the simple plot of the 2015 slant score data along with the Cullen & Gray graph:
## summary statistics
## ------
## min: 2.594774e-05 max: 0.9999741
## median: 0.5250006
## mean: 0.52505
## estimated sd: 0.07185334
## estimated skewness: -0.2277766
## estimated kurtosis: 30.02687
Clearly, the candidates for distribution fitting are: beta, gamma & lognormal. We use: Weibull, beta & gamma. Note that since MLE is used to fit the data, we scale it such that all values are positive. First, the 2015 slant scores is fitted as follows:
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## -20.232000 0.000000 0.000000 0.001903 0.000000 18.305000
## 25% 50% 75% 90% 99%
## 0.00000 0.00000 0.00000 0.27350 9.80175
Here are the 2016 slant scores plotted:
## summary statistics
## ------
## min: 2.371185e-05 max: 0.9999763
## median: 0.338629
## mean: 0.3448791
## estimated sd: 0.1064356
## estimated skewness: 2.710351
## estimated kurtosis: 17.64149
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## -14.2800 0.0000 0.0000 0.2636 0.0000 27.8910
## 25% 50% 75% 90% 99%
## 0.00000 0.00000 0.00000 1.23400 18.79535
Here are the 2017 slant scores plotted:
## summary statistics
## ------
## min: 2.614242e-05 max: 0.9999739
## median: 0.4555317
## mean: 0.4655701
## estimated sd: 0.09595385
## estimated skewness: 1.708692
## estimated kurtosis: 15.86868
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## -17.424 0.000 0.000 0.384 0.000 20.826
## 25% 50% 75% 90% 99%
## 0.0000 0.0000 0.0000 0.4530 15.8871
Here are the 2018 slant scores plotted:
## summary statistics
## ------
## min: 5.053568e-05 max: 0.9999495
## median: 0.3567819
## mean: 0.3622465
## estimated sd: 0.0712898
## estimated skewness: 4.657313
## estimated kurtosis: 49.07384
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## -7.0590 0.0000 0.0000 0.1081 0.0000 12.7270
## 25% 50% 75% 90% 99%
## 0.00000 0.00000 0.00000 0.00000 5.75425
Here are the 2019 slant scores plotted:
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## -11.40300 0.00000 0.00000 -0.08756 0.00000 3.26200
## 25% 50% 75% 90% 99%
## 0.0000 0.0000 0.0000 0.0000 0.1668