library(rjags)
## Loading required package: coda
## Linked to JAGS 4.1.0
## Loaded modules: basemod,bugs
library(mcmc)
library(stringr)
library(BEST)
The Tsardom of Russia, also known as the Tsardom of Muscovy, was the name of the centralized Russian state from Ivan IV’s assumption of the title of Tsar in 1547 until Peter the Great’s foundation of the Russian Empire in 1721. The House of Romanov was the second dynasty (race), after the Rurik dynasty, to rule over Russia, and reigned from 1613 until the abdication of Czar Nicholas II on March 15, 1917, as a result of the February Revolution See: https://en.wikipedia.org/wiki/Tsardom_of_Russia, https://en.wikipedia.org/wiki/House_of_Romanov and http://www.spsl.nsc.ru/history/descr/main_p.htm.
We want to estimate the parameters of ruling duration for both races, using both classical (frequentist’s) and Bayesian approach (http://www.indiana.edu/~kruschke/BEST/). Let’s do it!
Tsardom<-c(1462-1505, 1505-1533, 1533-1584, 1584-1598, 1598-1605, 1605-1605, 1605-1606, 1606-1610, 1610-1613, 1613-1645, 1645-1676, 1676-1682, 1682-1696)
Tsardom<-abs(Tsardom)
Tsardom
## [1] 43 28 51 14 7 0 1 4 3 32 31 6 14
hist(Tsardom)
Romanov<-c(1696-1725,1725-1727,1727-1730,1730-1740,1740-1741,1741-1761,1761-1762,1762-1796,1796-1801,1801-1825,1825-1855,1855-1881,1881-1894,1894-1917)
Romanov<-abs(Romanov)
Romanov
## [1] 29 2 3 10 1 20 1 34 5 24 30 26 13 23
hist(Romanov)
Tsars<-c(Tsardom,Romanov)
Tsars
## [1] 43 28 51 14 7 0 1 4 3 32 31 6 14 29 2 3 10 1 20 1 34 5 24
## [24] 30 26 13 23
hist(Tsars)
summary(Tsardom)
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 0 4 14 18 31 51
summary(Romanov)
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 1.00 3.50 16.50 15.79 25.50 34.00
summary(Tsars)
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 0.00 3.50 14.00 16.85 28.50 51.00
qc<-c(0.1,0.25,0.5,0.6, 0.75,0.9,0.95)
quantile(Tsardom,qc)
## 10% 25% 50% 60% 75% 90% 95%
## 1.4 4.0 14.0 16.8 31.0 40.8 46.2
quantile(Romanov,qc)
## 10% 25% 50% 60% 75% 90% 95%
## 1.3 3.5 16.5 22.4 25.5 29.7 31.4
quantile(Tsars,qc)
## 10% 25% 50% 60% 75% 90% 95%
## 1.0 3.5 14.0 21.8 28.5 32.8 40.3
For the last Romanov we see \(T=23\) that is Nicholas II ruled over Russia until the abdication being in the 60% percentile of the sample while for the Tsardom this percentile is \(16.8\).
We postulate our null hypothesis of equality of two samples (Romanov, Tsardom) means as \(H_0: T_{Romanov} = T_{Tsardom}\). Consequently our alternative hypothesis will be \(H_1: {T_{Romanov}} \neq {T_{Tsardom}}\).
#Student t-test for two means
t.test(x = Tsardom,y = Romanov)
##
## Welch Two Sample t-test
##
## data: Tsardom and Romanov
## t = 0.38586, df = 21.45, p-value = 0.7034
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -9.70439 14.13296
## sample estimates:
## mean of x mean of y
## 18.00000 15.78571
#Test for variances equality
var.test(x = Tsardom, y = Romanov)
##
## F test to compare two variances
##
## data: Tsardom and Romanov
## F = 2.003, num df = 12, denom df = 13, p-value = 0.2285
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
## 0.6352478 6.4883991
## sample estimates:
## ratio of variances
## 2.003048
#Test for normality
shapiro.test(Tsardom)
##
## Shapiro-Wilk normality test
##
## data: Tsardom
## W = 0.8838, p-value = 0.0803
shapiro.test(Romanov)
##
## Shapiro-Wilk normality test
##
## data: Romanov
## W = 0.89445, p-value = 0.09371
#wilcoxon test for medians
wilcox.test(x = Tsardom,y = Romanov,exact = F)
##
## Wilcoxon rank sum test with continuity correction
##
## data: Tsardom and Romanov
## W = 97.5, p-value = 0.7707
## alternative hypothesis: true location shift is not equal to 0
All of the tests prove null hypothesis \(H_0: T_{Romanov} = T_{Tsardom}\).
\[ P(\Theta|D) = P(D|\Theta)P(\Theta)/P(D)\]
Here we produce by MCMC (Markov chain Monte Carlo) procedure the posterior distribution of the duration values parameters \(\Theta\) (means, difference of means and the effect size) given the data \(D\). Then we verify our results using diagnostics.
#Random number generator seed
set.seed(12345)
#Bayesian alternative to the t-test
BESTout <- BESTmcmc(Tsardom, Romanov)
##
## Processing function input.......
##
## Done.
##
## Compiling model graph
## Resolving undeclared variables
## Allocating nodes
## Graph information:
## Observed stochastic nodes: 27
## Unobserved stochastic nodes: 5
## Total graph size: 77
##
## Initializing model
##
## Adaptive phase, 500 iterations x 3 chains
## If no progress bar appears JAGS has decided not to adapt
##
##
## Burn-in phase, 1000 iterations x 3 chains
##
##
## Sampling from joint posterior, 33334 iterations x 3 chains
##
##
## MCMC took 1.107 minutes.
summary(BESTout)
## mean median mode HDI% HDIlo HDIup compVal %>compVal
## mu1 17.556 17.53 17.241 95 6.568 28.517
## mu2 15.800 15.80 15.800 95 8.350 23.145
## muDiff 1.755 1.75 1.763 95 -11.663 14.704 0 60.8
## sigma1 18.748 17.98 16.403 95 11.081 28.042
## sigma2 13.266 12.77 12.166 95 8.190 19.388
## sigmaDiff 5.482 5.11 4.500 95 -5.106 16.925 0 86.3
## nu 38.799 30.10 13.950 95 2.094 99.679
## log10nu 1.460 1.48 1.530 95 0.776 2.114
## effSz 0.108 0.11 0.132 95 -0.680 0.902 0 60.8
For the last Romanov Nicholas II we see \(T_{Romanov} = 23.2\) as HIDup for the Bayesian estimate of the mean i.e. 95% probability to fit expected value.