Smith is in jail and has 1 dollar; he can get out on bail if he has 8 dollars. A guard agrees to make a series of bets with him. If Smith bets A dollars, he wins A dollars with probability .4 and loses A dollars with probability .6. Find the probability that he wins 8 dollars before losing all of his money if
Answer
The Markov chain (Xn, n = 0, 1, . . .) representing the evolution of Smith’s money
Let \(\varphi(i)\) be the probability that the chain reaches state 8 before reaching state 0, starting from state i. In other words, if \(S_j\) is the first \(n \geq 0\) such that \(X_n = j\),
\[\varphi(i) = P_i(S_8 < S_0) = P(S_8 < S_0|X_0 = i)\]
Using first-step analysis (viz. the Markov property at time n = 1), we have
\[\varphi(i) = 0.4\varphi(i + 1) + 0.6\varphi(i - 1), i = 1, 2, 3, 4, 5, 6, 7\]
\[\varphi(0) = 0\]
\[\varphi(8) = 1\]
Various States
Markov Chains
The timid strategy can be modeled as an absorbing Markov chain. From any given dollars amount n where n is 1 to 8, the probability of moving to n???1 is 0.6 while the probability of moving to n+1 is 0.4. The absorbing states are 0 (no money left to bet) and 8 (the game has been won).
Timid Strategy
Therefore,
\(\frac{q}{p} = \frac {0.6}{0.4}=1.5\)
\(P=\frac {1-(\frac{q}{p})^s}{1-(\frac{q}{p})^M}\)
\(P=\frac{1-1.5^1}{1-1.5^8}=0.0203\)
q <- 0.6
p <- 0.4
m <- 8
qp = q/p
for (s in 1:8) {
P = (1 - qp^s)/(1-qp^m)
print(paste0("s: " ,s," P: ", P))
print(P)
}
## [1] "s: 1 P: 0.0203013481363997"
## [1] 0.02030135
## [1] "s: 2 P: 0.0507533703409993"
## [1] 0.05075337
## [1] "s: 3 P: 0.0964314036478986"
## [1] 0.0964314
## [1] "s: 4 P: 0.164948453608248"
## [1] 0.1649485
## [1] "s: 5 P: 0.267724028548771"
## [1] 0.267724
## [1] "s: 6 P: 0.421887390959556"
## [1] 0.4218874
## [1] "s: 7 P: 0.653132434575734"
## [1] 0.6531324
## [1] "s: 8 P: 1"
## [1] 1
Probability is 0.0203
Answer
Since the prisoner bets A dollars and will lose or gain A dollars, and he bets his entire money each time until $8. he must win each time or lose. If he wins sequence is: $1,$2,$4,$8. He starts with $1 and must win 3 consecutive bets in a row at probability p=0.4
Bold Strategy
# Transition matrix in canonical form
s <- matrix(c(1, 0, 0,0,0,
0.6,0,0.4,0,0,
0.6,0,0,0.4,0,
0.6,0,0,0,0.4,
0, 0, 0,0,1), nrow = 5, byrow=TRUE)
s
## [,1] [,2] [,3] [,4] [,5]
## [1,] 1.0 0 0.0 0.0 0.0
## [2,] 0.6 0 0.4 0.0 0.0
## [3,] 0.6 0 0.0 0.4 0.0
## [4,] 0.6 0 0.0 0.0 0.4
## [5,] 0.0 0 0.0 0.0 1.0
s1 <- matrix(c(0,1,0,0,0),nrow=1)
s1
## [,1] [,2] [,3] [,4] [,5]
## [1,] 0 1 0 0 0
s2 <- s1 %*% s
s2
## [,1] [,2] [,3] [,4] [,5]
## [1,] 0.6 0 0.4 0 0
s3 <- s2 %*% s
s3
## [,1] [,2] [,3] [,4] [,5]
## [1,] 0.84 0 0 0.16 0
s4 <- s3 %*% s
s4
## [,1] [,2] [,3] [,4] [,5]
## [1,] 0.936 0 0 0 0.064
Therefore, from the Binomial Distribution, we have that:
(P <- 0.4^3)
## [1] 0.064
Probability is 0.064
Answer
To summarize, this game has a negative expected value. That is, on any given bet, the prisoner is more likely to lose than win. In such cases, the best strategy is to play as few times as possible. The more prisoner plays the closer the result will match the expected outcome (due to the law of large numbers). The expected outcome is that the prisoner will lose all his money.
The bold strategy will play at most 3 times while the timid strategy must play, at a minimum, 8 times to win the game.
Bold strategy gives Smith a better chance to get out of jail because it gives him more than triple his winning chances with this strategy.