定义状态:
- S0:出现过 0 次“5”,且尚未失败
- S1:出现过 1 次“5”,且尚未失败
- L:失败状态(已经看到“4”或“6”)
- W:获胜状态(已经看到第 2 次“5”)
转移概率为:
{S0→S1:1/7,S1→W:1/7,S0→L:2/7,S1→L:2/7,S0→S0:4/7S1→S1:4/7
设 p0 为从 S0 出发最终获胜的概率,p1 为从 S1 出发最终获胜的概率,则
p0=71p1+74p0,p1=71+74p1
先解 p1:
p1−74p1=71⟹73p1=71⟹p1=31.
再解 p0:
p0−74p0=71p1⟹73p0=71⋅31⟹p0=91.
91
Original Explanation
Define states:
- S0: 0 occurrences of "5", game not lost
- S1: 1 occurrence of "5", game not lost
- L: Losing state (seen "4" or "6")
- W: Winning state (seen 2nd "5")
Transition probabilities:
{S0→S1:1/7,S1→W:1/7,S0→L:2/7,S1→L:2/7,S0→S0:4/7S1→S1:4/7
Let p0 = probability of winning from S0, p1 = probability of winning from S1. Then
p0=71p1+74p0,p1=71+74p1
Solve p1:
p1−74p1=71⟹73p1=71⟹p1=31.
Then
p0−74p0=71p1⟹73p0=71⋅31⟹p0=91.
91