返回题库

令牌之争

Token Tussle

专题
Probability / 概率
难度
L7

题目详情

两名玩家每人一开始有 12 个代币。他们掷三个骰子,直到总点数为 11 或 14。如果总点数为 14,则玩家 AA 向玩家 BB 提供一个令牌;如果点数为 14,则玩家 AA 向玩家 BB 提供一个令牌;如果点数为 14,则玩家 AA 向玩家 BB 提供一个令牌。如果总和为 11,则玩家 BB 向玩家 AA 给予令牌。他们重复这个过程,直到一个玩家(获胜者)拥有所有代币。玩家 AA 获胜的概率是多少?

Two players each start with 12 tokens. They roll three dice until the sum is either 11 or 14. If the sum is 14, player AA gives a token to player BB; if the sum is 11, player BB gives a token to player AA. They repeat this process until one player, the winner, has all the tokens. What is the probability that player AA wins?

解析

让我们稍微概括一下,假设每个玩家一开始都有 aa 代币,而玩家 AA 一回合获得代币的概率是 pp

xxAA 的代币数量减去 aa。因此游戏从 x=0x=0 开始,玩家 AA 需要在达到 x=ax = -a 之前达到 x=ax = a 才能获胜。

我们可以将游戏视为有偏差的、吸引人的随机游走:xx从零开始,在每一步,xx增加1(概率为pp)或减少1(概率为1p1-p),直到x=ax = ax=ax = -a

AA的代币数量减去aaxx时,令RxR_xAA的获胜概率。然后我们知道: Ra=1 and Ra=0R_a = 1 \ \text{and} \ R_{-a} = 0Rn=pRn+1+(1p)Rn1R_n = pR_{n+1} + (1-p)R_n - 1 对于a<n<a-a < n < a。上面的方程是线性递推关系的一个例子,我们可以按如下方式求解 RnR_npRn+1=Rn(1p)Rn1pR_{n+1} = R_n - (1-p)R_{n-1} 这有特征方程 px2x+1p=0px^2 - x + 1 - p = 0 我们假设 p>12p > \frac{1}{2}。那么这个方程的两个根就是a1=1a_1 = 1a2=1ppa_2 = \frac{1-p}{p}。那么存在常量 c1c_1c2c_2 使得: Rn=c1a1n+c2a2nR_n = c_1a_1^n + c2a^n_20=Ra0 = R_{-a}1=Ra1 = R_a。求解,我们找到c1=a2aa22aa12ac_1 = \frac{a_2^a}{a^{2a}_{2} - a^{2a}_{1}}c2=a1aa22aa12ac_2 = \frac{-a_1^a}{a^{2a}_2 - a^{2a}_{1}}。我们对 R0R_0 感兴趣,我们发现: R0=c1+c2=1a1a+a2aR_0 = c_1 + c_2 = \frac{1}{a_1^a + a_2^a} 对于我们的特定问题,a=12a=12p=914p = \frac{9}{14}(因为用三个骰子掷出总和为 11 和 14 的概率分别为 18\frac{1}{8}572\frac{5}{72}),因此 a2=59a_2 = \frac{5}{9}AA 玩家获胜的概率为: R0=11+(59)120.99913631R_0 = \frac{1}{1 + (\frac{5}{9})^{12}} \approx 0.99913631


Original Explanation

Let's generalize slightly and suppose that each player starts with aa tokens and the probability of player AA gaining a token on one turn is pp.

Let xx be the number of AA's tokens minus aa. So the game begins with x=0x=0, and player AA needs to reach x=ax = a before reaching x=ax = -a in order to win.

We can view the game as a biased, absorbing random walk: xx begins at zero, and at each step, xx increases by one (with probability pp) or decreases by 1 (with probability 1p1-p) until x=ax = a or x=ax = -a.

Let RxR_x be AA's probability of winning when the number of AA's tokens minus aa is xx. Then we know:

Ra=1 and Ra=0R_a = 1 \ \text{and} \ R_{-a} = 0

and

Rn=pRn+1+(1p)Rn1R_n = pR_{n+1} + (1-p)R_n - 1

for a<n<a-a < n < a. The equation above is an example of a linear recurrence relation, and we can solve for RnR_n as follows:

pRn+1=Rn(1p)Rn1pR_{n+1} = R_n - (1-p)R_{n-1}

and this has characteristic equation

px2x+1p=0px^2 - x + 1 - p = 0

Let's assume p>12p > \frac{1}{2}. Then the two roots of this equation are a1=1a_1 = 1 and a2=1ppa_2 = \frac{1-p}{p}. Then there exist constants c1c_1 and c2c_2 such that:

Rn=c1a1n+c2a2nR_n = c_1a_1^n + c2a^n_2

with 0=Ra0 = R_{-a} and 1=Ra1 = R_a. Solving, we find c1=a2aa22aa12ac_1 = \frac{a_2^a}{a^{2a}_{2} - a^{2a}_{1}} and c2=a1aa22aa12ac_2 = \frac{-a_1^a}{a^{2a}_2 - a^{2a}_{1}}. We are interested in R0R_0 and we find:

R0=c1+c2=1a1a+a2aR_0 = c_1 + c_2 = \frac{1}{a_1^a + a_2^a}

For our particular problem, a=12a=12 and p=914p = \frac{9}{14} (since the probabilities of throwing a sum of 11 and 14 with three dice are 18\frac{1}{8} and 572\frac{5}{72}, respectively), so a2=59a_2 = \frac{5}{9} and so the probability of player AA winning is:

R0=11+(59)120.99913631R_0 = \frac{1}{1 + (\frac{5}{9})^{12}} \approx 0.99913631