返回题库

选对罐子

Pick Your Urn Wisely

专题
Probability / 概率
难度
L4

题目详情

有两只外观相同的罐子:

  • 罐 A:4 枚 1 元筹码,6 枚 10 元筹码。
  • 罐 B:3 枚 1 元筹码,7 枚 10 元筹码。

你先随机选一只罐子并抽出一枚筹码,结果抽到 1 元。

接着你可以选择:

  • 继续从同一只罐子再随机抽一枚;或
  • 改从另一只罐子随机抽一枚。

你的收益等于第二枚筹码的面值。问:最优策略下期望收益是多少?

You have 22 indistinguishable urns in front of you. The first urn has 44 ($1(\$1 chips) and $6$ ($10(\$10 chips), while the second urn has $3$ ($1(\$1 chips) and $7$ ($10(\$10 chips). You reach into one urn at random and select a ($1(\$1 chip). Then you have the opportunity to pick another chip at random either from same urn you picked the first chip from or at random from the other urn. Your payout is the value of the second chip you select. Under optimal gameplay, what is your expected payout?

解析

先算后验:

P(A抽到 1)=1241012410+12310=47,P(B抽到 1)=37.P(A\mid\text{抽到 1})=\frac{\frac12\cdot\frac{4}{10}}{\frac12\cdot\frac{4}{10}+\frac12\cdot\frac{3}{10}}=\frac{4}{7}, \quad P(B\mid\text{抽到 1})=\frac{3}{7}.

若继续抽同一只:

  • 若在 A:剩余为 3 个 1 与 6 个 10(共 9 个),期望 31+6109=7\frac{3\cdot 1+6\cdot 10}{9}=7
  • 若在 B:剩余为 2 个 1 与 7 个 10(共 9 个),期望 2+709=8\frac{2+70}{9}=8

综合期望:

747+837=5277.4286.7\cdot\frac{4}{7}+8\cdot\frac{3}{7}=\frac{52}{7}\approx 7.4286.

若改抽另一只则期望更低,因此最优是“不换”,期望收益为 52/752/7


Original Explanation

Evaluate the expected payout if you switch urns versus if you stay with your current urn. This can be split into 22 cases depending on which urn you select the first chip from. We can assign Urn AA to be the urn that started with 44 ($1(\$1 chips) and Urn $B$ be the one that started with $3$ ($1(\$1 chips). Define the random variable XX as the urn that you selected your first chip from. We can say that P[X=A]=43P[X=B]\mathbb{P}[X = A] = \dfrac{4}{3}\mathbb{P}[X = B] as the Urn AA started with $4$ ($1(\$1 chips) while Urn BB has 33. Therefore, as those 2 values must sum to 11, P[X=A]=47\mathbb{P}[X = A] = \dfrac{4}{7} and P[X=B]=37\mathbb{P}[X = B] = \dfrac{3}{7}.

Now we can consider 22 more cases being if we switch urns or stay with the same urn. Denote P0P_0 be the payout if we stay with the same urn. If we don't switch, then:

E[P0]=E[P0X=A]P[X=A]+E[P0X=B]P[X=B]\mathbb{E}[P_0] = \mathbb{E}[P_0 \mid X = A]\mathbb{P}[X = A] + \mathbb{E}[P_0 \mid X = B]\mathbb{P}[X = B]

If X=AX = A, then after drawing one ($1(\$1 chip), there are 66 ($10(\$10 chips) and 33 ($1(\$1 chips) left, so E[P0X=A]=639=7\mathbb{E}[P_0 \mid X = A] = \dfrac{63}{9} = 7. Similarly, there would be 77 ($10(\$10 chips) and 22 ($1(\$1 chips) after the first draw if X=BX = B, so E[P0X=B]=729=8\mathbb{E}[P_0 \mid X = B] = \dfrac{72}{9} = 8. Plugging these values in yields: E[P0]=747+837=527\mathbb{E}[P_0] = 7 \cdot \dfrac{4}{7} + 8 \cdot \dfrac{3}{7} = \dfrac{52}{7}

Let P1P_1 be your profit if you switch. Similarly, we have: E[P1]=E[P1X=A]P[X=A]+E[P1X=B]P[X=B]\mathbb{E}[P_1] = \mathbb{E}[P_1 \mid X = A]\mathbb{P}[X = A] + \mathbb{E}[P_1 \mid X = B]\mathbb{P}[X = B]

If X=AX = A, then by switching, we pick from an untouched urn BB, so E[P1X=A]=7310=7.3\mathbb{E}[P_1 \mid X = A] = \dfrac{73}{10} = 7.3. Similarly, if X=BX = B, then we pick from an untouched urn AA, so E[P1X=B]=6410=6.4\mathbb{E}[P_1 \mid X = B] = \dfrac{64}{10} = 6.4.

Therefore, E[P1]=6.437+7.347=24235<24535=7\mathbb{E}[P_1] = 6.4 \cdot \dfrac{3}{7} + 7.3 \cdot \dfrac{4}{7} = \dfrac{242}{35} < \dfrac{245}{35} = 7

Therefore, you should not switch, and your expected payout is 527\dfrac{52}{7}.