返回题库

最多掷三次骰子的最优停策略

Dice Game

专题
Probability / 概率
难度
L4

题目详情

你最多可以掷一枚公平六面骰 3 次。

第一次或第二次掷出点数 xx 后,你可以选择:

  • 立刻停下并拿到 xx 美元;
  • 或继续掷(放弃当前 xx)。

若到了第三次则必须接受第三次点数。问:这个游戏的价值(期望收益)是多少?最优策略是什么?

You can roll a fair 6-sided die up to 3 times. After the first or the second roll, suppose you see a number xx. You can either take xx dollars and stop, or continue rolling (and forgo xx). If you reach the third roll, you must take whatever value you get. What is the value of this game, and what is the strategy?

解析

第三次若到达,期望为

1+2+3+4+5+66=3.5.\frac{1+2+3+4+5+6}{6}=3.5.

第二次看到 xx:若继续的期望是 3.5,则当 x>3.5x>3.5(即 4,5,64,5,6)应停,否则继续。于是第二次后的期望为

363.5+16(4+5+6)=4.25.\frac{3}{6}\cdot 3.5 + \frac{1}{6}(4+5+6)=4.25.

第一次看到 xx:与 4.25 比较,若 x>4.25x>4.25(即 5,65,6)应停,否则继续。于是从开始的期望为

264.25+16(5+6)=1434.67.\frac{2}{6}\cdot 4.25 + \frac{1}{6}(5+6)=\frac{14}{3}\approx 4.67.

最优策略:

  • 第一次掷到 5 或 6 就停;
  • 第二次掷到 4、5、6 就停;
  • 否则掷到第三次并接受结果。

Original Explanation

  • Final stage (3rd roll): If you get to the 3rd roll, the expected value of that single roll is 1+2+3+4+5+66=3.5.\frac{1+2+3+4+5+6}{6} = 3.5.

  • Second roll: If you see a value xx on the second roll, compare xx to the expected value if you continue (which is 3.5 for the third roll).

    • If x>3.5x > 3.5 (i.e., x=4,5,6x = 4,5,6), stop and take xx.
    • If x3.5x \le 3.5 (i.e., x=1,2,3x = 1,2,3), continue.
    • The expected value after the second roll is thus 36×3.5+16×(4+5+6)=4.25.\frac{3}{6}\times 3.5 + \frac{1}{6}\times (4+5+6) = 4.25.
  • First roll: Compare the first-roll xx to 4.25.

    • If x>4.25x > 4.25 (i.e., x=5x=5 or 66), stop and take xx.
    • If x4.25x \le 4.25 (i.e., x=1,2,3,4x=1,2,3,4), continue.
    • The expected value from the first roll is then 26×4.25+16×(5+6)=1434.67.\frac{2}{6}\times 4.25 + \frac{1}{6}\times (5+6) = \frac{14}{3} \approx 4.67.

Thus, the game is worth 1434.67\tfrac{14}{3}\approx 4.67.
Optimal strategy:

  • Stop if the roll is 5 or 6 on the first roll,
  • Stop if the roll is 4, 5, or 6 on the second roll,
  • Take whatever the third roll is if you reach that point.