返回题库

最多掷 3 次骰:最优停策略

roll a single die no more than three times

专题
Probability / 概率
难度
L4

题目详情

I will roll a single die no more than three times. You can stop me immediately after the first roll, or immediately after the second, or you can wait for the third. I will pay you the same number of dollars as there are dots on the single upturned face on my last roll (roll number three unless you stop me sooner). What is your playing strategy? 18

解析

用倒推。

第 3 次是最后一次,不可再停。

看到第 2 次点数 xx 时:若继续则期望为 E[第3次]=3.5\mathbb{E}[\text{第3次}]=3.5

因此第 2 次最优:

  • x4x\ge 4,停;
  • x3x\le 3,继续。

于是“从第 1 次选择继续”对应的期望收益为

16(33.5+4+5+6)=4.25.\frac{1}{6}(3\cdot 3.5+4+5+6)=4.25.

所以第 1 次最优:

  • 若第 1 次为 5 或 6,停;
  • 若为 1–4,继续。

该策略下期望收益为

16(44.25+5+6)=1434.667.\frac{1}{6}(4\cdot 4.25+5+6)=\boxed{\frac{14}{3}}\approx 4.667.