返回题库

疾病发生率

Disease Incidence

专题
Statistics / 统计
难度
L3

第 1 小问

题目详情

某种疾病的发病率为2%。如果假阴性率为10%,假阳性率为1%,那么检测呈阳性的人实际上患有这种疾病的概率是多少?

A certain disease has an incidence rate of 2%. If the false negative rate is 10% and the false positive rate is 1%, what is the probability that a person who tests positive actually has the disease?

解析

为了解决这个问题,我们首先要了解我们正在寻找什么以及我们已经获得了哪些信息。

我们将 AA 定义为某人患有该疾病的事件,并将 BB 定义为该人对该疾病检测呈阳性的事件。我们的目标是找到:

P(Person has the diseasePerson tests positive)=P(AB)P(\textrm{Person has the disease} \mid \textrm{Person tests positive}) = P(A \mid B)

我们已经知道以下内容:

根据发病率: -P(Having the disease)=P(A)=0.02P(\textrm{Having the disease}) = P(A) = 0.02 -P(Not having the disease)=¬P(A)=10.02=0.98P(\textrm{Not having the disease}) = \neg P(A) = 1 - 0.02 = 0.98

基于假阴性率:

  • P(Testing negativeHave the disease)=P(¬BA)=0.1P(\textrm{Testing negative} \mid \textrm{Have the disease}) = P(\neg B \mid A ) = 0.1

基于误报率: -P(Testing positiveNot having the disease)=P(B¬A)=0.01P(\textrm{Testing positive} \mid \textrm{Not having the disease}) = P(B \mid \neg A) = 0.01

鉴于我们已经有了这些先验知识,我们可以应用贝叶斯规则来找到这个问题的解决方案。贝叶斯规则定义为:

P(AB)=P(BA)×P(A)P(B)P(A \mid B) = \frac{P(B \mid A) \times P(A)}{P(B)}

代入我们所知道的结果:

P(AB)=P(BA)×P(A)P(B)=(1P(¬BA))×P(A)(1P(¬BA))×P(A)+P(B¬A)×P(¬A))=(10.1)×0.02(10.1)×0.02+(0.01)×0.98P(A \mid B) = \frac{P(B \mid A) \times P(A)}{P(B)} = \frac{(1 - P(\neg B|A)) \times P(A)}{(1 - P(\neg B|A)) \times P(A) + P(B \mid \neg A) \times P(\neg A))} = \frac{(1 - 0.1) \times 0.02}{(1 - 0.1) \times 0.02 + (0.01) \times 0.98} P(AB)0.647P(A \mid B) \approx 0.647

Original Explanation

To solve this question, let's first understand what we are looking to find and what information we are already given.

We'll define AA to be the event that a person has the disease and we'll define BB to be the event that the person tests positive for the disease. Our goal is to find:

P(Person has the diseasePerson tests positive)=P(AB)P(\textrm{Person has the disease} \mid \textrm{Person tests positive}) = P(A \mid B)

We already know the following:

Based on the incidence rate:

  • P(Having the disease)=P(A)=0.02P(\textrm{Having the disease}) = P(A) = 0.02
  • P(Not having the disease)=¬P(A)=10.02=0.98P(\textrm{Not having the disease}) = \neg P(A) = 1 - 0.02 = 0.98

Based on the false negative rate:

  • P(Testing negativeHave the disease)=P(¬BA)=0.1P(\textrm{Testing negative} \mid \textrm{Have the disease}) = P(\neg B \mid A ) = 0.1

Based on the false positive rate:

  • P(Testing positiveNot having the disease)=P(B¬A)=0.01P(\textrm{Testing positive} \mid \textrm{Not having the disease}) = P(B \mid \neg A) = 0.01

Given that we already have this prior knowledge, we can apply Bayes rule to find the solution to this problem. Bayes rule is defined as:

P(AB)=P(BA)×P(A)P(B)P(A \mid B) = \frac{P(B \mid A) \times P(A)}{P(B)}

Plugging in what we know we get:

P(AB)=P(BA)×P(A)P(B)=(1P(¬BA))×P(A)(1P(¬BA))×P(A)+P(B¬A)×P(¬A))=(10.1)×0.02(10.1)×0.02+(0.01)×0.98P(A \mid B) = \frac{P(B \mid A) \times P(A)}{P(B)} = \frac{(1 - P(\neg B|A)) \times P(A)}{(1 - P(\neg B|A)) \times P(A) + P(B \mid \neg A) \times P(\neg A))} = \frac{(1 - 0.1) \times 0.02}{(1 - 0.1) \times 0.02 + (0.01) \times 0.98} P(AB)0.647P(A \mid B) \approx 0.647

第 2 小问

题目详情

一家医学研究实验室研制出一种新药,声称可以治愈 80% 的新型疾病患者。在检查该药物后,FDA 认为他们关于该药物功效的说法被夸大了。为了验证他们的假设,FDA 对 20 名患有这种新疾病的人使用该药物,并观察 XX,即通过该药物治愈的人数。更正式地说,FDA 正在进行以下测试:

H0:p=0.8H_0: p = 0.8Ha:p<0.8H_a: p < 0.8

假设使用 x12x \leq 12 的排斥区域,本研究的显着性水平 α\alpha 是多少?

A medical research lab has concocted a new drug that they claim will cure 80% of people suffering from a novel disease. After examining the drug, the FDA believes that their claims regarding the efficacy of the drug are inflated. To validate their hypothesis, the FDA administers the drug to 20 people with the novel disease and observes, XX, the number of individuals cured by the drug. More formally, the FDA is conducting the following test:

H0:p=0.8H_0: p = 0.8 and Ha:p<0.8H_a: p < 0.8.

Assuming a rejection region of x12x \leq 12 is used, what is the significance level, α\alpha, for this study?

解析

回想一下,显着性水平是指当原假设实际上为真时我们拒绝原假设的概率。因此,我们可以将 α\alpha 求解为:

α=P(X12p=0.8)\alpha = P( X \leq 12 \mid p = 0.8)

我们首先考虑治愈率为 80% 且恰好 X=12X = 12 人被治愈的概率。这可以表述为

P(X=12p=0.8)=(2012) 0.812×0.28P(X = 12 \mid p=0.8) = {20 \choose 12} \ 0.8^{12} \times 0.2^{8}

现在我们可以将之前的发现推广到 X12X \leq 12

i=012(20i) 0.8i0.220i\sum_{i=0}^{12} {20\choose i} \ 0.8^i * 0.2^{20-i}

Original Explanation

Recall, that the significance level refers to the probability that we reject the null hypothesis when it is in fact true. Therefore, we can solve for α\alpha as:

α=P(X12p=0.8)\alpha = P( X \leq 12 \mid p = 0.8)

Let's first consider the probability that there is an 80% cure rate and exactly X=12X = 12 people were cured. This can be formulated as

P(X=12p=0.8)=(2012) 0.812×0.28P(X = 12 \mid p=0.8) = {20 \choose 12} \ 0.8^{12} \times 0.2^{8}

Now we can generalize this previous finding for X12X \leq 12

i=012(20i) 0.8i0.220i\sum_{i=0}^{12} {20\choose i} \ 0.8^i * 0.2^{20-i}

第 3 小问

题目详情

一家医学研究实验室研制出一种新药,声称可以治愈 80% 的新型疾病患者。在检查该药物后,FDA 认为他们关于该药物功效的说法被夸大了。为了验证他们的假设,FDA 对 20 名患有这种新疾病的人使用该药物,并观察 XX,即通过该药物治愈的人数。更正式地说,FDA 正在进行以下测试:

假设X12X \leq 12的拒绝区域,当p=0.6p = 0.6时测试的功效是多少?

A medical research lab has concocted a new drug that they claim will cure 80% of people suffering from a novel disease. After examining the drug, the FDA believes that their claims regarding the efficacy of the drug are inflated. To validate their hypothesis, the FDA administers the drug to 20 people with the novel disease and observes, XX, the number of individuals cured by the drug. More formally, the FDA is conducting the following test:

Assuming a rejection region of X12X \leq 12, what is the power of the test when p=0.6p = 0.6?

解析

功效为 1β1-\beta,其中 β\beta 是当 HaH_a 为真时检验统计量不在拒绝区域内的概率。为了计算功率,我们必须求解 β\beta

β=P(We fail to reject H0Ha is true)=P(x>12p=0.6)\beta = P(\textrm{We fail to reject } H_0 \mid H_a \textrm{ is true}) = P(x > 12 \mid p=0.6)

=i=1320(20i)×0.6i×0.420i0.416=\sum_{i=13} ^{20} {20 \choose i} \times 0.6^{i} \times 0.4^{20-i} \approx 0.416

因此,本次测试的功效为:1β0.5841-\beta \approx 0.584


Original Explanation

Power is 1β1-\beta, where β\beta is the probability that the test statistic is not in the rejection region when HaH_a is true. In order to calculate power, we must solve for β\beta.

β=P(We fail to reject H0Ha is true)=P(x>12p=0.6)\beta = P(\textrm{We fail to reject } H_0 \mid H_a \textrm{ is true}) = P(x > 12 \mid p=0.6)

=i=1320(20i)×0.6i×0.420i0.416=\sum_{i=13} ^{20} {20 \choose i} \times 0.6^{i} \times 0.4^{20-i} \approx 0.416.

Thus, the power of this test is: 1β0.5841-\beta \approx 0.584