R² 的范围
R Squared Range
题目详情
使用 OLS,把 对 回归后得到模型的 为 0.15。再把 对 回归,这次模型的 为 0.2。设把 同时对 回归所得模型的 下界和上界分别为 。请把答案写成 。
Using OLS, we regress onto and find that the model has an of 0.15. We also regress onto but this time the model has an of 0.2. Let denote the lower and upper-bound of the of a model which regresses onto . Express your answer as .
解析
设 是把 仅对 回归得到的 , 是把 仅对 回归得到的 。
下界: 在 OLS 中,加入解释变量不会降低 。因此,把 对 回归得到的 至少与单变量回归中较大的那个一样大:
上界: 没有任何限制阻止 和 一起把 完全解释掉,即使它们各自单独对 的解释力都不强。比如, 可能落在 的张成空间中,但与每个变量单独对齐的程度都较低。因此,联合回归的 可以高达 1:
所以,
Original Explanation
Let be the from regressing on alone, and be the from regressing on alone.
Lower bound: Adding regressors in OLS cannot reduce . Therefore, the from regressing y on must be at least as large as the larger of the two individual values:
Upper bound: There is no restriction that prevents and together from perfectly explaining , even if each variable alone explains little. For example, could lie in the span of while being poorly aligned with each variable individually. Hence, the of the joint regression can be as high as 1:
Therefore,