返回题库

回归系数

Regression Coefficients

专题
Machine Learning / 机器学习
难度
L3

题目详情

设有两个数据集 XXYY,满足 Var(X)=10Var(X) = 10Var(Y)=20Var(Y) = 20。我们做如下线性回归:

yαx+βxxy \sim \alpha_x + \beta_xx

并得到 βx=1\beta_x = 1。现在再做回归:

xαy+βyyx \sim \alpha_y + \beta_yy

那么 βy\beta_y 的值是多少?

Suppose that we have two datasets XX and YY with Var(X)=10Var(X) = 10 and Var(Y)=20Var(Y) = 20. We perform the linear regression:

yαx+βxxy \sim \alpha_x + \beta_xx

and obtain βx=1\beta_x = 1. Suppose now perform the regression:

xαy+βyyx \sim \alpha_y + \beta_yy

What is the value of βy\beta_y?

解析

rrXXYY 的皮尔逊相关系数。那么

βx=σyσx\beta_x = \frac{\sigma_y}{\sigma_x} βy=rσxσy\beta_y = r \frac{\sigma_x}{\sigma_y}

因此:

βy=σx2βxσy2βy=βxσx2σy2\beta_y = \frac{\sigma^2_x}{\beta_x\sigma^2_y} \to \beta_y = \beta_x\frac{\sigma^2_x}{\sigma^2_y}

把题目给定的常数代入:

βy=1×1020=12\beta_y = 1 \times \frac{10}{20} = \frac{1}{2}

Original Explanation

Let rr be the Pearson Correlation Coefficient of XX and YY. Then

βx=σyσx\beta_x = \frac{\sigma_y}{\sigma_x} βy=rσxσy\beta_y = r \frac{\sigma_x}{\sigma_y}

Therefore:

βy=σx2βxσy2βy=βxσx2σy2\beta_y = \frac{\sigma^2_x}{\beta_x\sigma^2_y} \to \beta_y = \beta_x\frac{\sigma^2_x}{\sigma^2_y}

Plugging in the constants we get:

βy=1×1020=12\beta_y = 1 \times \frac{10}{20} = \frac{1}{2}