矩阵分解 / Matrix Decomposition

本页结构

核心概念

特征值、特征向量与对角化 Eigenvalues, eigenvectors and diagonalization
SVD 在压缩和因子结构中的作用 SVD as a basis for compression and factor structure
QR、LU、Cholesky 与投影矩阵 QR, LU, Cholesky and projection matrices

学习顺序

根据问题选择分解方法，而不是机械背诵。 Choose a decomposition based on the problem, not by memorized order.
把 SVD 和主方向、降维联系起来。 Relate SVD to principal directions and dimensionality reduction.
知道 Cholesky 分解何时要求正定性。 Know when Cholesky requires positive definiteness.

$A$ is diagonalizable if and only if it has linearly independent eigenvectors, or equivalently, if the geometric multiplicity and the algebraic multiplicity of all the eigenvalues agree. A special case of this is if $A$ has $n$ distinct eigenvalues. Suppose we have eigenvalues $\lambda_1, \dots, \lambda_n$ and corresponding eigenvectors $v_1, \dots, v_n$ . Then

Intuitively, this says that we can find a basis consisting of the eigenvectors of $A$ . Useful for computing large powers of $A$ , as $A^n = XD^n X^{-1}$ . An important example is $A$ being real and symmetric implies $A$ is diagonalizable.

直观地说，我们可以找到一个由 $A$ 的特征向量组成的基。对于计算 $A$ 的大幂很有用，如 $A^n = XD^n X^{-1}$ 。一个重要的例子是 $A$ 是实数且对称意味着 $A$ 是可对角化的。

SVD is powerful in low-rank approximations of matrices. Unlike eigenvalue decomposition, SVD uses two unique bases (left/right singular vectors). For orthogonal matrices $U (m \times m), V (n \times n)$ and diagonal matrix $\Sigma (m \times n)$ with nonnegative diagonal entries in nonincreasing order, we can write any $m \times n$ matrix $A$ as:

SVD 在矩阵的低阶近似方面非常强大。与特征值分解不同，SVD 使用两个唯一的基（左/右奇异向量）。对于正交矩阵 $U (m \times m), V (n \times n)$ 和对角矩阵 $\Sigma (m \times n)$ 具有非递增顺序的非负对角线条目，我们可以将任何 $m \times n$ 矩阵 $A$ 写为：

Intuitively, this says that we can express $A$ as a diagonal matrix with suitable choices of (orthogonal) bases.

直观上，这表明我们可以将 $A$ 表示为对角矩阵，并选择合适的（正交）基数。

For nonsingular $A$ , we can write $A = QR$ , where $Q$ is orthogonal and $R$ is an upper triangular matrix with positive diagonal elements. QR decomposition assists in increasing the efficiency of solving $Ax = b$ for nonsingular $A$ :

对于非奇异 $A$ ，我们可以写成 $A = QR$ ，其中 $Q$ 是正交矩阵， $R$ 是具有正对角线元素的上三角矩阵。 QR 分解有助于提高求解非奇异 $A$ 的 $Ax = b$ 的效率：

QR decomposition is very useful in efficiently solving large numerical systems and inversion of matrices. Furthermore, it is also used in least-squares when our data is not full rank.

QR 分解对于有效求解大型数值系统和矩阵求逆非常有用。此外，当我们的数据不是满秩时，它也可用于最小二乘法。

For nonsingular $A$ , we can write $A = LU$ , where $L$ is a lower and $U$ is an upper triangular matrix. This decomposition assists in solving $Ax = b$ as well as computing the determinant:

对于非奇异 $A$ ，我们可以写成 $A = LU$ ，其中 $L$ 是下三角矩阵， $U$ 是上三角矩阵。此分解有助于求解 $Ax = b$ 以及计算行列式：

If $A$ is symmetric positive definite, then $A$ can be expressed as $A = R^\intercal R$ via Cholesky decomposition, where $R$ is an upper triangular matrix with positive diagonal entries. Cholesky decomposition is essentially LU decomposition with $L = U^\intercal$ . These decompositions are both useful for solving large linear systems.

如果 $A$ 是对称正定的，则 $A$ 可以通过 Cholesky 分解表示为 $A = R^\intercal R$ ，其中 $R$ 是对角线项为正的上三角矩阵。 Cholesky 分解本质上是采用 $L = U^\intercal$ 的 LU 分解。这些分解对于求解大型线性系统都很有用。

Fix a vector $v \in \mathbb{R}^n$ . The projection of $x \in \mathbb{R}^n$ onto $v$ is given by

修复向量 $v \in \mathbb{R}^n$ 。 $x \in \mathbb{R}^n$ 到 $v$ 的投影由下式给出

More generally, if $S = \text{Span}\{v_1, \dots, v_k\} \subseteq \mathbb{R}^n$ has orthogonal basis $\{v_1, \dots, v_k\}$ , then the projection of $x \in \mathbb{R}^n$ onto $S$ is given by

The main property is that $\text{proj}_S(x) \in S$ and $x - \text{proj}_S(x)$ is orthogonal to any $s \in S$ . Linear Regression can be viewed as a projection of our observed data onto the subspace formed by the span of the collected data.

主要属性是 $\text{proj}_S(x) \in S$ 和 $x - \text{proj}_S(x)$ 与任何 $s \in S$ 正交。线性回归可以看作是我们观察到的数据到由收集数据的跨度形成的子空间的投影。

补充讲解

特征向量解释方向

Eigenvectors explain directions

特征分解找出在线性变换下只被缩放的方向。在协方差矩阵中，这些方向对应主要风险轴。

Eigen-decomposition identifies directions that are only scaled by the transformation. In covariance matrices, these directions are principal risk axes.

SVD 是稳健默认工具

SVD is the robust default

SVD 可处理矩形矩阵和秩亏矩阵，因此常用于 PCA、最小二乘诊断和降维。

SVD works for rectangular and rank-deficient matrices, making it the preferred tool for PCA, least squares diagnostics, and dimensionality reduction.

因子模型也是分解

Factor models are decompositions

很多量化因子模型把收益拆成系统性部分和残差。矩阵分解让这种拆分的几何结构更清楚。

Many quant factor models decompose returns into systematic components and residuals. Matrix decompositions make the geometry of that split explicit.

矩阵分解

本页结构

核心概念

学习顺序

可对角矩阵

奇异值分解

二维码分解

LU 和 Cholesky 分解

预测

补充讲解

特征向量解释方向

SVD 是稳健默认工具

因子模型也是分解