Orthogonalization method
Primary authors of the description: Daria Inzhelevskaya (text), Alexey Frolov (editing).
The Gram--Schmidt orthogonalization is a method that constructs a set of orthogonal vectors [math]\mathbf{b}_1,\;\ldots,\;\mathbf{b}_N[/math] or a set of orthonormal vectors [math]\mathbf{e}_1,\;\ldots,\;\mathbf{e}_N[/math] from a given set of linearly independent vectors [math]\mathbf{a}_1,\;\ldots,\;\mathbf{a}_N[/math]. This is done in such a way that each vector [math]\mathbf{b}_j[/math] or [math]\mathbf{e}_j[/math] is a linear combination of the vectors [math]\mathbf{a}_1,\;\ldots,\;\mathbf{a}_j[/math]. The process may be used for obtaining the QR decomposition, where the original vectors are the columns of a given matrix, while the columns of Q are the result of their orthogonalization. Thus, unlike the Givens (rotation) and Householder (reflection) methods, which are based on left unitary/orthogonal reductions to triangular form, the orthogonalization method reduces the original matrix to a unitary/orthogonal matrix by right non-orthogonal (triangular) transformations.
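The following minimal sketch, assuming NumPy is available, only makes the relationship A = QR concrete; note that numpy.linalg.qr uses Householder reflections internally, so it illustrates the structure of the factors rather than the Gram--Schmidt process itself.

```python
# Illustration of the QR relationship A = Q R, assuming NumPy is available.
# numpy.linalg.qr uses Householder reflections internally; it is used here
# only to show the structure of the factors, not the Gram--Schmidt process.
import numpy as np

A = np.array([[1.0, 1.0, 0.0],
              [1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])          # columns play the role of a_1, a_2, a_3

Q, R = np.linalg.qr(A)

print(np.allclose(Q.T @ Q, np.eye(3)))   # True: the columns of Q are orthonormal
print(np.allclose(R, np.triu(R)))        # True: R is upper triangular
print(np.allclose(Q @ R, A))             # True: A = Q R
```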
Mathematical foundations of the method
The classical orthogonalization method for the QR decomposition of a square matrix (real version) is fairly simple. However, due to its instability, which manifests itself in the loss of orthogonality of the resulting systems, the method is very rarely used in practice.
Let [math]\mathbf{a}_1,\;\ldots,\;\mathbf{a}_N[/math] be linearly independent vectors. Define the projection of a vector [math]\mathbf{a}[/math] on (the direction of) a vector [math]\mathbf{b}[/math] by the formula [math]\mathbf{proj}_{\mathbf{b}}\,\mathbf{a} = {\langle \mathbf{a}, \mathbf{b} \rangle \over \langle \mathbf{b}, \mathbf{b}\rangle} \mathbf{b} ,[/math]
where [math]\langle \mathbf{a}, \mathbf{b} \rangle[/math] is the scalar product of the vectors [math]\mathbf{a}[/math] and [math]\mathbf{b}[/math].
In k-dimensional real space, the scalar product of the vectors [math]\mathbf{a} = [a_1, a_2, \ldots, a_k][/math] and [math]\mathbf{b} = [b_1, b_2, \ldots, b_k][/math] is defined as
- [math]\langle \mathbf{a}, \mathbf{b} \rangle=\sum_{i=1}^k a_ib_i=a_1b_1+a_2b_2+\cdots+ a_kb_k[/math].
The above operator projects the vector [math]\mathbf{a}[/math] on the direction of the vector [math]\mathbf{b}[/math].
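A small sketch of this projection operator, assuming real vectors stored as NumPy arrays (the function name proj is chosen here only for illustration):

```python
# Sketch of the projection operator proj_b(a) = (<a, b> / <b, b>) b,
# assuming NumPy; the name proj is chosen for illustration only.
import numpy as np

def proj(b, a):
    """Projection of vector a on the direction of vector b."""
    return (np.dot(a, b) / np.dot(b, b)) * b

a = np.array([3.0, 4.0])
b = np.array([1.0, 0.0])
print(proj(b, a))                    # [3. 0.] -- the component of a along b
print(np.dot(a - proj(b, a), b))     # 0.0     -- the residual is orthogonal to b
```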
The orthogonality of the first two vectors in the orthogonalization process below is attained at Step (2).
The classical Gram--Schmidt process is performed as follows:
- [math]\begin{array}{lclr} \mathbf{b}_1 &=& \mathbf{a}_1 & (1)\\ \mathbf{b}_2 &=& \mathbf{a}_2 - \mathbf{proj}_{\mathbf{b}_1}\,\mathbf{a}_2 & (2)\\ \mathbf{b}_3 &=& \mathbf{a}_3 - \mathbf{proj}_{\mathbf{b}_1}\,\mathbf{a}_3 - \mathbf{proj}_{\mathbf{b}_2}\,\mathbf{a}_3 & (3)\\ \mathbf{b}_4 &=& \mathbf{a}_4 - \mathbf{proj}_{\mathbf{b}_1}\,\mathbf{a}_4 - \mathbf{proj}_{\mathbf{b}_2}\,\mathbf{a}_4 - \mathbf{proj}_{\mathbf{b}_3}\,\mathbf{a}_4 & (4)\\ &\vdots &&\\ \mathbf{b}_N &=& \mathbf{a}_N - \displaystyle\sum_{j=1}^{N-1}\mathbf{proj}_{\mathbf{b}_j}\,\mathbf{a}_N & (N) \end{array}[/math]
From each vector [math]\mathbf{b}_j \;(j = 1, \ldots, N)[/math], one can obtain a normalized vector [math]\mathbf{e}_j = {\mathbf{b}_j \over \| \mathbf{b}_j \|}[/math] (the normalized vector has the same direction as the original one, while its norm is 1). Here, the norm is consistent with the scalar product: [math]\| x \| = \sqrt{\langle x, x \rangle}[/math].
The output of the Gram--Schmidt process:
The system [math]\mathbf{b}_1,\;\ldots,\;\mathbf{b}_N[/math] of orthogonal vectors or the system [math]\mathbf{e}_1,\;\ldots,\;\mathbf{e}_N[/math] of orthonormal vectors.
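A short sketch of the classical process exactly as written in steps (1)--(N), assuming NumPy and taking the input vectors as the columns of a matrix (the function name is chosen here for illustration):

```python
# Classical Gram--Schmidt following steps (1)--(N) above, assuming NumPy.
# The input vectors a_1, ..., a_N are the columns of A; the function name
# classical_gram_schmidt is chosen for illustration only.
import numpy as np

def classical_gram_schmidt(A):
    """Return (B, E): columns of B are the orthogonal b_j, columns of E the orthonormal e_j."""
    A = np.asarray(A, dtype=float)
    N = A.shape[1]
    B = np.zeros_like(A)
    E = np.zeros_like(A)
    for j in range(N):
        b = A[:, j].copy()
        # subtract the projections of a_j on the previously built b_1, ..., b_{j-1}
        for i in range(j):
            b -= (np.dot(A[:, j], B[:, i]) / np.dot(B[:, i], B[:, i])) * B[:, i]
        B[:, j] = b
        E[:, j] = b / np.linalg.norm(b)       # normalization: e_j = b_j / ||b_j||
    return B, E

A = np.array([[1.0, 1.0, 0.0],
              [1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])
B, E = classical_gram_schmidt(A)
print(np.allclose(E.T @ E, np.eye(3)))        # True: e_1, ..., e_N are orthonormal
```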
In practice, the most commonly used form of the method is orthogonalization with re-orthogonalization.
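A hedged sketch of this idea, assuming NumPy: the projection sweep against the already built vectors is simply repeated (two passes are normally enough), which in floating-point arithmetic restores the orthogonality lost by the single-pass classical process. The function name and the test matrix are chosen here only for illustration.

```python
# Classical Gram--Schmidt with re-orthogonalization, assuming NumPy.
# passes=1 is the plain classical process; passes=2 repeats the projection
# sweep, which restores orthogonality in floating-point arithmetic.
import numpy as np

def gram_schmidt_reorth(A, passes=2):
    A = np.asarray(A, dtype=float)
    N = A.shape[1]
    Q = np.zeros_like(A)
    for j in range(N):
        v = A[:, j].copy()
        for _ in range(passes):
            # one projection sweep against the already built q_1, ..., q_{j-1}
            v = v - Q[:, :j] @ (Q[:, :j].T @ v)
        Q[:, j] = v / np.linalg.norm(v)
    return Q

# Ill-conditioned test matrix (Hilbert matrix): the single-pass process loses
# orthogonality, while the two-pass version typically keeps it near machine precision.
n = 10
H = 1.0 / (np.arange(1, n + 1)[:, None] + np.arange(n))
Q1 = gram_schmidt_reorth(H, passes=1)
Q2 = gram_schmidt_reorth(H, passes=2)
print(np.linalg.norm(Q1.T @ Q1 - np.eye(n)))   # orthogonality visibly lost
print(np.linalg.norm(Q2.T @ Q2 - np.eye(n)))   # typically near machine precision
```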