Method level

Difference between revisions of "Orthogonalization method"

From Algowiki
Jump to navigation Jump to search
[unchecked revision][unchecked revision]
Line 1: Line 1:
 
{{level-m}}
 
{{level-m}}
  
The basic authors of the description: [[Участник:DVIN|Инжелевская Дарья Валерьевна]](text), [[Участник:Frolov|А.В.Фролов]](editing)  
+
The basic authors of the description: [[:ru:Участник:DVIN|Daria Inzhelevskaya]](text), [[:ru:Участник:Frolov|Alexey Frolov]](editing)  
  
 
The '''Gram--Schmidt orthogonalization''' is a method that constructs a set of orthogonal vectors <math>{\displaystyle \mathbf {b}_{1},\;\ldots ,\;\mathbf {b} _{N}} </math> or a set of orthonormal vectors <math>{\displaystyle \mathbf {e} _{1},\;\ldots ,\;\mathbf {e}_{N}} </math> from a given set of linearly independent vectors <math>{\displaystyle \mathbf {a} _{1},\;\ldots ,\;\mathbf {a} _{N}}</math>. This is done in such a way that each vector  
 
The '''Gram--Schmidt orthogonalization''' is a method that constructs a set of orthogonal vectors <math>{\displaystyle \mathbf {b}_{1},\;\ldots ,\;\mathbf {b} _{N}} </math> or a set of orthonormal vectors <math>{\displaystyle \mathbf {e} _{1},\;\ldots ,\;\mathbf {e}_{N}} </math> from a given set of linearly independent vectors <math>{\displaystyle \mathbf {a} _{1},\;\ldots ,\;\mathbf {a} _{N}}</math>. This is done in such a way that each vector  

Revision as of 16:56, 16 March 2018


The basic authors of the description: Daria Inzhelevskaya(text), Alexey Frolov(editing)

The Gram--Schmidt orthogonalization is a method that constructs a set of orthogonal vectors [math]{\displaystyle \mathbf {b}_{1},\;\ldots ,\;\mathbf {b} _{N}} [/math] or a set of orthonormal vectors [math]{\displaystyle \mathbf {e} _{1},\;\ldots ,\;\mathbf {e}_{N}} [/math] from a given set of linearly independent vectors [math]{\displaystyle \mathbf {a} _{1},\;\ldots ,\;\mathbf {a} _{N}}[/math]. This is done in such a way that each vector [math]{\displaystyle \mathbf {b} _{j}} [/math] or [math]{\displaystyle \mathbf {e} _{j}}[/math] is a linear combination of the vectors [math]{\displaystyle \mathbf {a} _{1},\;\ldots ,\; \mathbf {a} _{j}}[/math]. The process may be used for obtaining the QR decomposition, where the system of original vectors is the columns of a given matrix, while the columns of Q are the result of orthogonalization. Thus, unlike the Givens (rotation) and Householder (reflection) methods, which are based on the left unitary/orthogonal reduction to triangular form, the orthogonalization method reduces the original matrix by right non-orthogonal (triangular) transformations to a unitary/orthogonal matrix.

Mathematical foundations of the method

The classical orthogonalization method for the QR decomposition of a square matrix (real version) is fairly simple. However, due to its instability, which manifests itself in the non-orthogonality of resulting systems, the method is very rarely used in practice.

Let [math]\mathbf{a}_1,\;\ldots,\;\mathbf{a}_N[/math] be linearly independent vectors. Define the projection of a vector [math]\mathbf{a}[/math] on (the direction of) a vector [math]\mathbf{b}[/math] by the formula [math]\mathbf{proj}_{\mathbf{b}}\,\mathbf{a} = {\langle \mathbf{a}, \mathbf{b} \rangle \over \langle \mathbf{b}, \mathbf{b}\rangle} \mathbf{b} ,[/math]

where [math]\langle \mathbf{a}, \mathbf{b} \rangle[/math] is the scalar product of the vectors [math]\mathbf{a}[/math] and [math]\mathbf{b}[/math].

In the k-dimensional real space, the scalar product of the vectors [math]\mathbf{ a= [a_1, a_2, ...,a_k]}[/math] and [math]\mathbf{ b= [b_1, b_2, ..., b_k]}[/math] is defined as

[math]\langle \mathbf{a}, \mathbf{b} \rangle=\sum_{i=1}^k a_ib_i=a_1b_1+a_2b_2+\cdots+ a_kb_k[/math].

The above operator projects the vector [math]\mathbf{a}[/math] on the direction of the vector [math]\mathbf{b}[/math].

The orthogonality of the first two vectors in the orthogonalization process below is attained at Step (2).

The classical Gram--Schmidt process is performed as follows:

[math] {\begin{array}{lclr} {\mathbf {b}}_{1}&=&{\mathbf {a}}_{1}&(1)\\ {\mathbf {b}}_{2}&=&{\mathbf {a}}_{2}-{\mathbf {proj}}_{{{\mathbf {b}}_{1}}}\,{\mathbf {a}}_{2}&(2)\\ {\mathbf {b}}_{3}&=&{\mathbf {a}}_{3}-{\mathbf {proj}}_{{{\mathbf {b}}_{1}}}\,{\mathbf {a}}_{3}-{\mathbf {proj}}_{{{\mathbf {b}}_{2}}}\,{\mathbf {a}}_{3}&(3)\\ {\mathbf {b}}_{4}&=&{\mathbf {a}}_{4}-{\mathbf {proj}}_{{{\mathbf {b}}_{1}}}\,{\mathbf {a}}_{4}-{\mathbf {proj}}_{{{\mathbf {b}}_{2}}}\,{\mathbf {a}}_{4}-{\mathbf {proj}}_{{{\mathbf {b}}_{3}}}\,{\mathbf {a}}_{4}&(4)\\ &\vdots &&\\{\mathbf {b}}_{N}&=&{\mathbf {a}}_{N}-\displaystyle \sum _{{j=1}}^{{N-1}}{\mathbf {proj}}_{{{\mathbf {b}}_{j}}}\,{\mathbf {a}}_{N}&(N) \end{array}} [/math]


From each vector [math]\mathbf{b}_j \;(j = 1 \ldots N)[/math], one can obtain a normalized vector: [math]\mathbf{e}_j = {\mathbf{b}_j\over \| \mathbf{b}_j \|}[/math] (the normalized vector has the same direction as the original one, while its norm is 1 ). Here, the norm is consistent with the scalar product: [math]\| x \| = \sqrt{\langle x, x \rangle}[/math]

The output of the Gram--Schmidt process:

The system [math]\mathbf{b}_1,\;\ldots,\;\mathbf{b}_N[/math] of orthogonal vectors or the system [math]\mathbf{e}_1,\;\ldots,\;\mathbf{e}_N[/math] of orthonormal vectors.

In practice, the most often used form of the method is the orthogonalization method with re-orthogonalization.