Skip to main content

Section 5.3 Inner product space

Subsection 5.3.1 Abstract Vector Space

  1. Let \(\mathbf{P}_n(x)=\{a_0+a_1x+\ldots+a_nx^n|a_i\in \mathbb{R}\}\) is a vector space with two operations
    \begin{align*} (a_0+a_1x \amp +\ldots+a_nx^n) +(b_0+b_1x+\ldots+b_nx^n) \\ \amp = (a_0+b_0)+(a_1+b_1)x+\ldots+(a_n+b_n)x^n \end{align*}
    \begin{equation*} k(a_0+a_1x+\ldots+a_nx^n)=(k a_0)+(k a_1)x+\ldots+(k a_n)x^n \end{equation*}
  2. Define \(C[a, b]\) be the set of all real-valued continuous functions defined on the interval \([a,b]\text{.}\) \(C[a,b]\) is a vector space with operations
    \begin{equation*} (f+g)(x)=f(x)+g(x) \text{ and } (c f)(x)=c[f(x)]\text{.} \end{equation*}

Subsection 5.3.2 Inner Product Space

Let \(\mathbf{u}, \mathbf{v}\text{,}\) and \(\mathbf{w}\) be vectors in a vector space \(V\text{,}\) and let \(c\) be any scalar. An inner product on \(V\) is a function that associates a real number \(\langle\mathbf{u}, \mathbf{v}\rangle\) with each pair of vectors \(\mathbf{u}\) and \(\mathbf{v}\) and satisfies the axioms listed below.
  1. \(\displaystyle \langle\mathbf{u}, \mathbf{v}\rangle=\langle\mathbf{v}, \mathbf{u}\rangle\)
  2. \(\displaystyle \langle\mathbf{u}, \mathbf{v}+\mathbf{w}\rangle=\langle\mathbf{u}, \mathbf{v}\rangle+\langle\mathbf{u}, \mathbf{w}\rangle\)
  3. \(\displaystyle c\langle\mathbf{u}, \mathbf{v}\rangle=\langle c \mathbf{u}, \mathbf{v}\rangle\)
  4. \(\langle\mathbf{v}, \mathbf{v}\rangle \geq 0\text{,}\) and \(\langle\mathbf{v}, \mathbf{v}\rangle=0\) if and only if \(\mathbf{v}=\mathbf{0}\text{.}\)

Definition 5.3.1. Inner Product Space.

A vector space \(V\) with an inner product is called an inner product space.

Subsection 5.3.3 Gram-Schmidt Process in an Inner Product Space

Example: In \(C[0,2]\text{,}\) find an orthonormal basis of the space \(W=\operatorname{span}\{1,x,x^{2}\}\text{.}\)

Subsection 5.3.4 QR Decomposition

The Gram-Schmidt process has a matrix interpretation called QR decomposition. For any \(m \times n\) matrix \(A\) with linearly independent columns (full column rank), we can factor \(A\) as the product of an orthogonal matrix and an upper triangular matrix.

Definition 5.3.5. QR Decomposition.

Let \(A\) be an \(m \times n\) matrix with linearly independent columns. Then \(A\) can be factored as
\begin{equation*} A = QR \end{equation*}
where \(Q\) is an \(m \times n\) matrix with orthonormal columns, and \(R\) is an \(n \times n\) upper triangular matrix with positive diagonal entries.
Connection to Gram-Schmidt: If we apply the Gram-Schmidt process to the columns of \(A = [\mathbf{a}_1 | \mathbf{a}_2 | \cdots | \mathbf{a}_n]\text{,}\) we obtain orthonormal vectors \(\mathbf{q}_1, \mathbf{q}_2, \ldots, \mathbf{q}_n\text{.}\) These form the columns of \(Q = [\mathbf{q}_1 | \mathbf{q}_2 | \cdots | \mathbf{q}_n]\text{.}\)
The matrix \(R\) encodes the coefficients from the Gram-Schmidt process. Specifically, when we write
\begin{align*} \mathbf{a}_1 &= r_{11}\mathbf{q}_1\\ \mathbf{a}_2 &= r_{12}\mathbf{q}_1 + r_{22}\mathbf{q}_2\\ \mathbf{a}_3 &= r_{13}\mathbf{q}_1 + r_{23}\mathbf{q}_2 + r_{33}\mathbf{q}_3\\ &\vdots \end{align*}
these coefficients \(r_{ij}\) become the entries of the upper triangular matrix \(R\text{.}\)
Example: Compute the QR decomposition of a \(3 \times 3\) matrix
\begin{equation*} A = \begin{pmatrix} 1 \amp 1 \amp 0\\ 1 \amp 0 \amp 1\\ 0 \amp 1 \amp 1 \end{pmatrix}\text{.} \end{equation*}
Geometric Interpretation: The QR decomposition gives us two complementary views of the column space:
  • \(Q\) provides an orthonormal basis for \(\text{Col}(A)\)
  • \(R\) describes how to express the original columns in terms of this orthonormal basis
  • The transformation from \(A\) to \(Q\) preserves all geometric properties (lengths, angles, etc.) within the column space
Connection to Previous Subsection: QR decomposition is precisely the matrix formulation of the Gram-Schmidt process. While Gram-Schmidt gives us the step-by-step procedure for orthogonalization, QR decomposition packages this into a compact matrix factorization. Both approaches:
  • Start with linearly independent vectors
  • Produce an orthonormal basis for the same subspace
  • Preserve the spanning relationships between vectors
  • Enable efficient computation of projections and least squares solutions