Linear Transformations of Vector Spaces

Section 4.4 Linear Transformations of Vector Spaces

Recall that a transformation \(T:\mathbb{R}^n\rightarrow \mathbb{R}^m\) is called a linear transformation if the following are true for all vectors \(\mathbf{u}\) and \(\mathbf{v}\) in \(\mathbb{R}^n\text{,}\) and scalars \(k\text{.}\)

\begin{equation*} T(k\mathbf{u})= kT(\mathbf{u}), \end{equation*}

\begin{equation*} T(\mathbf{u}+\mathbf{v})= T(\mathbf{u})+T(\mathbf{v}). \end{equation*}

In Section 2.5 and Section 2.6 we looked the properties of these transformations in detail. The main result was that every such linear transformation was a matrix transformation, meaning that there was a \(n \times m\) matrix \(A\) so that \(T(\mathbf{x})=A\mathbf{x}\) for all \(\mathbf{x}\) in \(\R^n\text{.}\) The matrix \(A\) is called the standard matrix of \(T\text{;}\) see Definition 2.6.18. Moreover, Theorem 2.6.16 showed exactly how to find the standard matrix of \(T\text{:}\)

\begin{equation*} A=\begin{bmatrix} | \amp |\amp \amp |\\ T(\mathbf{e}_1) \amp T(\mathbf{e}_2)\amp \dots \amp T(\mathbf{e}_n)\\ |\amp | \amp \amp | \end{bmatrix} \end{equation*}

where \(\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_n\) is the standard basis of \(\R^n\text{.}\)

Subsection 4.4.1 Defining Linear Transformations Again

In this section we extend these ideas to linear transformations between general vector spaces. To start, the definition of linear transformation extends essentially without change.

Definition 4.4.1.

Let \(V\) and \(W\) be vector spaces. A transformation \(T:V\rightarrow W\) is called a linear transformation if the following are true for all vectors \(u\) and \(v\) in \(V\text{,}\) and scalars \(k\text{.}\)

\begin{equation*} T(k u) = kT(u), \end{equation*}

\begin{equation*} T(u+v) = T(u)+T(v). \end{equation*}

This generalization allows for more interesting examples to be studied, such as transformations on polynomials. Here are a few examples:

Example 4.4.2.

Recall that \(\mathbb{M}_{n,n}\) is the set of all \(n\times n\) matrices. In Example 4.1.8, we demonstrated that \(\mathbb{M}_{n,n}\) together with operations of matrix addition and scalar multiplication is a vector space. Fix an \(n\times n\) matrix, \(Q\text{.}\) We can then define a transformation \(T_Q:\mathbb{M}_{n,n}\rightarrow \mathbb{M}_{n,n}\) by

\begin{equation*} T_Q(A)=QA, \end{equation*}

Show that \(T_Q\) is a linear transformation.

Answer.

We verify the linearity properties using properties of matrix-matrix and matrix-scalar multiplication (see Theorem 3.1.14). For \(A\) and \(B\) in \(\mathbb{M}_{n,n}\) and a scalar \(k\) we have:

\begin{equation*} T_Q(kA)=Q(kA)=k(QA)=kT_Q(A) \end{equation*}

together with

\begin{equation*} T_Q(A+B)=Q(A+B)=QA+QB=T_Q(A)+T_Q(B). \end{equation*}

Example 4.4.3.

Recall that \(\mathbb{P}^3\) is the set of polynomials of degree \(3\) or less than \(3\text{.}\) In Example 4.1.14, we showed that \(\mathbb{P}^3\) together with operations of polynomial addition and scalar multiplication is a vector space. Suppose \(T:\R^3\rightarrow\mathbb{P}^3\) is a linear transformation such that

\begin{equation*} T(\mathbf{i})=1+x-2x^2+x^3, \end{equation*}

\begin{equation*} T(\mathbf{j})=x+2x^3, \end{equation*}

\begin{equation*} T(\mathbf{k})=3+x^3. \end{equation*}

Find the image of \(\begin{bmatrix}1\\-2\\1\end{bmatrix}\) under \(T\text{.}\)

Answer.

\begin{align*} T\left(\begin{bmatrix}1\\-2\\1\end{bmatrix}\right)\amp =T(\mathbf{i}-2\mathbf{j}+\mathbf{k})=T(\mathbf{i})-2T(\mathbf{j})+T(\mathbf{k}) \\ \amp =(1+x-2x^2+x^3)-2(x+2x^3)+(3+x^3) \\ \amp =4-x-2x^2-2x^3. \end{align*}

The key step, the one that depends on \(T\) being linear, is the second equality, where we write the image of \(\mathbf{i}-2\mathbf{j}+\mathbf{k}\) under \(T\) in terms of \(T(\mathbf{i})\text{,}\) \(T(\mathbf{j})\text{,}\) and \(T(\mathbf{k})\text{.}\)

Example 4.4.4.

Let \(R:\mathbb{M}_{3,3}\rightarrow \R\) be a transformation such that \(R(A)=\mbox{rank}(A)\text{.}\) Show that \(R\) is not linear.

Answer.

The linearity definition requires that for any two inputs \(u\) and \(v\text{,}\) \(T(u+v)=T(u)+T(v)\text{.}\) To show this is not true, it is enough to show there is one choice of \(u\) and \(v\) where the equality fails. All we need is one! So, to show that this transformation \(R\) is not linear, it suffices to find two matrices \(A\) and \(B\) such that \(R(A+B)\neq R(A)+R(B)\text{.}\)

Observe that if we pick \(A\) and \(B\) so that each has rank \(3\) we would have \(R(A)+R(B)=\mbox{rank}(A)+\mbox{rank}(B)=6\) while \(R(A+B)=\mbox{rank}(A+B)\leq 3\text{.}\) Clearly \(R(A+B)\neq R(A)+R(B)\text{.}\)

This argument is sufficient, as it shows there must be a choice that works, but if we want a specific choice, we can find one.

Let

\begin{equation*} A=\begin{bmatrix}1\amp 0\amp 0\\0\amp 1\amp 0\\0\amp 0\amp 1\end{bmatrix} \quad\text{and}\quad B=\begin{bmatrix}-1\amp 0\amp 0\\0\amp 1\amp 0\\0\amp 0\amp -1\end{bmatrix}. \end{equation*}

Then

\begin{equation*} T(A)=3\quad\text{and}\quad T(B)=3 \end{equation*}

and

\begin{equation*} T(A+B)=T\left(\begin{bmatrix}0\amp 0\amp 0\\0\amp 2\amp 0\\0\amp 0\amp 0\end{bmatrix}\right)=1. \end{equation*}

Thus, \(1=T(A+B)\neq T(A)+T(B)=6\text{.}\)

For any vector space \(V\text{,}\) we have the following two simple, but very important, linear transformations.

Definition 4.4.5.

The identity transformation on \(V\text{,}\) denoted by \(\id_V\text{,}\) is a transformation that maps each element of \(V\) to itself. In other words,

\begin{equation*} \id_V:V\rightarrow V \end{equation*}

is a transformation such that

\begin{equation*} \id_V(\mathbf{v})=\mathbf{v}\quad\text{for all}\quad \mathbf{v} \in V \end{equation*}

Definition 4.4.6.

The zero transformation, \(Z\text{,}\) maps every element of the domain to the zero vector. In other words,

\begin{equation*} Z:V\rightarrow W \end{equation*}

is a transformation such that

\begin{equation*} Z(\mathbf{v})=\mathbf{0}\quad\text{for all}\quad \mathbf{v} \in V \end{equation*}

Theorem 4.4.7.

The identity transformation is linear.

Proof.

Left to the reader. (SeeExercise 4.4.5.15)

Theorem 4.4.8.

The zero transformation is linear.

Proof.

Left to the reader. (See Exercise 4.4.5.16)

Subsection 4.4.2 Linear Transformations and Bases

Exploration 4.4.1.

Suppose we want to define a linear transformation \(T:\R^2\rightarrow \R^2\) by

\begin{equation*} T(\mathbf{i})=\begin{bmatrix}3\\-2\end{bmatrix}\quad\text{and}\quad T(\mathbf{j})=\begin{bmatrix}-1\\2\end{bmatrix}. \end{equation*}

Is this information sufficient to define \(T\text{?}\) To answer this question we will try to determine what \(T\) does to an arbitrary vector of \(\R^2\text{.}\) If \(\mathbf{v}\) is a vector in \(\R^2\text{,}\) then \(\mathbf{v}\) can be uniquely expressed as a linear combination of \(\mathbf{i}\) and \(\mathbf{j}\)

\begin{equation*} \mathbf{v}=a\mathbf{i}+b\mathbf{j}. \end{equation*}

By linearity of \(T\) we have

\begin{equation*} T(\mathbf{v})=T(a\mathbf{i}+b\mathbf{j})=aT(\mathbf{i})+bT(\mathbf{j})=a\begin{bmatrix}3\\-2\end{bmatrix}+b\begin{bmatrix}-1\\2\end{bmatrix}. \end{equation*}

This shows that the image of every vector of \(\R^2\) under \(T\) is completely determined by the action of \(T\) on the standard unit vectors \(\mathbf{i}\) and \(\mathbf{j}\text{.}\)

Vectors \(\mathbf{i}\) and \(\mathbf{j}\) form the standard basis of \(\R^2\text{.}\) What if we want to use a different basis? Let

\begin{equation*} \mathcal{B}=\left \lbrace \begin{bmatrix}1\\1\end{bmatrix},\begin{bmatrix}2\\-1\end{bmatrix}\right \rbrace \end{equation*}

be our basis of choice for \(\R^2\text{.}\) (How would you verify that \(\mathcal{B}\) is a basis of \(\R^2\text{?}\)) And suppose we want to define a linear transformation \(S:\R^2\rightarrow \R^2\) by

\begin{equation*} S\left(\begin{bmatrix}1\\1\end{bmatrix}\right)=\begin{bmatrix}0\\-1\end{bmatrix}\quad\text{and}\quad S\left(\begin{bmatrix}2\\-1\end{bmatrix}\right)=\begin{bmatrix}2\\0\end{bmatrix}. \end{equation*}

Is this enough information to define \(S\text{?}\) Because \([1,1],[2,-1]\) form a basis of \(\R^2\text{,}\) every element \(\mathbf{v}\) of \(\R^2\) can be written as a unique linear combination

\begin{equation*} \mathbf{v}=a\begin{bmatrix}1\\1\end{bmatrix}+b\begin{bmatrix}2\\-1\end{bmatrix}. \end{equation*}

We can find \(S(\mathbf{v})\) as follows:

\begin{equation*} S(\mathbf{v})=S\left(a\begin{bmatrix}1\\1\end{bmatrix}+b\begin{bmatrix}2\\-1\end{bmatrix}\right)=a\begin{bmatrix}0\\-1\end{bmatrix}+b\begin{bmatrix}2\\0\end{bmatrix}. \end{equation*}

Again, we see how a linear transformation is completely determined by its action on a basis.

These examples only work because Theorem 4.2.7 tells us that given a basis, every vector has a unique representation as a linear combination of the basis vectors.

Let \(\mathcal{B}=\{\mathbf{v}_1,\ldots,\mathbf{v}_p\}\) be a basis of a vector space \(V\text{.}\) To define a linear transformation \(T:V\rightarrow W\text{,}\) it is sufficient to state the image of each basis vector under \(T\text{,}\) that is, we specify \(T(\mathbf{v}_1), T(\mathbf{v}_2), \ldots, T(\mathbf{v}_p)\text{.}\) Once the images of the basis vectors are established, we can determine the images of all vectors of \(V\) as follows: given any vector \(\mathbf{v}\) of \(V\text{,}\) write \(\mathbf{v}\) as a linear combination of the elements of \(\mathcal{B}\)

\begin{equation*} \mathbf{v}=a_1\mathbf{v}_1+\ldots+a_p\mathbf{v}_p. \end{equation*}

Then

\begin{equation*} T(\mathbf{v})=T(a_1\mathbf{v}_1+\ldots+a_p\mathbf{v}_p)=a_1T(\mathbf{v}_1)+\ldots+a_pT(\mathbf{v}_p). \end{equation*}

Subsection 4.4.3 Coordinate Vectors

Transformations that map vectors to their coordinate vectors will prove to be of great importance. In this section we will prove that such transformations are linear and give several examples. If \(V\) is a vector space, and \(\mathcal{B}=\{\mathbf{v}_1, \ldots ,\mathbf{v}_n\}\) is an ordered basis for \(V\) then any vector \(\mathbf{v}\) of \(V\) can be uniquely expressed as \(\mathbf{v}=a_1\mathbf{v}_1+\ldots +a_n\mathbf{v}_n\) for some scalars \(a_1, \ldots ,a_n\text{.}\) The vector in \(\R^n\) given by

\begin{equation*} [\mathbf{v}]_{\mathcal{B}}=\begin{bmatrix}a_1\\a_2\\\vdots\\a_n\end{bmatrix} \end{equation*}

is said to be the coordinate vector for \(\mathbf{v}\) with respect to the ordered basis \(\mathcal{B}\) (see Definition 4.2.28). As before, we denote the coordinate vector of \(\mathbf{v}\) by \([\mathbf{v}]_{\mathcal{B}}\text{.}\) While \(\mathbf{v}\) is in the abstract vector space \(V\text{,}\) the coordinate vector \([\mathbf{v}]_{\mathcal{B}}\) is in \(\R^n\text{,}\) not \(V\text{.}\) Because the coordinates are unique for each vector \(\mathbf{v}\) in \(V\text{,}\) we can define a transformation \(T:V\rightarrow \R^n\) by

\begin{equation*} T(\mathbf{v})=[\mathbf{v}]_{\mathcal{B}}. \end{equation*}

It turns out that \(T\) is linear. Before we prove this, consider some examples.

Example 4.4.9.

Consider \(\mathbb{M}_{2,2}\text{.}\) Notice that

\begin{equation*} \mathcal{B}=\left\{\begin{bmatrix}1\amp 0\\0\amp 0\end{bmatrix}, \begin{bmatrix}0\amp 1\\0\amp 0\end{bmatrix}, \begin{bmatrix}0\amp 0\\1\amp 0\end{bmatrix}, \begin{bmatrix}0\amp 0\\0\amp 1\end{bmatrix}\right\} \end{equation*}

is an ordered basis for \(\mathbb{M}_{2,2}\) (You should do a quick mental check that \(\mathcal{B}\) is a legitimate basis). Define \(T:\mathbb{M}_{2,2}\rightarrow \R^4\) by \(T(A)=[A]_{\mathcal{B}}\text{.}\) Find

\begin{equation*} T\left(\begin{bmatrix}-2\amp 3\\1\amp -5\end{bmatrix}\right). \end{equation*}

Answer.

We need to find the coordinate vector for \(\begin{bmatrix}-2\amp 3\\1\amp -5\end{bmatrix}\) with respect to \(\mathcal{B}\text{.}\) Firstly,

\begin{equation*} \begin{bmatrix}-2\amp 3\\1\amp -5\end{bmatrix}=-2\begin{bmatrix}1\amp 0\\0\amp 0\end{bmatrix}+ 3\begin{bmatrix}0\amp 1\\0\amp 0\end{bmatrix}+ \begin{bmatrix}0\amp 0\\1\amp 0\end{bmatrix}+ (-5)\begin{bmatrix}0\amp 0\\0\amp 1\end{bmatrix}. \end{equation*}

This gives us:

\begin{equation*} T\left(\begin{bmatrix}-2\amp 3\\1\amp -5\end{bmatrix}\right)=\left[\begin{bmatrix}-2\amp 3\\1\amp -5\end{bmatrix}\right]_{\mathcal{B}}=\begin{bmatrix}-2\\3\\1\\-5\end{bmatrix}. \end{equation*}

Example 4.4.10.

Recall that \(\mathbb{P}^2\) is the set of polynomials of degree \(2\) or less than \(2\text{.}\) In Example 4.1.13, we showed that \(\mathbb{P}^2\) is a vector space.

Notice that \(\mathcal{B}_1=\{1, x, x^{2}\}\) is an ordered basis for \(\mathbb{P}^2\text{.}\) (It is easy to verify that \(\mathcal{B}_1\) is a basis.) If \(T:\mathbb{P}^2\rightarrow \R^3\) is given by \(T(p)=[p]_{\mathcal{B}_1}\text{,}\) find

\begin{equation*} T(2x^2-3x). \end{equation*}
Notice that \(\mathcal{B}_2=\{1 + x, 1 - x, x + x^{2}\}\) is an ordered basis for \(\mathbb{P}^2\)- Exercise 4.2.7.13 demonstrates that \(\mathcal{B}_2\) is a basis. If \(T:\mathbb{P}^2\rightarrow \R^3\) is given by \(T(p)=[p]_{\mathcal{B}_2}\text{,}\) find

\begin{equation*} T(2x^2-3x). \end{equation*}

Answer.

Item 1 We express \(2x^2-3x\) as a linear combination of elements of \(\mathcal{B}_1\text{.}\)

\begin{equation*} 2x^2-3x=0\cdot 1+ (-3)x+2x^2. \end{equation*}

Therefore

\begin{equation*} [2x^2-3x]_{\mathcal{B}_1}=\begin{bmatrix}0\\-3\\2\end{bmatrix}. \end{equation*}

Note that it is important to keep the basis elements in the same order in which they are listed, as the order of components of the coordinate vector depends on the order of the basis elements. We conclude that

\begin{equation*} T(2x^2-3x)=\begin{bmatrix}0\\-3\\2\end{bmatrix}. \end{equation*}

For Item 2: Our goal is to express \(2x^2-3x\) as a linear combination of the elements of \(\mathcal{B}_2\text{.}\) Thus, we need to find coefficients \(a\text{,}\) \(b\) and \(c\) such that

\begin{align*} 2x^2-3x \amp =a(1+x)+b(1-x)+c(x+x^2) \\ \amp =(a+b)+(a-b+c)x+cx^2. \end{align*}

This gives us a system of linear equations:

\begin{equation*} \begin{array}{ccccccc} a \amp +\amp b\amp \amp \amp = \amp 0 \\ a\amp -\amp b\amp +\amp c\amp =\amp -3\\ \amp \amp \amp \amp c\amp =\amp 2 \end{array} \end{equation*}

Solving the system yields \(a=-\frac{5}{2}\text{,}\) \(b=\frac{5}{2}\) and \(c=2\text{.}\) Thus

\begin{equation*} T(2x^2-3x)=[2x^2-3x]_{\mathcal{B}_2}=\begin{bmatrix}-5/2\\5/2\\2\end{bmatrix}. \end{equation*}

Theorem 4.4.11.

Let \(V\) be an \(n\)-dimensional vector space, and let \(\mathcal{B}\) be an ordered basis for \(V\text{.}\) Then \(T:V\rightarrow \R^n\) given by \(T(\mathbf{v})=[\mathbf{v}]_{\mathcal{B}}\) is a linear transformation.

Proof.

Theorem 4.2.7 shows that there is only one way to represent each element of \(V\) as a linear combination of elements of \(\mathcal{B}\text{.}\) Thus each element of \(V\) maps to exactly one element of \(\R^n\text{.}\) This proves that \(T\) is a function, or a transformation.

We will now prove that \(T\) is linear. Let \(\mathbf{v}\) be an element of \(V\text{.}\) We will first show that \(T(k\mathbf{v})=kT(\mathbf{v})\text{.}\) Suppose \(\mathcal{B}=\{\mathbf{v}_1, \ldots ,\mathbf{v}_n\}\text{,}\) then \(\mathbf{v}\) can be written as a unique linear combination:

\begin{equation*} \mathbf{v}=a_1\mathbf{v}_1+ \ldots +a_n\mathbf{v}_n \end{equation*}

We have:

\begin{align*} T(k\mathbf{v})\amp =T(k(a_1\mathbf{v}_1+ \ldots +a_n\mathbf{v}_n)) \\ \amp =T((ka_1)\mathbf{v}_1+ \ldots +(ka_n)\mathbf{v}_n) \\ \amp =\begin{bmatrix}ka_1\\\vdots\\ka_n\end{bmatrix}=k\begin{bmatrix}a_1\\\vdots\\a_n\end{bmatrix}=kT(\mathbf{v}). \end{align*}

We leave it to the reader to verify that \(T(\mathbf{v}+\mathbf{w})=T(\mathbf{v})+T(\mathbf{w})\) (see Exercise 4.4.5.17).

In our final example, we will consider \(T\) in the context of a basis of the codomain, as well as a basis of the domain. This will later help us tackle the question of the matrix of \(T\) associated with bases other than the standard one.

Example 4.4.12.

Let

\begin{equation*} \mathbf{v}_1=\begin{bmatrix}1\\2\\0\end{bmatrix}\quad\text{and}\quad\mathbf{v}_2=\begin{bmatrix}0\\1\\1\end{bmatrix}, \end{equation*}

\begin{equation*} \mathbf{w}_1=\begin{bmatrix}1\\0\\1\end{bmatrix}\quad\text{and}\quad\mathbf{w}_2=\begin{bmatrix}1\\0\\0\end{bmatrix}, \end{equation*}

and

\begin{equation*} V=\text{span}(\mathbf{v}_1, \mathbf{v}_2)\quad\text{and}\quad W=\text{span}(\mathbf{w}_1, \mathbf{w}_2). \end{equation*}

Because each of \(\{\mathbf{v}_1, \mathbf{v}_2\}\) and \(\{\mathbf{w}_1, \mathbf{w}_2\}\) is linearly independent, let

\begin{equation*} \mathcal{B}=\{\mathbf{v}_1, \mathbf{v}_2\}\quad\text{and}\quad\mathcal{C}=\{\mathbf{w}_1, \mathbf{w}_2\} \end{equation*}

be ordered bases of \(V\) and \(W\text{,}\) respectively. Define a linear transformation \(T:V\rightarrow W\) by

\begin{equation*} T(\mathbf{v}_1)=2\mathbf{w}_1-3\mathbf{w}_2\quad\text{and} \quad T(\mathbf{v}_2)=-\mathbf{w}_1+4\mathbf{w}_2. \end{equation*}

Verify that \(\mathbf{v}=[2,5,1]\) is in \(V\) and find the coordinate vector \([\mathbf{v}]_{\mathcal{B}}\text{.}\)
Find \(T(\mathbf{v})\) and the coordinate vector \([T(\mathbf{v})]_{\mathcal{C}}\text{.}\)

Answer.

For Item 1, we need to express \(\mathbf{v}\) as a linear combination of \(\mathbf{v}_1\) and \(\mathbf{v}_2\text{.}\) This can be done by observation or by solving the equation

\begin{equation*} \begin{bmatrix}1\amp 0\\2\amp 1\\0\amp 1\end{bmatrix}\begin{bmatrix}a\\b\end{bmatrix}=\begin{bmatrix}2\\5\\1\end{bmatrix}. \end{equation*}

We find that \(a=2\) and \(b=1\text{,}\) so \(\mathbf{v}=2\mathbf{v}_1+\mathbf{v}_2\text{.}\) Thus \(\mathbf{v}\) is in \(V\text{.}\) The coordinate vector for \(\mathbf{v}\) with respect to the ordered basis \(\mathcal{B}\) is

\begin{equation*} [\mathbf{v}]_{\mathcal{B}}=\begin{bmatrix}2\\1\end{bmatrix}. \end{equation*}

For Item 2, by linearity of \(T\) we have

\begin{align*} T(\mathbf{v})=T(2\mathbf{v}_1+\mathbf{v}_2)\amp =2T(\mathbf{v}_1)+T(\mathbf{v}_2) \\ \amp =2(2\mathbf{w}_1-3\mathbf{w}_2)+(-\mathbf{w}_1+4\mathbf{w}_2) \\ \amp =3\mathbf{w}_1-2\mathbf{w}_2=\begin{bmatrix}1\\0\\3\end{bmatrix}. \end{align*}

The coordinate vector for \(T(\mathbf{v})\) with respect to the ordered basis \(\mathcal{C}\) is

\begin{equation*} [T(\mathbf{v})]_{\mathcal{C}}=\begin{bmatrix}3\\-2\end{bmatrix}. \end{equation*}

Subsection 4.4.4 Matrix Representation of a Linear Transformation

In this section we will define and learn how to compute the matrix representation of a linear transformation between vector spaces. For a linear transformations \(T:\R^n\to\R^m\) we know that \(T(\mathbf{x})=A\mathbf{x}\text{,}\) where

\begin{equation*} A = \begin{bmatrix}T(\mathbf{e}_1) \amp T(\mathbf{e}_2) \amp \cdots \amp T(\mathbf{e}_n)\end{bmatrix}. \end{equation*}

We call \(A\) the standard matrix of the transformation \(T:\R^n\to\R^m\text{.}\) As usual, \(\mathbf{e}_1, \ldots, \mathbf{e}_n\) are the standard basis vectors of \(\R^n\text{.}\)

To create a matrix representation for a linear transformation \(T : V \to W\) between vector spaces \(V\) and \(W\text{,}\) we have to pick bases for each of \(V\) and \(W\text{.}\)

Definition 4.4.13.

Let \(V\) and \(W\) be vector spaces with ordered bases \(\mathcal{B}=\{\mathbf{b}_1, \ldots, \mathbf{b}_n\}\) and \(\mathcal{C}=\{\mathbf{c}_1, \ldots, \mathbf{c}_m\}\text{,}\) respectively.

The matrix representation of a linear transformation \(T:V\to W\) with respect to the bases \(\mathcal{B}\) and \(\mathcal{C}\) is defined as the matrix whose columns are the coordinate vectors of the images of the basis vectors of \(V\) under \(T\text{.}\) In other words, the matrix representation of \(T\) is given by

\begin{equation*} A_{\mathcal{C}\leftarrow\mathcal{B}} = \begin{bmatrix} [T(\mathbf{b}_1)]_{\mathcal{C}} \amp [T(\mathbf{b}_2)]_{\mathcal{C}} \amp \cdots \amp [T(\mathbf{b}_n)]_{\mathcal{C}} \end{bmatrix}. \end{equation*}

Here, \([T(\mathbf{b}_i)]_{\mathcal{C}}\) is the coordinate vector of \(T(\mathbf{b}_i)\) with respect to the basis \(\mathcal{C}=\{\mathbf{c}_1, \ldots, \mathbf{c}_m\}\text{.}\)

There is a lot to unpack in this definition and we will look at a few examples in a moment. First, the matrix representation really is a matrix, just as coordinate vectors are in \(\R^n\text{.}\)

Second, the matrix representation is an generalization of the standard matrix from Section 2.6. To see the connection, let \(\mathcal{E}_n=\{\mathbf{e}_1, \ldots, \mathbf{e}_n\}\text{,}\) the standard basis of \(\R^n\text{.}\) Then, for \(T : \R^n \to \R^m\text{,}\) \(T(\mathbf{e_i})\) is its own coordinate vector with respect to \(\mathcal{E}_m\text{.}\) So our new definition of matrix representation is the same as the standard matrix of \(T\text{,}\) that is,

\begin{equation*} \begin{bmatrix} [T(\mathbf{e}_1)]_{\mathcal{E}_m} \amp [T(\mathbf{e}_2)]_{\mathcal{E}+m} \amp \cdots \amp [T(\mathbf{e}_n)]_{\mathcal{E}_m} \end{bmatrix} =\begin{bmatrix}T(\mathbf{e}_1) \amp T(\mathbf{e}_2) \amp \cdots \amp T(\mathbf{e}_n)\end{bmatrix}. \end{equation*}

The matrix representation of a linear transformation with respect to bases lets turn linear transformations of general vector spaces into matrices. Using this, we can show that vector spaces with the same (finite) dimension are essentially the same. So it provides a bridge between the abstract concept of linear transformations and the concrete computations involving matrices. Let’s practice finding matrix representations to get comfortable with this new tool.

Example 4.4.14.

Consider the derivative map \(D : \mathbb{P}^3 \to \mathbb{P}^2\) that sends a polynomial of degree at most 3 to its derivative. Letting \(\mathcal{B}=\{1,x,x^2,x^3\}\) and \(\mathcal{C}=\{1,x,x^2\}\text{,}\) what is the matrix of \(D\) with respect to these bases?

Answer.

From the formula in Definition 4.4.13, we first need to find the image of each element of \(\mathcal{B}\) under \(D\) and then find the coordinate vector of each image in terms of the basis \(\mathcal{C}\text{.}\) These coordinate vectors will be the columns of the matrix representation.

For \(1\text{,}\) we know \(D(1)=0\) and the coordinate vector of the zero vector is \([0,0,0]\text{.}\) For \(x\text{,}\) we know \(D(x)=1\) and \([1]_\mathcal{C}\) is \([1,0,0]\text{.}\) Similarly, \(D(x^2)=2x\) and \([2x]_\mathcal{C} = [0,2,0]\) while \([ D(x^3) ]_\mathcal{C} = [ 3x^2 ]_\mathcal{C}= [0,0,3]\text{.}\) Putting this together, we have

\begin{equation*} D_{\mathcal{C}\leftarrow\mathcal{B}} = \begin{bmatrix} 0 \amp 1 \amp 0 \amp 0 \\ 0 \amp 0 \amp 2 \amp 0 \\ 0 \amp 0 \amp 0 \amp 3 \end{bmatrix}. \end{equation*}

Example 4.4.15.

Consider the transformation \(T:\mathbb{P}^2\to\mathbb{M}_{2,2}\) defined by

\begin{equation*} T(1) = \begin{bmatrix}1 \amp 0 \\ 0 \amp 2\end{bmatrix}, \quad T(x) = \begin{bmatrix}1 \amp -1 \\ 5 \amp 0\end{bmatrix}, \quad T(x^2) = \begin{bmatrix}1 \amp 1 \\ 1 \amp 1\end{bmatrix}. \end{equation*}

Find the matrix representation of \(T\) with respect to the bases \(\mathcal{B} = \{1, x, x^2\}\) and \(\mathcal{C} = \left\{\begin{bmatrix}1 \amp 0 \\ 0 \amp 0\end{bmatrix}, \begin{bmatrix}0 \amp 1 \\ 0 \amp 0\end{bmatrix}, \begin{bmatrix}0 \amp 0 \\ 1 \amp 0\end{bmatrix}, \begin{bmatrix}0 \amp 0 \\ 0 \amp 1\end{bmatrix}\right\}\text{.}\)

Answer.

The matrix representation of \(T\) with respect to the bases \(\mathcal{B}\) and \(\mathcal{C}\) is given by

\begin{equation*} A_{\mathcal{C}\leftarrow\mathcal{B}} = \begin{bmatrix} [T(1)]_{\mathcal{C}} \amp [T(x)]_{\mathcal{C}} \amp [T(x^2)]_{\mathcal{C}} \end{bmatrix}. \end{equation*}

Note that we need to compute the coordinate vectors of \(T(1)\text{,}\) \(T(x)\text{,}\) and \(T(x^2)\) with respect to the basis \(\mathcal{C}\text{.}\) Observe:

\begin{align*} T(1) \amp = 1\begin{bmatrix}1\amp 0\\ 0\amp 0\end{bmatrix} + 0\begin{bmatrix}0\amp 1\\ 0\amp 0\end{bmatrix} + 0\begin{bmatrix}0\amp 0\\ 1\amp 0\end{bmatrix} + 2\begin{bmatrix}0\amp 0\\ 0\amp 1\end{bmatrix} \\ T(x) \amp = 1\begin{bmatrix}1\amp 0\\ 0\amp 0\end{bmatrix} + -1\begin{bmatrix}0\amp 1\\ 0\amp 0\end{bmatrix} + 5\begin{bmatrix}0\amp 0\\ 1\amp 0\end{bmatrix} + 0\begin{bmatrix}0\amp 0\\ 0\amp 1\end{bmatrix} \\ T(x^2) \amp = 1\begin{bmatrix}1\amp 0\\ 0\amp 0\end{bmatrix} + 1\begin{bmatrix}0\amp 1\\ 0\amp 0\end{bmatrix} + 1\begin{bmatrix}0\amp 0\\ 1\amp 0\end{bmatrix} + 1\begin{bmatrix}0\amp 0\\ 0\amp 1\end{bmatrix} \end{align*}

So each coordinate vector is

\begin{align*} [T(1)]_{\mathcal{C}} \amp = \begin{bmatrix}1 \\ 0 \\ 0 \\ 2\end{bmatrix}\\ [T(x)]_{\mathcal{C}} \amp = \begin{bmatrix}1 \\ -1 \\ 5 \\ 0\end{bmatrix}\\ [T(x^2)]_{\mathcal{C}} \amp = \begin{bmatrix}1 \\ 1 \\ 1 \\ 1\end{bmatrix} \end{align*}

Therefore, the matrix representation is

\begin{equation*} A_{\mathcal{C}\leftarrow\mathcal{B}} = \begin{bmatrix} 1 \amp 1 \amp 1 \\ 0 \amp -1 \amp 1 \\ 0 \amp 5 \amp 1 \\ 2 \amp 0 \amp 1 \end{bmatrix}. \end{equation*}

Exercises 4.4.5 Exercises

1.

Suppose \(T:\mathbb{P}^2\rightarrow\mathbb{M}_{2,2}\) is a linear transformation such that

\begin{equation*} T(1)=\begin{bmatrix}1\amp 0\\0\amp 1\end{bmatrix},\quad T(x)=\begin{bmatrix}1\amp 1\\0\amp 1\end{bmatrix},\quad T(x^2)=\begin{bmatrix}1\amp 1\\1\amp 1\end{bmatrix} \end{equation*}

Find

\begin{equation*} T(4-x+3x^2)\text{.} \end{equation*}

Answer.

\begin{equation*} T(4-x+3x^2)=\begin{bmatrix}6\amp 2\\3\amp 6\end{bmatrix}. \end{equation*}

2.

Define \(T:\mathbb{M}_{3,3}\rightarrow \R\) by \(T(A)=\mbox{tr}(A)\text{.}\) (Recall that \(\mbox{tr}(A)\) denotes the trace of \(A\text{,}\) which is the sum of the main diagonal entries of \(A\text{.}\)) Find

\begin{equation*} T\left(\begin{bmatrix}1\amp 2\amp 3\\4\amp 5\amp 6\\7\amp 8\amp 9\end{bmatrix}\right). \end{equation*}

Answer.

\begin{equation*} T\left(\begin{bmatrix}1\amp 2\amp 3\\4\amp 5\amp 6\\7\amp 8\amp 9\end{bmatrix}\right)=15. \end{equation*}

3.

Is \(T\) a linear transformation? If so, prove it. If not, give a counterexample.

4.

Define \(T:\R^2\rightarrow\mathbb{M}_{2,2}\) by

\begin{equation*} T\left(\begin{bmatrix}a\\b\end{bmatrix}\right)=\begin{bmatrix}a\amp 1\\1\amp b\end{bmatrix}. \end{equation*}

Find

\begin{equation*} T\left(\begin{bmatrix}2\\-1\end{bmatrix}\right). \end{equation*}

Answer.

\begin{equation*} T\left(\begin{bmatrix}2\\-1\end{bmatrix}\right)=\begin{bmatrix}2\amp 1\\1\amp -1\end{bmatrix}. \end{equation*}

5.

Is \(T\) a linear transformation? If so, prove it. If not, give a counterexample.

6.

This problem requires the knowledge of how to compute a \(3\times 3\) determinant (for a quick reminder, chapter \(1\)). Define \(T:\mathbb{M}_{3,3}\rightarrow \R\) by \(T(A)=\det(A)\text{.}\) Find

\begin{equation*} T\left(\begin{bmatrix}1\amp 2\amp 3\\4\amp 5\amp 6\\7\amp 8\amp 9\end{bmatrix}\right). \end{equation*}

Answer.

\begin{equation*} T\left(\begin{bmatrix}1\amp 2\amp 3\\4\amp 5\amp 6\\7\amp 8\amp 9\end{bmatrix}\right)=0. \end{equation*}

7.

Is \(T\) a linear transformation? If so, prove it. If not, give a counterexample.

8.

Define \(T:\mathbb{P}^3\rightarrow\mathbb{P}^2\) by \(T(p(x))=p'(x)\) (in other words, \(T\) maps a polynomial to its derivative). Find

\begin{equation*} T(4x^3-2x^2+x+6). \end{equation*}

Answer.

\begin{equation*} T(4x^3-2x^2+x+6)=12x^2-4x+1. \end{equation*}

9.

Is \(T\) a linear transformation? If so, prove it. If not, give a counterexample.

10.

Recall that the set \(V\) of all symmetric \(2\times 2\) matrices is a subspace of \(\mathbb{M}_{2,2}\text{.}\) In Example 4.2.23, we demonstrated that

\begin{equation*} \mathcal{B} = \left\{ \begin{bmatrix} 1 \amp 0 \\ 0 \amp 0 \end{bmatrix}, \begin{bmatrix} 0 \amp 0 \\ 0 \amp 1 \end{bmatrix}, \begin{bmatrix} 0 \amp 1 \\ 1 \amp 0 \end{bmatrix} \right\} \end{equation*}

is a basis for \(V\text{.}\) Define \(T:V\rightarrow \R^3\) by \(T(A)=[A]_{\mathcal{B}}\text{.}\) Find

\begin{equation*} T(I_2) \quad \text{and} \quad T\left(\begin{bmatrix}2\amp -3\\-3\amp 1\end{bmatrix}\right). \end{equation*}

Answer.

\begin{equation*} T(I_2)=\begin{bmatrix}1\\1\\0\end{bmatrix}, \end{equation*}

\begin{equation*} T\left(\begin{bmatrix}2\amp -3\\-3\amp 1\end{bmatrix}\right)=\begin{bmatrix}2\\1\\-3\end{bmatrix}. \end{equation*}

11.

Let \(V\) be a subspace of \(\R^3\) with a basis \(\mathcal{B}=\left\{\begin{bmatrix}2\\1\\-1\end{bmatrix}, \begin{bmatrix}0\\3\\2\end{bmatrix}\right\}\text{.}\) Find the coordinate vector, \([\mathbf{v}]_{\mathcal{B}}\text{,}\) for \(\mathbf{v}=[4,-1,-4]\text{.}\)

Answer.

\begin{equation*} [\mathbf{v}]_{\mathcal{B}}=\begin{bmatrix}2\\-1\end{bmatrix}. \end{equation*}

12.

If the order of the basis elements in Exercise 4.4.5.11 was switched to form a new basis

\begin{equation*} \mathcal{B}'=\left\{\begin{bmatrix}0\\3\\2\end{bmatrix}, \begin{bmatrix}2\\1\\-1\end{bmatrix} \right\}. \end{equation*}

How would this affect the coordinate vector?

Answer.

\begin{equation*} [\mathbf{v}]_{\mathcal{B}'}=\begin{bmatrix}-1\\2\end{bmatrix} \end{equation*}

13.

In Exercise 4.2.7.16, you demonstrated that

\begin{equation*} \mathcal{B}=\{x^{2}, x + 1, 1 - x - x^{2}\} \end{equation*}

is a basis for \(\mathbb{P}^2\text{.}\) Define \(T:\mathbb{P}^2\rightarrow \R^3\) by \(T(p(x))=[p(x)]_{\mathcal{B}}\text{.}\) Find

\begin{equation*} T(0), \quad T(x+1) \quad \text{and} \quad T(x^2-3x+1). \end{equation*}

Answer.

\begin{equation*} T(0)=\begin{bmatrix}0\\0\\0\end{bmatrix}, \end{equation*}

\begin{equation*} T(x+1)=\begin{bmatrix}0\\1\\0\end{bmatrix}, \end{equation*}

\begin{equation*} T(x^2-3x+1)=\begin{bmatrix}3\\-1\\2\end{bmatrix}. \end{equation*}

14.

Let \(V\) and \(W\) be vector spaces, and let \(\mathcal{B}_V=\{\mathbf{v}_1, \mathbf{v}_2, \mathbf{v}_3, \mathbf{v}_4\}\) and \(\mathcal{B}_W=\{\mathbf{w}_1,\mathbf{w}_2, \mathbf{w}_3\}\) be ordered bases of \(V\) and \(W\text{,}\) respectively. Suppose \(T:V\rightarrow W\) is a linear transformation such that:

\begin{equation*} T(\mathbf{v}_1)=\mathbf{w}_2, \end{equation*}

\begin{equation*} T(\mathbf{v}_2)=2\mathbf{w}_1-3\mathbf{w}_2, \end{equation*}

\begin{equation*} T(\mathbf{v}_3)=\mathbf{w}_2+\mathbf{w}_3, \end{equation*}

\begin{equation*} T(\mathbf{v}_4)=-\mathbf{w}_1. \end{equation*}

If \(\mathbf{v}=-2\mathbf{v}_1+3\mathbf{v}_2-\mathbf{v}_4\text{,}\) express \(T(\mathbf{v})\) as a linear combination of vectors of \(\mathcal{B}_W\text{.}\) Now,

\begin{equation*} T(\mathbf{v})=7\mathbf{w}_1-11\mathbf{w}_2+0\mathbf{w}_3. \end{equation*}

Find \([\mathbf{v}]_{\mathcal{B}_V}\) and \([T(\mathbf{v})]_{\mathcal{B}_{W}}\text{.}\)

Answer.

\begin{equation*} [\mathbf{v}]_{\mathcal{B}_V}=\begin{bmatrix}-2\\3\\0\\-1\end{bmatrix},\quad [T(\mathbf{v})]_{\mathcal{B}_{W}}=\begin{bmatrix}7\\-11\\0\end{bmatrix}. \end{equation*}

15.

Prove Theorem 4.4.7

16.

Prove Theorem 4.4.8

17.

Complete the proof of Theorem 4.4.11.

Prev Top Next