Lecture 23 - Polynomial Operators

Let's just start with an example. Say:

$$p(x) = x^2 - 2x + 2$$

and D is the derivative map. Then:

$$p(D) = D^2 - 2D + 2I$$

Notice that we are just "plugging in" $x = D$ here. So then:

$$p(D)(1 + x + x^2) = (D^2 - 2D + 2I)(1 + x + x^2) = D^2(1 + x + x^2) - 2D(1 + x + x^2) + 2(1 + x + x^2)$$

which simplifies to:

$$(2) - 2(2x + 1) + 2(1 + x + x^2) = \ldots$$

The rest really doesn't matter. What's important is that $p(D)$ is a polynomial operator, built from $D$. Notice that it's an operator, so we applied it to something that $D$ can operate on.
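Here is a quick sanity check of this example in Python (a minimal sketch using sympy; the tool choice is mine, not the lecture's):

```python
import sympy as sp

x = sp.symbols('x')
f = 1 + x + x**2

# p(D) f = D^2 f - 2 D f + 2 f, where D is differentiation with respect to x
result = sp.diff(f, x, 2) - 2 * sp.diff(f, x) + 2 * f
print(sp.expand(result))  # 2*x**2 - 2*x + 2
```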

As another example, let $p(x) = x - 2$ and $q(x) = 2x + 1$, so that $p(T) = T - 2I$ and $q(T) = 2T + I$, where $T \in L(V)$. So then:

$$p(T)q(T)v = p(T)(2Tv + v) = (T - 2I)(2Tv + v) = 2T^2v - 4Tv + Tv - 2v = 2T^2v - 3Tv - 2v$$

which also equals:

$$(2T^2 - 3T - 2I)v$$

It shouldn't surprise you that multiplying the polynomials gives exactly this polynomial operator once we plug in $x = T$:

$$p(x)q(x) = 2x^2 - 3x - 2$$

Therefore, it follows that:

$$(pq)(T) = p(T)q(T)$$

as an interesting property. Is it always true? Yes, because you can expand (FOIL out) polynomial operators just like ordinary polynomials, so the algebraic properties of polynomials carry over to these new objects.

As an example, since $(pq)(x) = (qp)(x)$, the property above gives:

$$p(T)q(T) = (pq)(T) = (qp)(T) = q(T)p(T)$$

THIS IS BIG!!!! Composition of operators does not commute in general, but it does here. The reason is that we are only dealing with one operator $T$, and that:

$$TT = TT$$

And we have distributivity and all the other good properties that come from $T \in L(V)$, where $L(V)$ is a vector space (closed under composition), so those properties carry over.
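As a quick numerical illustration (my own addition, not part of the lecture), here is a small numpy sketch checking $p(T)q(T) = (pq)(T) = q(T)p(T)$ for the polynomials above, with a random matrix standing in for $T$:

```python
import numpy as np

rng = np.random.default_rng(0)
T = rng.standard_normal((4, 4))    # a stand-in operator on a 4-dimensional space
I = np.eye(4)

pT = T - 2 * I                     # p(x) = x - 2
qT = 2 * T + I                     # q(x) = 2x + 1
pqT = 2 * T @ T - 3 * T - 2 * I    # (pq)(x) = 2x^2 - 3x - 2

print(np.allclose(pT @ qT, pqT))      # True: p(T) q(T) = (pq)(T)
print(np.allclose(pT @ qT, qT @ pT))  # True: polynomials in the same T commute
```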

Theorem

Every operator on a finite-dimensional (nonzero) complex vector space $V$ has an eigenvalue and an associated eigenvector.

Note that this isn't true for real vector spaces. Here's why: think of a rotation of the plane (say by $90°$). Its eigenvalues are complex and non-real, so as an operator on a real vector space it has no eigenvalue at all.
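To see this numerically (a small sketch, with the $90°$ rotation matrix chosen as the example):

```python
import numpy as np

# Rotation of the plane by 90 degrees: T(x, y) = (-y, x)
R = np.array([[0.0, -1.0],
              [1.0,  0.0]])

print(np.linalg.eigvals(R))  # [0.+1.j  0.-1.j]: purely imaginary, no real eigenvalue
```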

Proof
Suppose that $V$ is a finite-dimensional complex vector space with $\dim(V) = n \geq 1$. Further, suppose that $T \in L(V)$. We need to show that $T$ has an eigenvalue.

Choose a nonzero vector $v \in V$ (so $v \neq 0$). Make a list:

$$v, Tv, T^2v, \ldots, T^nv$$

This list has $n + 1$ vectors in an $n$-dimensional space, so it is definitely linearly dependent. Hence there are $\alpha_i \in \mathbb{C}$ (not all $0$) where:

$$\alpha_0 Iv + \alpha_1 Tv + \cdots + \alpha_n T^nv = 0$$

But we can convert this combination of operators into a polynomial operator:

$$(\alpha_0 I + \alpha_1 T + \cdots + \alpha_n T^n)v = (\alpha_0 + \alpha_1 z + \cdots + \alpha_n z^n)(T)\,v = 0$$

Since we are over $\mathbb{C}$, we can factor this polynomial into a product of linear terms. Now, $\alpha_n$ may be zero, and so on down the list, but at least one of $\alpha_1, \ldots, \alpha_n$ must be nonzero: if they were all zero, we'd have $\alpha_0 v = 0$ with $v \neq 0$, forcing $\alpha_0 = 0$ as well, contradicting "not all $0$". So the degree of our polynomial is at least $1$ (really between $1$ and $n$); call it degree $m$. Thus:

$$\big(c(z - \lambda_1)\cdots(z - \lambda_m)\big)(T)\,v = 0$$

Thus, we can write this as a product of operators:

$$c(T - \lambda_1 I)\cdots(T - \lambda_m I)v = 0$$

with $v \neq 0$. Recall that if $(T - \lambda I)w = 0$ for some $w \neq 0$, then $w \in \operatorname{null}(T - \lambda I)$, so $w$ is an eigenvector and $\lambda$ is its eigenvalue. Now go through the factors from right to left. If $(T - \lambda_m I)v = 0$, then by the recollection above $v$ is an eigenvector with eigenvalue $\lambda_m$, and we're done. Otherwise $(T - \lambda_m I)v \neq 0$, and if $(T - \lambda_{m-1} I)(T - \lambda_m I)v = 0$, then $(T - \lambda_m I)v$ is an eigenvector with eigenvalue $\lambda_{m-1}$. Repeat. One of these steps must succeed, since if we get all the way to the end:

$$c(T - \lambda_1 I)\cdots(T - \lambda_m I)v = 0$$

where $c \neq 0$ (we're dealing with a degree-$m$ polynomial), so if we get to that point, the nonzero vector $(T - \lambda_2 I)\cdots(T - \lambda_m I)v$ is sent to $0$ by $(T - \lambda_1 I)$, making it an eigenvector with eigenvalue $\lambda_1$.

Boom!
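The proof is constructive enough to turn into code. Below is a rough numerical sketch of the argument (my own illustration, not the lecture's; the function name and tolerance are made up): build the dependent list $v, Tv, \ldots, T^nv$, extract coefficients $\alpha_i$, factor the polynomial with `numpy.roots`, and walk the linear factors until one of them annihilates a nonzero vector.

```python
import numpy as np

def eigenpair_from_proof(T, v, tol=1e-8):
    """Follow the existence proof: find a dependence among v, Tv, ..., T^n v,
    factor the resulting polynomial, and walk the linear factors until one of
    them kills a nonzero vector; that vector is an eigenvector."""
    n = T.shape[0]
    # Columns: v, Tv, T^2 v, ..., T^n v -- n+1 vectors in C^n, so dependent
    K = np.column_stack([np.linalg.matrix_power(T, k) @ v for k in range(n + 1)])
    # A null vector of K gives the coefficients alpha_0, ..., alpha_n
    _, _, Vh = np.linalg.svd(K)
    alpha = Vh[-1].conj()                        # K @ alpha is (numerically) 0
    # Degree m = largest k with alpha_k significantly nonzero
    m = max(k for k in range(n + 1) if abs(alpha[k]) > tol)
    lambdas = np.roots(alpha[:m + 1][::-1])      # the lambda_j in c(z-lambda_1)...(z-lambda_m)
    # Walk the factors (T - lambda_j I) applied to v, as in the proof
    w = v
    for lam in lambdas:
        w_next = (T - lam * np.eye(n)) @ w
        if np.linalg.norm(w_next) < tol * np.linalg.norm(w):
            return lam, w                        # (T - lam I) w = 0 and w != 0
        w = w_next
    raise RuntimeError("no factor annihilated a nonzero vector (numerical issue)")

T = np.array([[2, 1], [0, 3]], dtype=complex)
lam, w = eigenpair_from_proof(T, np.array([1.0, 1.0], dtype=complex))
print(lam, np.linalg.norm(T @ w - lam * w))      # residual should be ~0
```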

We can now always use an eigenvalue to simplify certain processes! An application is now detailed below:

Computing Higher Powers of Matrices

When a matrix has $0$'s in the right places, we can often diagonalize it, and then higher powers of the matrix become easy to compute. Let $A$ be an $n \times n$ matrix. Recall that its diagonal consists of the $a_{i,i}$ entries for all $1 \leq i \leq n$:

$$\begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn} \end{bmatrix}$$
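Here is a hedged numpy sketch of the power trick (assuming $A$ is diagonalizable; the particular $A$ is just my example): write $A = P D P^{-1}$ with $D$ diagonal, so $A^k = P D^k P^{-1}$, and $D^k$ only requires powering the diagonal entries.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

# Eigendecomposition A = P diag(w) P^{-1}  (assumes A is diagonalizable)
w, P = np.linalg.eig(A)

k = 10
A_k = P @ np.diag(w ** k) @ np.linalg.inv(P)   # A^k via powering the diagonal

print(np.allclose(A_k, np.linalg.matrix_power(A, k)))  # True
```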

Upper Triangular

A matrix is upper triangular if the entries in the lower triangle are all $0$ (i.e., there's data only on the diagonal and in the upper part):

$$\begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ 0 & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & a_{nn} \end{bmatrix}$$
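A quick way to test this numerically (a small numpy sketch, my own addition): a matrix is upper triangular exactly when it equals its own upper-triangular part.

```python
import numpy as np

A = np.array([[1.0, 2.0, 3.0],
              [0.0, 4.0, 5.0],
              [0.0, 0.0, 6.0]])

# np.triu(A) zeroes out everything below the diagonal
print(np.allclose(A, np.triu(A)))  # True, so A is upper triangular
```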

For example, suppose that $V$ is a vector space of dimension $3$, and $v_1, v_2, v_3$ is a basis $\beta$ of $V$. Let $T \in L(V)$ where:

$$M(T, \beta) = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ 0 & a_{22} & a_{23} \\ 0 & 0 & a_{33} \end{bmatrix}$$

Therefore:

$$T(v_1) = a_{11}v_1$$
$$T(v_2) = a_{12}v_1 + a_{22}v_2$$
$$T(v_3) = a_{13}v_1 + a_{23}v_2 + a_{33}v_3$$

But notice this! If I look at $T(v_i)$, I can always write it as a linear combination of the $v_j$'s with $j \leq i$. Namely:

$$T(v_i) \in \operatorname{span}(v_1, \ldots, v_i)$$

That's big! If I were to look at:

$$T(\operatorname{span}(v_1, \ldots, v_i)) \subseteq \operatorname{span}(v_1, \ldots, v_i)$$

So then $\operatorname{span}(v_1, \ldots, v_i)$ is a $T$-invariant subspace for all $i$.
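Here is a small numerical check of that observation (a sketch under the assumption that $\beta$ is the standard basis, so $M(T, \beta)$ acts directly on coordinate vectors): for an upper triangular matrix, the image of the $i$-th basis vector has no components past position $i$.

```python
import numpy as np

A = np.array([[1.0, 2.0, 3.0],
              [0.0, 4.0, 5.0],
              [0.0, 0.0, 6.0]])   # an upper triangular M(T, beta)
n = A.shape[0]

for i in range(n):
    e_i = np.zeros(n)
    e_i[i] = 1.0                  # coordinates of v_{i+1}
    Tv_i = A @ e_i                # coordinates of T(v_{i+1})
    # every component past position i should be zero: T(v_i) in span(v_1, ..., v_i)
    print(i + 1, np.allclose(Tv_i[i + 1:], 0.0))   # True for each i
```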

We tie everything together into one neat theorem:

Upper Triangular Transformation Properties

The following are equivalent:

  1. $T$ has an upper triangular matrix representation w.r.t. some basis $\beta = \{v_1, \ldots, v_n\}$ for $V$.
  2. $T(v_i) \in \operatorname{span}(v_1, \ldots, v_i)$ for all $1 \leq i \leq n$.
  3. $\operatorname{span}(v_1, \ldots, v_i)$ is $T$-invariant for all $i$.

Finishing Up

For the next lecture, we'll talk about the following theorem:

Theorem

Every operator $T \in L(V)$, where $V$ is a finite-dimensional complex vector space, has a matrix representation (with respect to some basis) which is upper triangular.

We'll talk about this next time.