MHB Notation for vector coordinates in a given basis

Evgeny.Makarov · Aug 29, 2014

Sorry for the long post. I am looking for a clear and concise way to explain how to compute coordinates when changes of basis or linear operators are involved. I would like to avoid summation notation as much as possible and use the definition of matrix multiplication only at the beginning, where it is really necessary. I would like to be able to explain things like the following.

Why is it that when a change of basis occurs, we express the "old" coordinates through the "new" ones, but when a linear operator is applied, we express the "new" coordinates through the "old" ones?
How do we find the matrix of a linear operator in a different basis?
Suppose a linear operator $\varphi$ on $\Bbb R^n$ maps a sequence of vectors $\mathcal{A}=(a_1,\dots,a_n)$ to $\mathcal{B}=(b_1,\dots,b_n)$, and $\mathcal{A}$ is linearly independent. How do we find the matrix of $\varphi$ in a basis $\mathcal{E}$ given the coordinates of $\mathcal{A}$ and $\mathcal{B}$ in $\mathcal{E}$?
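
For the last question, one way to see the expected answer numerically: if $A$ and $B$ are the matrices whose columns are the $\mathcal{E}$-coordinates of the $a_i$ and $b_i$, then $\varphi a_i=b_i$ for all $i$ says $MA=B$ for the matrix $M$ of $\varphi$ in $\mathcal{E}$, so $M=BA^{-1}$. The sketch below checks this on made-up data in $\Bbb R^2$; the helpers `matmul` and `inverse` and the numbers are only illustrative assumptions.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def inverse(M):
    """Gauss-Jordan inverse over exact fractions; raises if M is singular."""
    n = len(M)
    # Augment M with the identity matrix.
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for c in range(n):
        pivot = next((r for r in range(c, n) if aug[r][c] != 0), None)
        if pivot is None:
            raise ValueError("columns are linearly dependent")
        aug[c], aug[pivot] = aug[pivot], aug[c]
        p = aug[c][c]
        aug[c] = [v / p for v in aug[c]]
        for r in range(n):
            if r != c:
                f = aug[r][c]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

# Hypothetical data: columns of A are a_1, a_2 and columns of B are b_1, b_2,
# all written in the coordinates of the basis E.
A = [[1, 1],
     [0, 1]]
B = [[2, 3],
     [1, 0]]

# phi a_i = b_i for every i means M A = B, hence M = B A^{-1}.
M = matmul(B, inverse(A))
print(M)
```

Linear independence of $\mathcal{A}$ is what makes $A$ invertible here; for a dependent family `inverse` raises an error.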

While it is possible to explain the change of basis referring to "new" and "old" coordinates of a single vector in two bases, applying an operator $\varphi$ to a vector $v$ involves four sets of coordinates:

coordinates of $v$ in the initial basis $\mathcal{E}$,
coordinates of $\varphi v$ in the initial basis $\mathcal{E}$,
coordinates of $v$ in the new basis $\varphi\mathcal{E}$ and
coordinates of $\varphi v$ in the new basis $\varphi\mathcal{E}$.

It's no longer enough to use $x$ and $x'$ for coordinates. It really helps if we can say precisely which vector in which basis has which coordinates.

A popular idea is to write $[v]_{\mathcal{E}}$ to refer to coordinates of vector $v$ in basis $\mathcal{E}$. Similarly, $[\varphi]_{\mathcal{E}}$ denotes the matrix of $\varphi$ in $\mathcal{E}$ and if $\mathcal{E}'=(e_1',\dots,e_n')$, then $[\mathcal{E}']_{\mathcal{E}}$ is the matrix with columns $[e_1']_{\mathcal{E}},\dots,[e_n']_{\mathcal{E}}$, i.e., the transition matrix from $\mathcal{E}$ to $\mathcal{E}'$. By definition,
\[
[\varphi]_{\mathcal{E}}=[\varphi\mathcal{E}]_{\mathcal{E}}.\tag{1}
\]
Then we can state and prove the following properties.
\begin{align}
&[\mathcal{E}']_{\mathcal{E}}[v]_{\mathcal{E}'}=[v]_{\mathcal{E}}\tag{2}\\
&[v]_{\mathcal{E}}=[\varphi v]_{\varphi\mathcal{E}}\tag{3}
\end{align}
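
A small numerical check of (2) and (3) may help fix the meaning of the symbols. It is only a sketch under assumptions that are not part of the post: $\mathcal{E}$ is taken to be the standard basis of $\Bbb R^2$ (so $[w]_{\mathcal{E}}$ is $w$ itself), and the vectors and the invertible matrix `phi` are made up.

```python
def combo(vectors, coeffs):
    """Linear combination sum_i coeffs[i] * vectors[i] of vectors in R^n."""
    n = len(vectors[0])
    return [sum(c * v[k] for c, v in zip(coeffs, vectors)) for k in range(n)]

def matvec(M, x):
    """Matrix (list of rows) times a column x."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def as_columns(vectors):
    """Matrix whose columns are the given vectors."""
    return [list(row) for row in zip(*vectors)]

# Assumption: E is the standard basis of R^2.
E = [[1, 0], [0, 1]]
E_prime = [[1, 1], [1, -1]]        # e_1', e_2' written in E
v_in_E_prime = [3, 2]              # [v]_{E'}, i.e. v = 3 e_1' + 2 e_2'
v_in_E = combo(E_prime, v_in_E_prime)

# (2): [E']_E [v]_{E'} = [v]_E
lhs_2 = matvec(as_columns(E_prime), v_in_E_prime)

# (3): phi is a hypothetical invertible operator, given by its matrix in E.
phi = [[2, 1], [1, 1]]
phi_E = [matvec(phi, e) for e in E]    # the basis phi E
phi_v = matvec(phi, v_in_E)
# The vector whose coordinates in phi E are [v]_E ...
rhs_3 = combo(phi_E, v_in_E)
# ... is phi v itself, so [phi v]_{phi E} = [v]_E.
print(lhs_2, v_in_E, rhs_3, phi_v)
```

Property (3) is just linearity: $v=\sum x_ie_i$ gives $\varphi v=\sum x_i\varphi e_i$. Reading the $x_i$ as *the* coordinates of $\varphi v$ requires $\varphi\mathcal{E}$ to be a basis.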

Using this, we can prove that
\[
[\varphi v]_{\mathcal{E}}=[\varphi]_{\mathcal{E}}[v]_{\mathcal{E}}.\tag{4}
\]
Indeed,
\[
[\varphi v]_{\mathcal{E}}\overset{(2)}{=}[\varphi\mathcal{E}]_{\mathcal{E}}[\varphi v]_{\varphi\mathcal{E}}
\overset{(1)}{=}[\varphi]_{\mathcal{E}}[\varphi v]_{\varphi\mathcal{E}}\overset{(3)}{=}[\varphi]_{\mathcal{E}}[v]_{\mathcal{E}}.\tag{5}
\]
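
To see (4) at work in a basis that is not the standard one, here is a minimal sketch in $\Bbb R^2$. The operator `phi`, the basis `E` and the vector `v` are invented for illustration, and `coords` computes $[w]_{\mathcal{E}}$ by Cramer's rule, which only covers the two-dimensional case.

```python
from fractions import Fraction

def coords(basis, w):
    """Coordinates of w in a basis (b1, b2) of R^2, via Cramer's rule."""
    (a, c), (b, d) = basis                 # b1 = (a, c), b2 = (b, d)
    det = a * d - b * c
    if det == 0:
        raise ValueError("the two vectors do not form a basis")
    return [Fraction(w[0] * d - b * w[1], det),
            Fraction(a * w[1] - c * w[0], det)]

def matvec(M, x):
    """Matrix (list of rows) times a column x."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def phi(w):
    """A hypothetical linear operator on R^2, given in standard coordinates."""
    return [2 * w[0] + w[1], w[1]]

E = ([1, 1], [1, 2])                       # a non-standard basis e_1, e_2

# (1): the columns of [phi]_E are the E-coordinates of phi e_1 and phi e_2.
cols = [coords(E, phi(e)) for e in E]
phi_in_E = [list(row) for row in zip(*cols)]

v = [2, 3]
lhs = coords(E, phi(v))                    # [phi v]_E
rhs = matvec(phi_in_E, coords(E, v))       # [phi]_E [v]_E
print(phi_in_E, lhs, rhs)
```

The two sides are computed independently: `lhs` applies the operator first and then takes coordinates, while `rhs` takes coordinates first and then multiplies by the matrix.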

For another example, here is a summary of Deveno's explanation that $[\varphi]_{\mathcal{E}'}=[\mathcal{E}]_{\mathcal{E}'}[\varphi]_{\mathcal{E}}[\mathcal{E}']_{\mathcal{E}}$ (https://driven2services.com/staging/mh/index.php?posts/55983/). For any $v$,
\[
[\mathcal{E}]_{\mathcal{E}'}[\varphi]_{\mathcal{E}}[\mathcal{E}']_{\mathcal{E}}[v]_{\mathcal{E}'}
\overset{(2)}{=}
[\mathcal{E}]_{\mathcal{E}'}[\varphi]_{\mathcal{E}}[v]_{\mathcal{E}}
\overset{(4)}{=}
[\mathcal{E}]_{\mathcal{E}'}[\varphi v]_{\mathcal{E}}
\overset{(2)}{=}
[\varphi v]_{\mathcal{E}'}.
\]
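
The change-of-basis formula can also be checked numerically. The sketch below assumes made-up $2\times 2$ matrices for $[\varphi]_{\mathcal{E}}$ and $[\mathcal{E}']_{\mathcal{E}}$, and uses the fact that $[\mathcal{E}]_{\mathcal{E}'}$ is the inverse of $[\mathcal{E}']_{\mathcal{E}}$, which follows from applying (2) in both directions. The helper names are hypothetical.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def inverse2(M):
    """Inverse of a 2x2 matrix, with exact fractions."""
    (a, b), (c, d) = M
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]

phi_E = [[2, 1],
         [0, 1]]                 # [phi]_E (hypothetical)
T = [[1, 1],
     [1, 2]]                     # [E']_E: columns are [e_1']_E and [e_2']_E
T_inv = inverse2(T)              # [E]_{E'}

# [phi]_{E'} = [E]_{E'} [phi]_E [E']_E
phi_E_prime = matmul(matmul(T_inv, phi_E), T)
print(phi_E_prime)

# Inverse-free check: column i of [phi]_{E'} holds the E'-coordinates of phi e_i',
# so converting those columns back to E must give [phi]_E [e_i']_E.
print(matmul(T, phi_E_prime) == matmul(phi_E, T))
```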

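This notation seems short and expressive, but unfortunately $[v]_{\mathcal{E}}$ does not make sense if $\mathcal{E}$ is not a basis. So if $\varphi$ is not an isomorphism, then $\varphi\mathcal{E}$ is not a basis, the coordinates $[\varphi v]_{\varphi\mathcal{E}}$ in (3) are undefined, and the proof (5) does not quite work.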

It is possible to define the inverse operation: if $x$ is a column of numbers, then $(x)_{\mathcal{E}}\overset{\text{def}}{=}\mathcal{E}x$ is the linear combination of the vectors of $\mathcal{E}$ with coefficients $x$. This operation is well-defined even if the vectors of $\mathcal{E}$ are linearly dependent. I have not yet finished rewriting (1)-(4) using this notation, but even if this is possible, I wonder whether the proofs would be too obscure and give little insight.
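
As a rough illustration of why $(x)_{\mathcal{E}}$ is safe where $[v]_{\mathcal{E}}$ is not, the sketch below evaluates it on a made-up linearly dependent family in $\Bbb R^2$. The name `from_coords` is hypothetical.

```python
def from_coords(family, x):
    """(x)_E: the linear combination of the vectors in `family` with coefficients x."""
    n = len(family[0])
    return [sum(c * v[k] for c, v in zip(x, family)) for k in range(n)]

# A linearly dependent family: the second vector is twice the first.
dependent = [[1, 0], [2, 0]]

u = from_coords(dependent, [2, 0])   # 2 * (1, 0)
w = from_coords(dependent, [0, 1])   # 1 * (2, 0)
print(u, w)
```

Both coefficient columns produce the same vector, so the map $x\mapsto(x)_{\mathcal{E}}$ is perfectly well-defined, but it is not injective, and that is exactly why it cannot be inverted to give $[v]_{\mathcal{E}}$.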

How do authors and lecturers usually deal with this? Also, I am wondering if there is a generalization of the operation of taking coordinates. Perhaps coordinates can be thought of as a morphism in category theory from $V\times\dots\times V$ to $V$ taking a basis into a vector. Maybe such a generalization can give a hint for a suitable notation.

Thank you.

I like Serena · Aug 29, 2014

ThePerfectHacker said:

Who needs change-of-basis when one has commutative diagrams?

Suppose $\alpha: \mathcal A \to \mathcal E$ and $\beta: \mathcal B \to \mathcal E$ are the canonical transformations. And suppose $M_\phi$ is the requested matrix.

View attachment 3078

Then:
$$M_\phi = \beta \circ \varphi \circ \alpha^{-1}$$
