Newtonian force as a covariant or contravariant quantity

In summary, the conversation discusses the concept of force and its relationship to vectors and dual vectors in the context of Newtonian mechanics. This idea is presented by Burke in his book called Div, Grad, and Curl are Dead. It is mentioned that force is a 1-form and dual to vectors, which is supported by the statement that energy is a scalar and displacement is a contravariant vector. However, there is some disagreement about whether energy is truly a scalar or a scalar density. The conversation also touches on the issue of symmetry between vectors and their duals and the role of a metric in writing Newton's Second Law. Finally, it is mentioned that while the dual space of \mathbb{R}^{3} is isomorphic
  • #106
bcrowell said:
When I say there's no metric, I mean that there's no metric on the two-dimensional space of [itex](\phi,\theta)[/itex].

I see. And regarding the definition of parallel - is it defined by a connection?
 
Physics news on Phys.org
  • #107
bcrowell said:
Here's another thing I have to try to wrap my head around: I think Killing vectors are naturally lower-index vectors, and even though you do have a metric, it doesn't make sense to try to raise their indices. The reason is that the Killing vector is a field, not just a vector defined at a point, and the metric is varying...Does that make sense!?
I'm not sure I understand this. A killing vector field is defined as a vector field such that the lie derivative of the metric tensor along this vector field is zero. We can talk of killing vector fields in terms of their flows and so on. The lie derivative measures the rate of change of a tensor field along the flow of a vector field so it makes sense to take a killing vector field as, at each point, naturally an element of the tangent space. Of course you can use the metric to raise and lower indices as usual but the way a killing field is defined it uses the notion of a vector field as per the lie derivative and the notion of a vector field as a derivation. I don't see immediately why they would instead be naturally co -vector fields a priori. Note that we raise and lower indices of vector fields all the time as it is a point wise operation done at each point in space - time. This isn't an issue.
 
  • #108
WannabeNewton said:
I'm not sure I understand this. A killing vector field is defined as a vector field such that the lie derivative of the metric tensor along this vector field is zero.

In the Wikipedia article on Killing vectors, it is said that [itex]X^\mu[/itex] is a Killing vector field if for all vectors [itex]Y[/itex] and [itex]Z[/itex],

[itex]g(\nabla_Y (X),Z) + g(Y , \nabla_Z (X)) = 0[/itex]

which is an equation on vector fields [itex]X^\mu[/itex]. However, it also says that in "local coordinates", this is equivalent to

[itex]\nabla_\mu X_\nu + \nabla_\nu X_\mu = 0[/itex]

which is an equation on covector fields [itex]X_\mu[/itex]. I don't quite understand this, because it seems that the latter equation doesn't even involve the metric (except indirectly, through the covariant derivative). It would seem to me that the latter could be defined even for a manifold without a metric, with just a connection, as a definition of a Killing co-vector field [itex]X_\mu[/itex].
 
  • #109
stevendaryl said:
In the Wikipedia article on Killing vectors, it is said that [itex]X^\mu[/itex] is a Killing vector field if for all vectors [itex]Y[/itex] and [itex]Z[/itex],

[itex]g(\nabla_Y (X),Z) + g(Y , \nabla_Z (X)) = 0[/itex]

which is an equation on vector fields [itex]X^\mu[/itex]. However, it also says that in "local coordinates", this is equivalent to

[itex]\nabla_\mu X_\nu + \nabla_\nu X_\mu = 0[/itex]

which is an equation on covector fields [itex]X_\mu[/itex]. I don't quite understand this, because it seems that the latter equation doesn't even involve the metric (except indirectly, through the covariant derivative). It would seem to me that the latter could be defined even for a manifold without a metric, with just a connection, as a definition of a Killing co-vector field [itex]X_\mu[/itex].
Huh?, Killing (co)vectors are those that preserve the metric tensor, how can you define them if there is no metric?
 
  • #110
stevendaryl said:
In the Wikipedia article on Killing vectors, it is said that [itex]X^\mu[/itex] is a Killing vector field if for all vectors [itex]Y[/itex] and [itex]Z[/itex],

[itex]g(\nabla_Y (X),Z) + g(Y , \nabla_Z (X)) = 0[/itex]

which is an equation on vector fields [itex]X^\mu[/itex]. However, it also says that in "local coordinates", this is equivalent to

[itex]\nabla_\mu X_\nu + \nabla_\nu X_\mu = 0[/itex]

which is an equation on covector fields [itex]X_\mu[/itex].
Yeah you can express them that way in local coordinates as you prolly already know because when you compute the first expression in coordinates the metric tensor ends up lowering the indices so I don't know if that would count as making the co - vector expression any more natural.
 
  • #111
bcrowell said:
Here's another thing I have to try to wrap my head around: I think Killing vectors are naturally lower-index vectors, and even though you do have a metric, it doesn't make sense to try to raise their indices. The reason is that the Killing vector is a field, not just a vector defined at a point, and the metric is varying...Does that make sense!?

I don't understand this. What varying metric are you referring to? why doesn't make sense to raise or lower indices for fields if there is a metric?, fields assign a vector or covector to each point in the manifold.
Why the insistence on what is natural for a certain field when examples keep popping up that contradict any abstract "naturalness" in the presence of a metric of the covariance or contravariance of (co)vector fields?
 
  • #112
TrickyDicky said:
Huh?, Killing (co)vectors are those that preserve the metric tensor, how can you define them if there is no metric?

I'm just saying that the equation for a Killing co-vector doesn't mention the metric tensor, and so the equation would make sense even in the absence of a metric, it would seem.
 
  • #113
Trying to clear up my foggy thinking re Killing vectors...

The Killing equation [itex]\nabla_a \xi_b+\nabla_b \xi_a=0[/itex] does use the metric, because the covariant derivative is defined in terms of the metric. Likewise if you define Killing vectors in terms of preserving distances, you're still appealing to a metric. On a bare manifold without a connection, there is no way to define a Killing vector. For instance, if I take the spacetime in and around the planet earth, it has Killing vectors such as rotation and time translation. If I take away the metrical information, then all I have is a topological space that's topologically isomorphic to R^3, and there's no way to say what its Killing vectors are.

You can raise the indices like this [itex]\nabla^a \xi^b+\nabla^b \xi^a=0[/itex]. Note how the structure of the equation forces you to raise the indices on the derivatives as well as the Killing vector. If we're thinking of derivatives as being "naturally" lower-index quantities, then this does provide some motivation for saying that Killing vectos are naturally lower-index. But the fact that this is a *covariant* derivative means that you must have a metric defined, and therefore there's nothing to stop you from raising its index. This is different from the case of a plain old partial derivative.

People normally express Killing vectors using partial derivatives as a basis, e.g., [itex]\partial_t[/itex] for a metric that doesn't change over time. So if there is a "natural" way to write them, it would definitely have to be lower-index.

On the other hand, it would also seem extremely natural to me to express the same symmetry as a translation of the time coordinate, [itex]x^t \rightarrow x^t+dt[/itex].

When you have a Killing vector [itex]\xi_a[/itex], test particles follow trajectories that conserve [itex]\xi_a v^a[/itex]. This seems odd to me because we normally think of momentum as being what's conserved, not velocity. We can multiply by m to get a momentum, but then it would be an upper-index momentum, which is not the natural way to express momentum.

I would like to connect this to DrGreg's #103, which wasn't explicitly written in terms of a discussion of Killing vectors.
 
  • #114
bcrowell said:
Trying to clear up my foggy thinking re Killing vectors...

The Killing equation [itex]\nabla_a \xi_b+\nabla_b \xi_a=0[/itex] does use the metric, because the covariant derivative is defined in terms of the metric.

But covariant derivative only requires a connection, not a metric. A metric is sufficient for a connection, but not necessary.
 
  • #115
stevendaryl said:
But covariant derivative only requires a connection, not a metric. A metric is sufficient for a connection, but not necessary.
The covariant dervative used with Killing fields is not a general connection but the Levi-Civita unique metric connection.
 
  • #116
stevendaryl said:
I'm just saying that the equation for a Killing co-vector doesn't mention the metric tensor, and so the equation would make sense even in the absence of a metric, it would seem.

The equation doesn't make sense without a metric, the metric is implicit in the Christoffel coefficients of the covariant derivatives of the equation, you need the metric to obtain them.
 
  • #117
TrickyDicky said:
The equation doesn't make sense without a metric, the metric is implicit in the Christoffel coefficients of the covariant derivatives of the equation, you need the metric to obtain them.

No, connections are more basic than metrics. It's possible to have a connection even when you don't have a metric.
 
  • #118
stevendaryl said:
But covariant derivative only requires a connection, not a metric. A metric is sufficient for a connection, but not necessary.

OK, but the examples we've come up with so far are all either ones with metrics or ones with no connection at all. Can you come up with a physically well motivated example where there's a connection but no metric, and tie it into this topic?
 
  • #119
bcrowell said:
OK, but the examples we've come up with so far are all either ones with metrics or ones with no connection at all. Can you come up with a physically well motivated example where there's a connection but no metric, and tie it into this topic?

Newton-Cartan theory of gravity has a connection, but no metric (or I should say, it doesn't have a full metric that allows the raising and lowering of indices).
 
  • #120
stevendaryl, the useful property of Killing fields is that their flows are local isometries. You can make any definition you'd like, but what would it be good for?

The wikipedia article has a comment at the end of a generalization to a manifold with no metric, but a group action to substitute for the lack of isometries.
 
  • #121
In GR you deal with levi civita connections but the right way to think of killing vector fields is as vector fields that result in vanishing lie derivative of the metric tensor along their flow i.e their flows are local isometries on the riemanniam manifold. The killing equation with the convectors is just a computational consequence of that.
 
  • #122
WannabeNewton said:
In GR you deal with levi civita connections but the right way to think of killing vector fields is as vector fields that result in vanishing lie derivative of the metric tensor along their flow i.e their flows are local isometries on the riemanniam manifold. The killing equation with the convectors is just a computational consequence of that.

The easiest way for me to picture a Killing vector field is to have a coordinate system in which the components of the metric tensor are independent of one or more coordinate. In that case, the vector field is just the basis vector for that coordinate. What I'm not sure about is whether this is perfectly general---that is, if there is a Killing field, can we always cook up a coordinate system (at least for a region) in which the metric is independent of one of the coordinates?
 
  • #123
stevendaryl said:
The easiest way for me to picture a Killing vector field is to have a coordinate system in which the components of the metric tensor are independent of one or more coordinate. In that case, the vector field is just the basis vector for that coordinate. What I'm not sure about is whether this is perfectly general---that is, if there is a Killing field, can we always cook up a coordinate system (at least for a region) in which the metric is independent of one of the coordinates?

Just take the integral curves of the Killing field for one of the coordinates.
 
  • #124
If you have access, the appendix on killing fields and flows in Carroll's GR text gives a very accessible and excellent treatment of these things.
 
  • #126
  • #127
bcrowell said:
When I say there's no metric, I mean that there's no metric on the two-dimensional space of [itex](\phi,\theta)[/itex].

atyy said:
I see. And regarding the definition of parallel - is it defined by a connection?

Browsing Wiki, there's a comment on an induced metric http://en.wikipedia.org/wiki/Newtonian_dynamics#Embedding_and_the_induced_Riemannian_metric : "Geometrically, the vector-function (7) implements an embedding of the comfiguration space M of the constrained Newtonian dynamical system into the 3N-dimensional flat configuration space of the unconstrained Newtonian dynamical system (3). Due to this embedding the Euclidean structure of the ambient space induces the Riemannian metric onto the manifold M." Aren't [itex](\phi,\theta)[/itex] the [itex]q[/itex] of Wiki?

Google also came up with http://www.mast.queensu.ca/~andrew/teaching/math439/pdf/439notes.pdf which seems useful. Force is defined as a covector on p100: "Thus a force ... is a covector in the cotangent space ... Just why a force should naturally be regarded as a covector rather than a tangent vector is a question best answered by saying, 'Because that's the way it works out.'". Example 2.6.1 (p109) which is continued as Example 2.6.13 (p118) gives an example of a constrained system. It does mention a Riemannian metric.

Maybe one difference is that in general the Riemannian metric is curved. In a flat space, the Riemannian metric can be used to talk about "position vectors" being perpendicular after we have assigned an origin and cartesian coordinates. However, in general, a Riemannian metric acts on velocity vectors, and there is no such thing as a "position vector", just position coordinates.
 
Last edited by a moderator:
  • #128
bcrowell said:
Trying to clear up my foggy thinking re Killing vectors...


If we're thinking of derivatives as being "naturally" lower-index quantities, then this does provide some motivation for saying that Killing vectos are naturally lower-index.
There are higher-index contravariant derivatives too, what you call natural is just pure convention.
bcrowell said:
When you have a Killing vector [itex]\xi_a[/itex], test particles follow trajectories that conserve [itex]\xi_a v^a[/itex]. This seems odd to me because we normally think of momentum as being what's conserved, not velocity. We can multiply by m to get a momentum, but then it would be an upper-index momentum, which is not the natural way to express momentum.
So much for what looks natural.
 
  • #129
stevendaryl said:
No, connections are more basic than metrics. It's possible to have a connection even when you don't have a metric.
I never said otherwise, I was clearly referring to the Killing field vectors in the context of Riemannian manifolds, (like in the Wikipedia page BTW) as was clear from my #115.
 
  • #130
atyy said:
Browsing Wiki, there's a comment on an induced metric http://en.wikipedia.org/wiki/Newtonian_dynamics#Embedding_and_the_induced_Riemannian_metric : "Geometrically, the vector-function (7) implements an embedding of the comfiguration space M of the constrained Newtonian dynamical system into the 3N-dimensional flat configuration space of the unconstrained Newtonian dynamical system (3). Due to this embedding the Euclidean structure of the ambient space induces the Riemannian metric onto the manifold M." Aren't [itex](\phi,\theta)[/itex] the [itex]q[/itex] of Wiki?

I think what they're saying is that you can associate a metric with it of the form [itex]ds^2=d\theta^2+d\phi^2[/itex]. But that isn't a metric that has any physical meaning in the example I made up in #100 of the arm holding a weight. E.g., it doesn't tell you how far the hand moved, or any other physically interesting piece of information. An equally plausible metric would be [itex]ds^2=3d\theta^2+d\phi^2[/itex], or virtually anything else you cared to make up.
 
  • #131
bcrowell said:
People normally express Killing vectors using partial derivatives as a basis, e.g., [itex]\partial_t[/itex] for a metric that doesn't change over time. So if there is a "natural" way to write them, it would definitely have to be lower-index.
The convention we have of using the symbol [itex]\partial_t[/itex] in this context is, I think, rather confusing. Here the symbol is being used, by convention, to represent a vector, not a co-vector, nor a component of a co-vector, viz.
[tex]
\partial_t = \begin{bmatrix} 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}
[/tex](expressed in a [itex](t, x^1, x^2, x^3)[/itex] coordinate system). It's unfortunate the symbol has a "downstairs" suffix, which suggests covariance instead of contravariance. To avoid further confusion, I will denote this vector by [itex]\textbf{e}[/itex], and its components by [itex]e^\alpha[/itex], for the remainder of this post.

(The symbol [itex]\partial_t[/itex] also has a different meaning as the t-component of [itex]\partial_\alpha[/itex].)

bcrowell said:
I would like to connect this to DrGreg's #103, which wasn't explicitly written in terms of a discussion of Killing vectors.
In the notation above combined with post #103, the "covariant energy" is [itex]P_\alpha e^\alpha[/itex], so if [itex]\textbf{e} ( = \partial_t)[/itex] is a Killing vector...
 
  • #132
bcrowell said:
I think what they're saying is that you can associate a metric with it of the form [itex]ds^2=d\theta^2+d\phi^2[/itex]. But that isn't a metric that has any physical meaning in the example I made up in #100 of the arm holding a weight. E.g., it doesn't tell you how far the hand moved, or any other physically interesting piece of information. An equally plausible metric would be [itex]ds^2=3d\theta^2+d\phi^2[/itex], or virtually anything else you cared to make up.

The notes from Lewis I linked to in #127 seem to define "simple" Lagrangian dynamics with a metric. I think it might be better to say that in a general curved space, a metric acts on velocity vectors. In flat space the metric can be carried over to "position vectors", but in a curved space there are only "position coordinates". The "simple" includes constrained systems, Example 2.6.13 (p118).
 
  • #133
Since there seems to be some confusion on Killing vectors...

A Killing vector is a vector that preserves the metric under Lie dragging, i.e.

[tex]\mathcal{L}_K \, g = 0[/tex]
In this case, K is a vector, specifically, and not a covector. This equation is actually defined even in the absence of a connection; Lie dragging does not depend on it.

Now, obviously we have a metric, so we can use it to "lower the index" of K to make it a covector. In this case, the above equation boils down to

[tex]\nabla_\mu K_\nu + \nabla_\nu K_\mu = 0[/tex]
where [itex]\nabla[/itex] is the Levi-Civita connection (this is important). Because this equation specifically depends on choosing the Levi-Civita connection, it is really an equation that involves the metric and its derivatives, just as the first equation does.

If you choose some other connection [itex]\hat{\nabla}[/itex] (or if you have no metric, so there is no natural choice of connection), then you can still write the equation

[tex]\hat{\nabla}_\mu K_\nu + \hat{\nabla}_\nu K_\mu = 0[/tex]
but it is no longer Killing's equation! It is just some equation satisfied by some covector K.

Killing's equation is specifically "The Lie derivative along K of the metric is zero", in whatever form you choose to write it.

P.S. To see how

[tex]g(\nabla_X K, Y) + g(X, \nabla_Y K) = 0[/tex]
gives the usual form of Killing's equation, simply put in [itex]X = \partial_\mu[/itex] and [itex]Y = \partial_\nu[/itex].
 
  • #134
DrGreg said:
The convention we have of using the symbol [itex]\partial_t[/itex] in this context is, I think, rather confusing. Here the symbol is being used, by convention, to represent a vector, not a co-vector, nor a component of a co-vector, viz.
[tex]
\partial_t = \begin{bmatrix} 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}
[/tex](expressed in a [itex](t, x^1, x^2, x^3)[/itex] coordinate system). It's unfortunate the symbol has a "downstairs" suffix, which suggests covariance instead of contravariance. To avoid further confusion, I will denote this vector by [itex]\textbf{e}[/itex], and its components by [itex]e^\alpha[/itex], for the remainder of this post.

(The symbol [itex]\partial_t[/itex] also has a different meaning as the t-component of [itex]\partial_\alpha[/itex].)

No, I'm pretty sure you're wrong about this. The operator [itex]\partial_a=\partial/\partial x^a[/itex] is written with a lower index precisely because it's a covector and transforms like one. The notation isn't misleading. The notation shows exactly how the thing transforms. For instance, you can convince yourself of this by writing down a rescaling of the coordinate and applying the tensor transformation law.
 
  • #135
bcrowell said:
No, I'm pretty sure you're wrong about this. The operator [itex]\partial_a=\partial/\partial x^a[/itex] is written with a lower index precisely because it's a covector and transforms like one. The notation isn't misleading. The notation shows exactly how the thing transforms. For instance, you can convince yourself of this by writing down a rescaling of the coordinate and applying the tensor transformation law.

The notation [itex]\frac{\partial}{\partial x^i}\vert_p[/itex] represents a tangent vector,and not a 1-form. A 1-form would look like [itex]dx^i\vert_p[/itex]. See "Introduction to Smooth Manifolds" by Lee.
 
  • #136
bcrowell said:
No, I'm pretty sure you're wrong about this. The operator [itex]\partial_a=\partial/\partial x^a[/itex] is written with a lower index precisely because it's a covector and transforms like one. The notation isn't misleading. The notation shows exactly how the thing transforms. For instance, you can convince yourself of this by writing down a rescaling of the coordinate and applying the tensor transformation law.
Hello again misturr :). While they do transform in that way, at a given point [itex]p\in M[/itex] and a coordinate chart [itex](U,(x^{i}))[/itex] the n derivations [itex]\left \{ \partial _{1}|_{p},...,\partial _{n}|_{p} \right \}[/itex] actually form a basis for [itex]T_{p}(M)[/itex] so they actually lie in the tangent space at that point.
 
  • #137
Ben Niehoff said:
A Killing vector is a vector that preserves the metric under Lie dragging, i.e.

[tex]\mathcal{L}_K \, g = 0[/tex]
In this case, K is a vector, specifically, and not a covector. This equation is actually defined even in the absence of a connection; Lie dragging does not depend on it.

I wanted to expand on this, because it might seem mysterious, given that the equation usually called "Killing's equation" has nablas in it.

Remember what Killing's equation actually means. The solution K is a vector field that generates an isometry of the manifold. So for example, on a sphere, the vector K points along lines of constant lattitude, and describes the velocity at each point on the sphere if the sphere as a whole were to be rigidly rotated about its axis. Notice that the flow of K along itself does NOT follow parallel transport (especially near the poles).

Of course, in order for K to solve a meaningful differential equation, there must be some way to relate nearby tangent spaces to each other. It is the rigid motion itself that links tangent spaces via the pullback. The connection is never needed.

The point is that ANY vector field generates some 1-parameter flow, called Lie dragging. The flow lines are the integral curves of that vector field. So, using pullbacks along this 1-parameter flow, you can relate different tangent spaces and compare vectors at different points. If that 1-parameter flow is a rigid motion, then you have an isometry (which essentially means, the manifold can be rigidly moved into itself). The conditions for a flow being a rigid motion are precisely Killing's equation*.

* Note: "Rigidity" is really the wrong concept here, as rigidity is an extrinsic property. Consider the rigidity of a triangular lattice in R^2, and now put the same triangular lattice into R^3; it becomes quite floppy. The correct concept is "metric-preserving", which is the intrinsic property that most closely corresponds to rigidity.
 
  • #138
WannabeNewton said:
Hello again misturr :). While they do transform in that way, at a given point [itex]p\in M[/itex] and a coordinate chart [itex](U,(x^{i}))[/itex] the n derivations [itex]\left \{ \partial _{1}|_{p},...,\partial _{n}|_{p} \right \}[/itex] actually form a basis for [itex]T_{p}(M)[/itex] so they actually lie in the tangent space at that point.

It's important to remember that the indices on [itex]\partial_a, \partial_b,[/itex] etc. are indices that indicate "which vector", not "which component". So to indicate contravariant vectors, the indices must be down in order for [itex]X = X^a \partial_a[/itex] to make sense.
 
  • #139
Edit: in the time it took me to write this, I see several others have already made the same point, but I might as well post this anyway.

bcrowell said:
No, I'm pretty sure you're wrong about this. The operator [itex]\partial_a=\partial/\partial x^a[/itex] is written with a lower index precisely because it's a covector and transforms like one. The notation isn't misleading. The notation shows exactly how the thing transforms. For instance, you can convince yourself of this by writing down a rescaling of the coordinate and applying the tensor transformation law.

What you say is true of the second meaning I listed. I agree that
[tex]
\begin{bmatrix}
\frac{\partial}{\partial t} &&
\frac{\partial}{\partial x} &&
\frac{\partial}{\partial y} &&
\frac{\partial}{\partial z}
\end{bmatrix}
[/tex]transforms as a co-vector, where the symbol [itex]\partial / \partial t[/itex] denote one component of a covector.

But the symbol is not being used in that sense. It's being used (in an abuse of notation denoting an isomorphism to another space) to denote a vector, not a component.

It's like writing[tex]
\textbf{V} = V^\alpha \textbf{e}_\alpha = V^0 \textbf{e}_0 + V^1 \textbf{e}_1 + V^2 \textbf{e}_2 + V^3 \textbf{e}_3
[/tex]where [itex]\textbf{e}_0[/itex], [itex]\textbf{e}_1[/itex], ... are four different vectors, not covectors, nor 4 components of one covector.
 
Last edited:
  • #140
DrGreg said:
Edit: in the time it took me to write this, I see several others have already made the same point, but I might as well post this anyway.
Actually, I think yours was the only post that actually cleared up the issue, so I'm glad you posted it anyway!

DrGreg said:
What you say is true of the second meaning I listed. I agree that
[tex]
\begin{bmatrix}
\frac{\partial}{\partial t} &&
\frac{\partial}{\partial x} &&
\frac{\partial}{\partial y} &&
\frac{\partial}{\partial z}
\end{bmatrix}
[/tex]transforms as a co-vector, where the symbol [itex]\partial / \partial t[/itex] denote one component of a covector.

But the symbol is not being used in that sense. It's being used (in an abuse of notation denoting an isomorphism to another space) to denote a vector, not a component.

It's like writing[tex]
\textbf{V} = V^\alpha \textbf{e}_\alpha = V^0 \textbf{e}_0 + V^1 \textbf{e}_1 + V^2 \textbf{e}_2 + V^3 \textbf{e}_3
[/tex]where [itex]\textbf{e}_0[/itex], [itex]\textbf{e}_1[/itex], ... are four different vectors, not covectors, nor 4 components of one covector.

IMO this hits the nail on the head and clears up something that I'd found confusing and that IMO #135 and #136 failed to deal with by resorting to appeals to authority. I think the key words here are "abuse of notation."

Let me see if I can lay out my present understanding and see if others agree with me.

We want a bunch of different things:

(1) We want to be able to use upper and lower indices to notate the real physical differences between two different types of vectors, and as physicists we define these types by their transformation properties.
(2) We want the transformation properties of an object to be apparent from the way we write its indices.
(3) We want to have grammatical rules that make it apparent when we're writing nonsense, e.g., we want to be able to recognize that there's something wrong in an equation like [itex]u_a=v_a w_{abc}[/itex].
(4) We want a notation that's compact and expressive, so we'd like to use Einstein summation notation and avoid writing sigmas.
(5) We want a notation that is widely accepted and understood by other physicists.

Appeals to authority are useful only with respect to #5. Since we don't all have the same printed books on our bookshelves, it's useful to have an online reference that's accessible to all of us, so let's use the online version of Carroll: http://ned.ipac.caltech.edu/level5/March01/Carroll3/Carroll2.html

Carroll said:
an arbitrary one-form is expanded into components as [itex]\omega=\omega_\mu dx^\mu[/itex] [...] We will usually write the components [itex] \omega_{\mu}[/itex] when we speak about a one-form [itex]\omega[/itex].

This violates #2, because the l.h.s. is written without indices, so we identify it as a scalar, and yet it's not a scalar, it's a covector. It satisfies #4 by being written in compact Einstein summation notation. It obeys #2 by writing the covector as [itex] \omega_{\mu}[/itex], but violates it by writing the basis covector as [itex]dx^\mu[/itex].

Carroll (and by proxy physicists in general) is trying to simultaneously satisfy incompatible desires 1-5. A clear way of seeing this is that he wants to write [itex] \omega_{\mu}[/itex] as a synonym for [itex]\omega[/itex]. Now suppose that (in old-fashioned non-abstract index notation, indicated by the Greek indices), we have [itex]\omega_0=1[/itex] for [itex]\mu=1[/itex] and all the other components zero. Then we have [itex]\omega=dx^\mu[/itex]. But [itex] \omega_{\mu}[/itex] is a synonym for [itex]\omega[/itex], so can substitute in and make it [itex]\omega_{\mu}=dx^\mu[/itex]. Oops, this violates #3.

I think what's really happening here is that in a misguided attempt to satisfy 3 and 4, we write expressions like [itex]\omega_\mu dx^\mu[/itex]. This reads like the scalar product of a vector and a covector, which isn't what it is. Maybe a better notation would be something like [itex]\omega_{(.)}=\sum_\mu\omega(\mu) \partial_\mu[/itex]. It violates 4 and 5, but it obeys 1-3.
 
Last edited:
Back
Top