Is Covariant Derivative Notation Misleading in Vector Calculus?

In summary, the notation for covariant derivatives is confusing because it is not actually a tensor. The notation is clunky and weird because first you take a covariant derivative of the vector, and the result is another vector. With my notation, the expression is just the partial derivative of the original vector.
  • #106
stevendaryl said:
My point is that it is ambiguous: Are you operating on a vector ##V## and then taking component ##\nu## of the result, or are you operating on the component ##V^\nu##?

With my personal convention for such things, ##\nabla_\mu## is just shorthand for the operator ##\nabla_{e_\mu}##. With this convention, plus linearity and the Leibniz rule for derivatives, we can write:

##\nabla_\mu V = \nabla_\mu (V^\sigma e_\sigma) = (\nabla_\mu V^\sigma) e_\sigma + V^\sigma (\nabla_\mu e_\sigma)##

Taking components of both sides (by operating with ##e^\nu##) gives:

##(\nabla_\mu V)^\nu = \nabla_\mu V^\nu + \Gamma^\nu_{\mu \sigma} V^\sigma##

So rather than saying ##(\nabla V)^\nu_\mu = \nabla_\mu V^\nu##, I would say it's equal to ##\nabla_\mu V^\nu + \Gamma^\nu_{\mu \sigma} V^\sigma##

I guess @PeterDonis would say that this ambiguity is resolved by denying that ##\nabla_\mu## is an operator; it only appears in the context of the expression ##\nabla_\mu V^\nu## where the meaning is ##(\nabla V)^\mu_\nu##.
 
Physics news on Phys.org
  • #107
stevendaryl said:
My point is that it is ambiguous: Are you operating on a vector V and then taking component ν of the result, or are you operating on the component Vν?
My interpretation of the notation is that I'm operating on tensor components. In any case what's of utmost importance is to also take care of the horizontal positioning of the indices. I can accept a notation like ##{(\nabla V)_{\mu}}^{\nu}##, though I think it's utmost inconvenient. I don't like books, where they don't care about the horizontal index position. It's already a night mare if it's not obeyed for Lorentz-transformation matrices in SRT!

Of course at the end ##\nabla V## (which is also sloppy notation for ##\nabla \otimes V##!) is always the same tensor, which is independent of the choice of bases and coordinates. In my notation its
$$\nabla V=\mathrm{d} x^{\mu} \nabla_{\mu} V^{\nu} \partial_{\nu}$$
in the usual notation ##\partial_{\mu}## and ##\mathrm{d} x^{\mu}## for the holonomous basis and its dual given some coordinates.

As I said before, I think there's not much to argue about. It's all just convention and one must make sure to understand the notation of each book/paper right.
 
  • Like
Likes etotheipi
  • #108
stevendaryl said:
I guess @PeterDonis would say that this ambiguity is resolved by denying that ##\nabla_\mu## is an operator

Not necessarily, no. With your convention, where ##\nabla_\mu## means ##\nabla_{e_\mu}##, the directional derivative operator along ##e_\mu##, it is obviously an operator. With the convention I'm used to, ##\nabla_\mu##, used in isolation, is just a way of referring to the covariant derivative operator ##\nabla## itself--Wald would write it as ##\nabla_a##. But in either case it's an operator. The ambiguity in the convention I'm used to is, as you say, that sometimes (usually in expressions where it's combined with other things), ##\nabla_\mu## can mean the ##\mu## component of some tensor obtained by applying the ##\nabla## operator to something; that is indeed not an operator. (As we have seen, it ends up being the same as the directional derivative in the ##e_\mu## direction of the thing the ##\nabla## is operating on. But that still doesn't resolve all ambiguities; see below.)

The only real way to resolve ambiguity is to, well, resolve ambiguity, by adding more notation until the expression is unambiguous.

For example, in Wald's abstract index notation, the various objects you have given would look like this:

Directional derivative of ##V##:

$$
\nabla_\mu V = \left[ \left( e_\mu \right)^a \nabla_a \right] V^b
$$

(Note the brackets enclosing the contraction that denotes the directional derivative, to make it unambiguous that it denotes a single operator.)

Extracting the ##\nu## component:

$$
\left( \nabla_\mu V \right)^\nu = \left[ \left( e_\mu \right)^a \nabla_a V^b \right] \left( e^\nu \right)_b
$$

(Here I don't have a third bracket type to use, so I'm relying on the first expression above to make it clear what the directional derivative operator is, and using the brackets to make clear that the operator is only operating on ##V##; the operator produces a vector, and we contract that vector with ##e^\nu## to extract the component.)

Directional derivative of the ##\nu## component of ##V##:

$$
\nabla_\mu V^\nu = \left[ \left( e_\mu \right)^a \nabla_a \right] \left[ V^b \left( e^\nu \right)_b \right]
$$

Expanding out the above (since now the directional derivative is operating on both ##V## and ##e_\nu##, as the brackets in the above expression make clear):

$$
\nabla_\mu V^\nu = \left[ \left( e_\mu \right)^a \nabla_a V^b \right] \left( e^\nu \right)_b + V^b \left[ \left( e_\mu \right)^a \nabla_a \left( e^\nu \right)_b \right]
$$

Notice that this does not give the same result as above.

Covariant derivative of ##V##:

$$
\nabla V = \nabla_a V^b
$$

Extracting the ##\mu##, ##\nu## component:

$$
\left( \nabla V \right)_\mu{}^\nu = \left( \nabla_a V^b \right) \left( e_\mu \right)^a \left( e^\nu \right)_b
$$

As we have seen in previous posts, since covariant differentiation commutes with contraction, this is equal to the ##\nu## component of the directional derivative of ##V## in the ##e_\mu## direction, but it is not equal to the directional derivative in the ##\mu## direction of the ##\nu## component of ##V##.
 
Last edited:
  • Like
Likes vanhees71 and etotheipi
  • #109
PeterDonis said:
The components aren't numerically the same, since they are components taken with respect to two different choices of basis. The index numbers are the same, but that's just because we number them by dimensions without taking into account anything about the particular coordinate choice. But if we were to designate indexes by coordinate instead of by index number, we would have, for example, ##V^t##, ##V^r##, ##V^\theta##, and ##V^\phi## for a coordinate basis as compared with ##V^T##, ##V^X##, ##V^Y##, and ##V^Z## for an orthonormal basis.
Even in the case when just numbering the indices, this can be addressed by using ##0123## for an unprimed coordinate system and ##0' 1' 2' 3'## for a primed coordinate system.
 
  • #110
Sorry to resume this thread. I've a doubt about the notation used by Wald for torsion-free condition of a covariant derivative operator, namely ##\nabla_a \nabla_b f = \nabla_b \nabla_a f## (Wald employes extensively abstract index notation).

It should be actually ##\nabla_a (\nabla_b f) = \nabla_b (\nabla_a f)##, I think: the result of the brackets () is a (0,1) tensor - a covector - and then the second instance of covariant derivative operator acts on it.

Is that correct ? Thanks
 
Last edited:
  • #111
Yes, this is correct. But that bracketing is considered superfluous, so it's omitted.
 
  • #112
dextercioby said:
Yes, this is correct. But that bracketing is considered superfluous, so it's omitted.
ok good. I believe the two sides (LHS and RHS) with named abstract indexes ##a## and ##b## reversed make sense only if we think of (or imagine) each of them acting on (or contracting with) a given 'fixed' (2,0) tensor field (e.g. ##u^av^b##).

Otherwise what sense would it make ? The two sides are actually the same - the same (0,2) tensor object having just the 2 ordered slots with reversed names.
 
Last edited:
  • #113
The covariant derivatives applied to a scalar indeed commute. In usual Ricci calculus that's very easy to see
$$\nabla_{\mu} \Phi=\partial_{\mu} \Phi$$
and then
$$\nabla_{\nu} \nabla_{\mu} \Phi = \partial_{\nu} \partial_{\mu} \Phi -{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi.$$
Since the partial derivatives commute (under the usual assumptions about the smoothness of ##\Phi##) and the Christoffel symbols are symmetric in a pseudo-Riemann manifold, you indeed have
$$\nabla_{\nu} \nabla_{\mu} \Phi = \nabla_{\mu} \nabla_{\nu} \Phi.$$
 
  • #114
vanhees71 said:
$$\nabla_{\nu} \nabla_{\mu} \Phi = \partial_{\nu} \partial_{\mu} \Phi -{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi.$$ Since the partial derivatives commute (under the usual assumptions about the smoothness of ##\Phi##) and the Christoffel symbols are symmetric in a pseudo-Riemann manifold, you indeed have
$$\nabla_{\nu} \nabla_{\mu} \Phi = \nabla_{\mu} \nabla_{\nu} \Phi.$$
I could be wrong but using Leibniz rule it should be (basically the partial derivatives order in the first term on RHS is reversed) $$\nabla_{\nu} (\nabla_{\mu} \Phi) = \partial_{\mu} \partial_{\nu} \Phi -{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi$$

Then, as you pointed out, since Christoffel symbols are symmetric and using partial derivative commutativity we get the result.
 
Last edited:
  • #115
Well, yes, my order of the lower Christoffel-symbol indices should be switched in the first expression, but of course they are symmetric, which is why I usually don't care. You are right, in more general cases of spaces with torsion, one has to keep an eye on the order of those indices. I hope, here I get it formally right with the index ordering:
$$\nabla_{\nu} \nabla_{\mu} \phi=\nabla_{\nu} \partial_{\mu} \Phi = \partial_{\nu} \partial_{\mu} \Phi -{\Gamma^{\rho}}_{\nu \mu} \partial_{\rho} \Phi=\partial_{\mu} \partial_{\nu} \Phi -{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi=\nabla_{\mu} \partial_{\nu} \Phi=\nabla_{\mu} \nabla_{\nu} \Phi.$$
 
  • #116
vanhees71 said:
Well, yes, my order of the lower Christoffel-symbol indices should be switched in the first expression, but of course they are symmetric, which is why I usually don't care. You are right, in more general cases of spaces with torsion, one has to keep an eye on the order of those indices. I hope, here I get it formally right with the index ordering:
$$\nabla_{\nu} \nabla_{\mu} \phi=\nabla_{\nu} \partial_{\mu} \Phi = \partial_{\nu} \partial_{\mu} \Phi -{\Gamma^{\rho}}_{\nu \mu} \partial_{\rho} \Phi=\partial_{\mu} \partial_{\nu} \Phi -{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi=\nabla_{\mu} \partial_{\nu} \Phi=\nabla_{\mu} \nabla_{\nu} \Phi.$$
My point was actually about partial derivatives order in the first term on RHS.
 
  • Like
Likes vanhees71
  • #117
I see, yes, but then we first apply ##\partial_{\mu}## and then ##\partial_{\nu}##, i.e., we have ##\partial_{\nu} \partial_{\mu}## in the first step. At the end it's right anyway, because the operators in question commute ;-)).
 
  • Like
Likes cianfa72
  • #118
Sorry, I believe the sign of Christoffel-symbol should be '+'. Btw I believe we're mixing again the meaning of Greek indices ##\mu## and ##\nu## (tensor component indexes in a basis vs "which vector in the basis"). In a coordinate basis (holonomic) it should be fine, however.
$$\nabla_{\nu} \nabla_{\mu} \phi=\nabla_{\nu} \partial_{\mu} \Phi = \partial_{\mu} \partial_{\nu} \Phi + (\nabla_{\nu} \partial_{\mu}) \Phi = \partial_{\mu} \partial_{\nu} \Phi +{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi=\partial_{\nu} \partial_{\mu} \Phi +{\Gamma^{\rho}}_{\nu \mu} \partial_{\rho} \Phi=\nabla_{\mu} \partial_{\nu} \Phi=\nabla_{\mu} \nabla_{\nu} \Phi.$$ Note in fact that ##\nabla_{\nu}\partial_{\mu}## is actually the covariant derivative in the direction ##\nu## (i.e. in the direction ##\partial_{\nu}##) of the vector ##\partial_{\mu}##.
 
Last edited:
  • #119
cianfa72 said:
Sorry, I believe the sign of Christoffel-symbol should be '+'. Btw I believe we're mixing again the meaning of Greek indices ##\mu## and ##\nu## (tensor component indexes in a basis vs "which vector in the basis"). In a coordinate basis (holonomic) it should be fine, however.
$$\nabla_{\nu} \nabla_{\mu} \phi=\nabla_{\nu} \partial_{\mu} \Phi = \partial_{\mu} \partial_{\nu} \Phi + (\nabla_{\nu} \partial_{\mu}) \Phi = \partial_{\mu} \partial_{\nu} \Phi +{\Gamma^{\rho}}_{\mu \nu} \partial_{\rho} \Phi=\partial_{\nu} \partial_{\mu} \Phi +{\Gamma^{\rho}}_{\nu \mu} \partial_{\rho} \Phi=\nabla_{\mu} \partial_{\nu} \Phi=\nabla_{\mu} \nabla_{\nu} \Phi.$$ Note in fact that ##\nabla_{\nu}\partial_{\mu}## is actually the covariant derivative in the direction ##\nu## (i.e. in the direction ##\partial_{\nu}##) of the vector ##\partial_{\mu}##.
That’s not how covariant differentiation works.
 
  • #120
Orodruin said:
That’s not how covariant differentiation works.
Yes, it was wrong. I tried to do the complete job: the goal is work out the component ##\mu##, ##\nu## of the tensor ##\nabla(\nabla \Phi##) in a coordinate basis.

As tensor ##\nabla \Phi = (\partial_{\alpha} \Phi) dx^{\alpha}## then
$$\nabla (\nabla \Phi)= [\partial_{\beta} \partial_{\alpha} \Phi - \partial_{\rho} \Phi {\Gamma^{\rho}}_{\alpha \beta}] dx^{\alpha} \otimes dx^{\beta}$$ Contract it with ##\partial_{\mu}## and ##\partial_{\nu}## to get the ##\mu##, ##\nu## component:
$$(\nabla (\nabla \Phi))_{\mu \nu}= [\partial_{\beta} \partial_{\alpha} \Phi - \partial_{\rho} \Phi {\Gamma^{\rho}}_{\alpha \beta}] dx^{\alpha}(\partial_{\mu}) dx^{\beta}(\partial_{\nu})$$$$ \nabla_{\mu} \nabla_{\nu} \Phi = (\nabla \nabla \Phi)_{\mu \nu}= [\partial_{\nu} \partial_{\mu} \Phi - \partial_{\rho} \Phi {\Gamma^{\rho}}_{\mu \nu}]$$
Is that correct now ? Thanks
 
Last edited:
  • Like
Likes Orodruin
  • #121
cianfa72 said:
Is that correct now ?
Looks good.
 
  • #122
Orodruin said:
Looks good.
And what was wrong with my derivation? I don't see any difference.
 
  • #123
vanhees71 said:
And what was wrong with my derivation? I don't see any difference.
I never said anything was wrong with it. I complained about #118.
 
  • Like
Likes cianfa72 and vanhees71
  • #124
Orodruin said:
I never said anything was wrong with it. I complained about #118.
Yep, my fault sorry.
 
  • #125
dextercioby said:
I cannot believe there are 100 posts here about a simple pure ... issue
This, unfortunately, has become a characteristic feature in here.
dextercioby said:
In mathematics ##\nabla_{\mu}V^{\nu}## is ill defined
No, it is not. In mathematics we define things. So, on a generic tensor (density) [itex]T_{A} \equiv T^{\rho_{1}\cdots \rho_{r}}_{{}\tau_{1}\cdots \tau_{s}}[/itex], I define the operator [itex]\nabla_{\mu}[/itex] by the rule [tex]\nabla_{\mu}T_{A} \equiv \partial_{\mu}T_{A} + \Gamma^{\lambda}_{\mu\nu}[T_{A}]^{\nu}{}_{\lambda} ,[/tex] where [tex][T^{\rho_{1} \cdots \rho_{r}}_{{}\tau_{1}\cdots \tau_{s}}]^{\nu}{}_{\lambda} \equiv \sum_{p = 1}^{r} \delta^{\rho_{p}}_{\lambda}T^{\rho_{1}\cdots \rho_{p-1}\nu \rho_{p+1}\cdots \rho_{r}}_{{}{}{}{}\tau_{1} \cdots \tau_{s}} - \sum_{q = 1}^{s} \delta^{\nu}_{\tau_{q}}T^{\rho_{1}\cdots \rho_{r}}_{{}\tau_{1}\cdots \tau_{q-1}\lambda \tau_{q+1}\cdots \tau_{s}} - \delta^{\nu}_{\lambda}T_{A} ,[/tex] with last term is absent when [itex]T_{A}[/itex] is not a density.

Remaks: (1) Notice that [itex][T_{A}]^{\nu}{}_{\lambda} \epsilon^{\lambda}{}_{\nu}[/itex] is nothing but the change of [itex]T_{A}[/itex] under an infinitesimal [itex]\mbox{GL}(n)[/itex] transformation parametrized by [itex]\epsilon^{\lambda}{}_{\nu}[/itex]. So, for any object [itex]\Psi[/itex], we define [itex][\Psi]^{\nu}{}_{\lambda}[/itex] by its infinitesimal transformation under the general linear group [itex]\mbox{GL}(n)[/itex].

(2) You can show that the (above defined) operator [itex]\nabla_{\mu}[/itex] satisfies the Leibniz rule.
 
  • Like
Likes vanhees71 and ergospherical

Similar threads

Replies
7
Views
2K
Replies
7
Views
725
Replies
19
Views
2K
Replies
22
Views
3K
Replies
6
Views
2K
Replies
62
Views
5K
Replies
10
Views
1K
Back
Top