Covariant Derivative - where does the minus sign come from?

In summary, the minus sign in the second equation is due to the different transformation laws for vectors and covectors, and it arises when contracting with the metric tensor.
  • #1
unscientific
1,734
13
I was reading through hobson and my notes where the covariant acts on contravariant and covariant tensors as
[tex]\nabla_\alpha V^\mu = \partial_\alpha V^\mu + \Gamma^\mu_{\alpha \gamma} V^\gamma[/tex]
[tex]\nabla_\alpha V_\mu = \partial_\alpha V_\mu - \Gamma^\gamma_{\alpha \mu} V_\gamma [/tex]

Why is there a minus sign in the second equation?

I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]
[tex]\nabla_\alpha V_{\gamma} = \partial_\alpha V_\gamma + \Gamma^\mu_{\alpha \gamma} V_\mu [/tex]
Swapping ##\mu## and ##\gamma## we have
[tex]\nabla_\alpha V_{\mu} = \partial_\alpha V_\mu + \Gamma^\gamma_{\alpha \mu} V_\gamma [/tex]
 
Physics news on Phys.org
  • #2
unscientific said:
Why is there a minus sign in the second equation?

Because of the different transformation laws that are obeyed by vectors and covectors, i.e., by upper and lower index quantities.
 
Last edited:
  • #3
unscientific said:
I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]
I don't know the answer to your question, but this part is wrong. You cannot have one upstairs and two downstairs repeated indexes.
 
  • #4
I tried to look up a derivation, but couldn't find it anywhere, so I'm asking if anyone has seen a proof or working that starts from the first equation and shows how the minus sign appears.
 
  • #5
Carroll's notes seem to treat this in quite some detail.
http://preposterousuniverse.com/grnotes/grnotes-three.pdf

I haven't thoroughly checked them since I used the book that came from those.

However I suspect these show even more steps if memory serves me.

P.S. try checking those and come back if you have problems

[edit] added link
 
Last edited by a moderator:
  • Like
Likes Mentz114
  • #6
JorisL said:
Carroll's notes seem to treat this in quite some detail.
http://preposterousuniverse.com/grnotes/grnotes-three.pdf

I haven't thoroughly checked them since I used the book that came from those.

However I suspect these show even more steps if memory serves me.

P.S. try checking those and come back if you have problems

[edit] added link

Good answer. Carrolls explanation is very well put together.
 
Last edited by a moderator:
  • #7
unscientific said:
I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]
[tex]\nabla_\alpha V_{\gamma} = \partial_\alpha V_\gamma + \Gamma^\mu_{\alpha \gamma} V_\mu [/tex]

Notice that there is a summation for ##\gamma## on the vector and Christoffel symbols in the original formula (as Dalespam said, this is invalid). Multiplying by ##g_{\mu\gamma}## will not work. What you can do is introduce another index and multiply by ##g_{\mu\tau}## For the second term though, you actually lower the Christoffel symbol index instead of the vector index so you will get: [tex]\nabla_\alpha V_\mu = \partial_\alpha V_\mu +\Gamma_{\mu\alpha\gamma}V^\gamma[/tex]

As you can see, now you are dealing with a Christoffel symbol of the first kind (##\Gamma_{\mu\nu\gamma}\equiv g_{\mu\tau}\Gamma^{\tau}_{~~\nu\gamma}##) instead of a Christoffel symbol of the second kind that we are usually more familiar with.

EDIT: WARNING: There's an error in my first term.
 
Last edited:
  • #8
For some intuition, I usually first try to tell students how it works in curvilinear coordinates on the flat ##\mathbb R^n##. Given a set of curvilinear coordinates ##x^\mu## we can define two sets of basis vectors ##\vec E_\mu = \partial \vec r /\partial x^\mu## and ##\vec E^\mu = \nabla x^\mu## (where ##\nabla## is the gradient and ##x^\mu## a coordinate function). These bases are neither orthogonal nor normalised, but they do obey ##\vec E_\nu \cdot \vec E^\mu = \delta^\mu_\nu## and will generalise into the coordinate bases for vectors and covectors. The Christoffel symbols are given by how the basis changes with respect to the coordinates (since this is ##\mathbb R^n##, we can compare vectors at different points without problem) ##\vec E^\sigma\cdot \partial_\mu \vec E_\nu = \Gamma^\sigma_{\mu\nu}##, i.e., ##\partial_\mu \vec E_\nu = \Gamma^\sigma_{\mu\nu} \vec E_\sigma##. But at the same time we have
$$
0 = \partial_\mu \vec E^\sigma \cdot \vec E_\nu = \vec E^\sigma \cdot \partial_\mu \vec E_\nu + \vec E_\nu \cdot \partial_\mu \vec E^\sigma = \Gamma^\sigma_{\mu\nu} + \vec E_\nu \cdot \partial_\mu \vec E^\sigma.
$$
It follows that ##\vec E_\nu \cdot \partial_\mu \vec E^\sigma = - \Gamma^\sigma_{\mu\nu}##, or in other terms ##\partial_\mu \vec E^\sigma = -\Gamma^\sigma_{\mu\nu} \vec E^\nu## and there you have your minus sign.

This generalises readily to curved spaces and the absolutely easiest way to see that it must be like that is to apply the covariant derivative to the scalar field ##V_\mu W^\mu## for which it reduces to a partial derivative. At the same time, the covariant derivative should obey the product rule and we would have
$$
\nabla_\sigma(V_\mu W^\mu) = (\nabla_\sigma V_\mu) W^\mu + V_\mu \nabla_\sigma W^\mu = W^\mu \nabla_\sigma V_\mu + V_\mu \partial_\sigma W^\mu + \Gamma^\mu_{\rho\nu} V_\mu W^\rho = V_\mu \partial_\sigma W^\mu + W^\mu \partial_\sigma V_\mu.
$$
The only way of satisfying this for arbitrary vector fields is if
$$
\nabla_\sigma V_\mu = \partial_\sigma V_\mu - \Gamma^\rho_{\mu\nu} V_\rho.
$$
 
  • Like
Likes JorisL
  • #9
unscientific said:
I was reading through hobson and my notes where the covariant acts on contravariant and covariant tensors as
[tex]\nabla_\alpha V^\mu = \partial_\alpha V^\mu + \Gamma^\mu_{\alpha \gamma} V^\gamma[/tex]
[tex]\nabla_\alpha V_\mu = \partial_\alpha V_\mu - \Gamma^\gamma_{\alpha \mu} V_\gamma [/tex]

Why is there a minus sign in the second equation?

I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]
1) You need to be careful with the indices.

2) It is true that [tex]\left( \nabla_{\alpha} V^{\mu}\right) g_{\mu \tau} = \nabla_{\alpha} \left( V^{\mu} g_{\mu \tau} \right) = \nabla_{\alpha} V_{\tau} ,[/tex] because [itex]\nabla_{\alpha} g_{\mu \nu} = 0[/itex].

3) But [tex]\partial_{\alpha} ( V^{\mu} g_{\mu \tau} ) \neq ( \partial_{\alpha} V^{\mu} ) g_{\mu \tau} = \partial_{\alpha} ( V^{\mu} g_{\mu \tau} ) - V^{\mu} \partial_{\alpha} g_{\mu \tau} .[/tex] So, when you contract with [itex]g_{\mu \tau}[/itex], you get [tex]\nabla_{\alpha} V_{\tau} =\partial_{\alpha} V_{\tau} + \left( g_{\mu \tau} \Gamma^{\mu}_{\alpha \gamma} - \partial_{\alpha} g_{\tau \gamma} \right) g^{\gamma \beta} V_{\beta} .[/tex] Now the minus sign show up, because [tex]g^{\gamma \beta} \left( g_{\mu \tau} \Gamma^{\mu}_{\alpha \gamma} - \partial_{\alpha} g_{\tau \gamma} \right) = - \Gamma^{\beta}_{\alpha \tau} .[/tex]
 
  • #10
Ah whoops, Sam's answer is correct. I was too careless in mine. Please disregard. Thanks. :)
 

Related to Covariant Derivative - where does the minus sign come from?

1. What is a covariant derivative?

A covariant derivative is a mathematical tool used in differential geometry and tensor calculus to describe how a vector field changes as it moves along a curved surface or manifold. It takes into account the curvature of the surface and is a generalization of the standard derivative used in calculus.

2. Where does the minus sign come from in the covariant derivative?

The minus sign in the covariant derivative comes from the fact that it takes into account the intrinsic geometry of a curved surface. When a vector is parallel transported along a curved surface, it may not maintain its original direction and will rotate. The minus sign is used to adjust for this rotation and ensure the derivative is a true measure of the vector's change.

3. Why is the minus sign important in the covariant derivative?

The minus sign is important in the covariant derivative because it ensures that the derivative is a true measure of the change of a vector as it moves along a curved surface. Without this adjustment, the derivative would not accurately reflect the changes in direction and magnitude of the vector.

4. Can the minus sign be omitted in the covariant derivative?

No, the minus sign cannot be omitted in the covariant derivative. As mentioned before, it is a crucial factor in adjusting for the rotation of vectors when they are parallel transported along a curved surface. Omitting the minus sign would result in an incorrect derivative that does not accurately reflect the changes of the vector.

5. How is the covariant derivative used in physics?

The covariant derivative is used in physics, specifically in the field of general relativity, to describe the behavior of matter and energy in curved spacetime. It is an essential tool for understanding the effects of gravity and other forces on objects moving through curved space. It is also used in other areas of physics, such as electromagnetism and quantum mechanics, to describe the behavior of fields and particles on curved surfaces.

Back
Top