Alternative definitions of geodesic

andrewkirk · Oct 21, 2015

In the terminology of my text (Gelfand and Fomin 'Calculus of Variations'), for the saddle point that I linked the term 'variation' has no meaning, because the saddle point depicted is a function (from ##\mathbb{R}^2## to ##\mathbb{R}##) and 'variation' is a property of a functional, at a function. Based on their definition of 'variation' I would expect - based on the above argument - that the variation of the length functional at the spacelike geodesic is non-vanishing. They define 'vanishing' to mean identically zero. At least I think that's what they mean, but there's some uncertainty because they say '##\delta J_y## is vanishing at ##y##' means that ##\delta J_y[h]=0## 'for all admissible h', and they don't explain what they mean by 'admissible' anywhere that I can find (h is the incremental function added to the function at which the variation is zero.

What text are you using? I have to say that I'm not in love with Gelfand and Fomin, as they keep saying things like 'the functional J[h]' whereas J is the functional and J[h] is a real number that is obtained when one applies the functional to function h. It caused me no end of confusion.

PAllen · Oct 21, 2015

andrewkirk said:

In the terminology of my text (Gelfand and Fomin 'Calculus of Variations'), for the saddle point that I linked the term 'variation' has no meaning, because the saddle point depicted is a function (from ##\mathbb{R}^2## to ##\mathbb{R}##) and 'variation' is a property of a functional, at a function. Based on their definition of 'variation' I would expect - based on the above argument - that the variation of the length functional at the spacelike geodesic is non-vanishing. They define 'vanishing' to mean identically zero. At least I think that's what they mean, but there's some uncertainty because they say '##\delta J_y## is vanishing at ##y##' means that ##\delta J_y[h]=0## 'for all admissible h', and they don't explain what they mean by 'admissible' anywhere that I can find (h is the incremental function added to the function at which the variation is zero.

What text are you using? I have to say that I'm not in love with Gelfand and Fomin, as they keep saying things like 'the functional J[h]' whereas J is the functional and J[h] is a real number that is obtained when one applies the functional to function h. It caused me no end of confusion.

The book I am using is a monograph by Gilbert Ames Bliss. What you describe is similar to part of his presentation, except that he does define admissibility of h. The h functions need only be continuous, meet boundary conditions, and be c2 smooth except for a finite number of points. If, without assuming anything specific about h, the derivative of the integral with respect to the parameter applied to it is zero, it is said the variation is zero. This is equivalent to satisfying the Euler Lagrange equations. Spacelike geodesics clearly satisfy these equations, so they have stationary (zero) variation. However, since they are not locally extremal, they are some type of saddle point.

bcrowell · Oct 22, 2015

andrewkirk said:

Assume for simplicity that, in a given coordinate system the two spacetime points differ only in a single spatial coordinate, say x. Then there is a basis for the vector space of paths between the two points, that is made up of basis functions each of which is confined to one of the x-t, x-y or x-z planes.

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

stevendaryl · Oct 22, 2015

bcrowell said:

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

Well let's suppose that we have a definition of geodesic, "locally geodesic", that applies in a small enough simply-connected, open subset of the manifold. Then wouldn't that extend to arbitrary geodesics by saying that the path is locally geodesic in every sufficiently small simply-connected open subset of the manifold that the geodesic passes through?

andrewkirk · Oct 22, 2015

bcrowell said:

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

Re the coverage by a single chart: only locally. The intended context of the comment is two spacelike separated points inside the same geodesic ball. That fits in with the overall picture via the finite set of points ##h_k## you posed in the OP that break up the full geodesic. If the points are close enough together, each pair can be in a geodesic ball.

The vector space to which I referred is an infinite-dimensional vector space of functions, not a 4D one of spacetime directions. Each element of the vector space denotes a path (not necessarily geodesic) between the two points. We choose a local coordinate system in which the ##x## two points have coordinates ##(a^t,a^x,a^y,a^z)## and ##(a^t,b^x,a^y,a^z)##. Then the vector space mentioned is the set of suitably nice (ie continuous etc) functions ##f## from the real interval ##[a^x,b^x]## to ##\mathbb{R}^3## such that ##f(x')## is the ##t, y, z## coordinates of the point on the path that has ##x## coordinate ##x'##. The functional being considered is ##f\mapsto L(\pi^{-1}\circ g_f([a^x,b^x]))## where

##g_f## is the function ##x\mapsto (x,f(x))##
##\pi## is the coordinate map for the geodesic ball
##L## is the length function for paths in ##M##

I think I have almost convinced myself that the variation of the length functional really is zero at the spacelike geodesic, but I need to think it through more before I can have any confidence that I understand why.

stevendaryl · Oct 23, 2015

andrewkirk said:

I think I have almost convinced myself that the variation of the length functional really is zero at the spacelike geodesic, but I need to think it through more before I can have any confidence that I understand why.

Are you defining geodesic to mean a parametrized path satisfying the geodesic equation (or the definition in terms of parallel transport, which is mathematically equivalent for a connection defined in terms of the metric)? In that case, doesn't the usual Lagrange-Euler equations imply that (spacelike or timelike) geodesics have zero variation?

If we consider a parametrized curve [itex]x^\mu(s)[/itex], then the invariant length of the curve is given by [itex]\int ds \sqrt{g_{\mu \nu} U^\mu U^\nu}[/itex], where [itex]U^\mu = \frac{dx^\mu}{dx^\nu}[/itex].If you treat this like an action integral, with lagrangian [itex]L = \sqrt{g_{\mu \nu} U^\mu U^\nu}[/itex], then the Lagrange-Euler equations for a path with zero variation leads to

[itex]\dfrac{d}{ds} \dfrac{\partial L}{\partial U^\mu} - \dfrac{\partial L}{\partial x^\mu} = 0[/itex]

After reparametrizing to use an affine parameter, this becomes the geodesic equation.

I made the restriction to spacelike or timelike because the derivation of Euler-Lagrange equations breaks down if the path is a null path. That's because the quantity

[itex]\dfrac{\partial L}{\partial U^\mu} = \dfrac{g_{\mu \nu} U^\nu}{L}[/itex]

is undefined whenever [itex]L = 0[/itex], which is always the case for null paths.

bcrowell · Oct 23, 2015

stevendaryl said:

Well let's suppose that we have a definition of geodesic, "locally geodesic", that applies in a small enough simply-connected, open subset of the manifold. Then wouldn't that extend to arbitrary geodesics by saying that the path is locally geodesic in every sufficiently small simply-connected open subset of the manifold that the geodesic passes through?

Yes. As written, that sounds like a much weaker condition that the definition I proposed in #1, but I assume they're actually equivalent.

Allin · Oct 23, 2015

Geodesics is land surveying at the scale of the whole planet.

DrGreg · Oct 23, 2015

Allin said:

Geodesics is land surveying at the scale of the whole planet.

I think you mean "geodesy" or "geodetics". That's not what we are talking about here.

andrewkirk · Oct 23, 2015

stevendaryl said:

the Lagrange-Euler equations for a path with zero variation leads to

[itex]\dfrac{d}{ds} \dfrac{\partial L}{\partial U^\mu} - \dfrac{\partial L}{\partial x^\mu} = 0[/itex]

After reparametrizing to use an affine parameter, this becomes the geodesic equation.

That sounds like a promising approach that ought to work. I tried doing this but, because of the complexity of L, the algebra soon became very ugly and I ran out of paper ( or patience - one or the other).

Maybe I took a wrong turn somewhere. Have you worked it through?

stevendaryl · Oct 23, 2015

andrewkirk said:

That sounds like a promising approach that ought to work. I tried doing this but, because of the complexity of L, the algebra soon became very ugly and I ran out of paper ( or patience - one or the other).

Maybe I took a wrong turn somewhere. Have you worked it through?

It's easy to get into a blind alley, but I don't think this is that bad.

With [itex]L = \sqrt{g_{\mu \nu} U^\mu U^\nu}[/itex],

[itex]\dfrac{\partial}{\partial U^\mu} L = \dfrac{1}{L} g_{\mu \nu} U^\nu[/itex]
[itex]\dfrac{\partial}{\partial x^\mu} L = \dfrac{1}{2L} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu[/itex]

So the Euler-Lagrange equations give:

[itex]\dfrac{1}{L} \dfrac{d}{ds} (g_{\mu \nu} U^\nu) + g_{\mu \nu} U^\nu \dfrac{d}{ds} \dfrac{1}{L} = \dfrac{1}{2L} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu[/itex]

Now, the great simplification comes from assuming [itex]\dfrac{d}{ds} \dfrac{1}{L} = 0[/itex], so [itex]L = [/itex] a constant along the path. With this assumption, the equation simplifies to (multiplying both sides by [itex]L[/itex])

[itex]\dfrac{d}{ds} (g_{\mu \nu} U^\nu) = \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu[/itex]

We expand the left-hand side to get:

[itex](\dfrac{d}{ds} g_{\mu \nu}) U^\nu + g_{\mu \nu} \dfrac{d}{ds} U^\nu = \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu[/itex]

Then you use: [itex]\dfrac{d}{ds} g_{\mu \nu} = (\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) \dfrac{d}{ds} x^{\mu'} = (\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) U^{\mu'} [/itex] (the chain rule for derivatives)

So we now have:
[itex](\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) U^{\mu'} U^\nu + g_{\mu \nu} \dfrac{d}{ds} U^\nu = \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu[/itex]

Rearranging gives:

[itex]g_{\mu \nu} \dfrac{d}{ds} U^\nu = - ( (\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) U^{\mu'} U^\nu - \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu)[/itex]

Finally, get rid of the [itex]g_{\mu \nu}[/itex] from the left-hand side by multiplying by the inverse matrix, [itex]g^{\mu \nu'}[/itex] and summing over [itex]\mu[/itex]. this gives:

[itex]\dfrac{d U^{\nu'}}{ds} = - \frac{1}{2} g^{\mu \nu'} (2 \dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} - \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu}) U^{\mu'} U^\nu[/itex]

So if we define [itex]Q^{\nu'}_{\nu \mu'} = \frac{1}{2} g^{\mu \nu'} (2 \dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} - \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu})[/itex], then this becomes:
[itex]\dfrac{d U^{\nu'}}{ds} = -Q^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu[/itex]

Almost there! The usual connection coefficient is [itex]\Gamma^{\nu'}_{\nu \mu'} = \frac{1}{2} g^{\mu \nu'} (\dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} + \dfrac{\partial g_{\mu \mu'}}{\partial x^{\nu}} - \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu})[/itex], So we can write:

[itex]Q^{\nu'}_{\nu \mu'} = \Gamma^{\nu'}_{\nu \mu'} + \frac{1}{2} g^{\mu \nu'} (\dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} - \dfrac{\partial g_{\mu \mu'}}{\partial x^{\nu}})[/itex]

Notice that the last term on the right is antisymmetric under the exchange [itex]\nu \Rightarrow \mu'[/itex]. On the other hand, [itex]U^{\mu'} U^\nu[/itex] is symmetric under that exchange. The product of an antisymmetric tensor with a symmetric tensor is zero. So:

[itex]Q^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu = \Gamma^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu[/itex]

So the Euler-Lagrange equations boil down to:

[itex]\dfrac{d U^{\nu'}}{ds} = -\Gamma^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu[/itex]

which is the geodesic equation (whew!). Okay, I guess it was pretty bad...

Alternative definitions of geodesic

Similar threads

Hot Threads

Recent Insights