Alternative definitions of geodesic

In summary: A geodesic is a curve that is parallel to its own tangent vector. This definition requires some preliminary machinery, but it's easy to state and understand at an intuitive level. The main technical issue is that we can have conjugate points. Hawking and Ellis have a good statement of the main result on pp. 110-111.
  • #1
bcrowell
Staff Emeritus
Science Advisor
Insights Author
Gold Member
6,724
431
I teach both physics and math at a community college, and I've volunteered to give a short talk for students at our weekly math colloquium that has to do with curvature and non-curvature singularities in relativity. This is a tall order, given that I can't even assume that all the students will have had calculus. I'm hoping to make it the kind of talk where everyone at least understands some of it. Just to make things harder, the culture of these colloquia is that there should be at least a taste of rigor, e.g., speakers are usually expected to give a full and rigorous statement of something like one central definition and/or one main theorem or conjecture.

The basic concept I'm trying to get across is geodesic incompleteness, and for that purpose I'd like to be able to give a fairly elementary, but rigorous, definition of a geodesic. The standard definition is that it's a curve that parallel-transports its own tangent vector, but this definition requires quite a bit of preliminary machinery. One could leave geodesic as a fundamental notion, as in Euclid, but I think this only really works in contexts like hyperbolic geometry.

A third option might be to talk about geodesics as curves of extremal or stationary metric length. This may not be the best approach if you're trying for full generality (won't work for spacelike or null geodesics), but it might be easy to state and understand for timelike geodesics. It has the advantage that it's intuitively appealing and easy to explain at an intuitive level, and then if the formalized definition is too confusing, people will still have gotten the general idea.

The main technical issue seems to be that we can have conjugate points. Hawking and Ellis have a good statement of the main result on pp. 110-111. Here are some nonrigorous paraphrases of some of the preliminary notions. They define a timelike geodesic ##\gamma(t)## from q to p as maximal if it can't be lengthened by making small variations. Point p and q on a certain geodesic are said to be conjugate if the geodesic can be perturbed infinitesimally to yield another geodesic that also passes through p and q (p. 97). They then have:

Proposition 4.5.8: A timelike geodesic curve ##\gamma(t)## from q to p is maximal if and only if there is no point conjugate to q along ##\gamma(t)## in (q,p).

Now we only expect to have a finite number of conjugate points conjugate to a given point in any finite segment of a geodesic. (H&E state on p. 111 that "conjugate points are isolated." I assume this is what they mean, although they don't say why it has to be so. Presumably this relates to smoothness.) So can I define a timelike geodesic as follows?

A timelike geodesic ##\gamma## is defined as a timelike curve of maximal length, in the following sense. Given points ##a## and ##b## on ##\gamma##, one can always find a set of points ##h_0##,... ##h_n##, also on ##\gamma##, with the following properties:

(1) ##h_0=a## and ##h_n=b##. The points are laid out in order from 0 to n according to the parameter used to define ##\gamma##.

(2) The length of each segment, from ##h_i## to ##h_{i+1}##, is maximal in the sense defined above.

(3) The curve is twice differentiable.

Does this work? I'm thinking that if there are points conjugate to a or b, we can just put h's at them, and also fill in further h's between these, so there will never be two successive h's that are conjugate to each other. Condition 3 is intended to ensure that we can't have a situation where segment ##h_ih_{i+1}## is a geodesic, and so is ##h_{i+1}h_{i+2}##, but ##h_ih_{i+2}## is not, because there's a kink at ##h_{i+1}##.
 
Last edited:
Physics news on Phys.org
  • #2
I think that works (or at least, Synge claims a result, without proof, in his GR book, that justifies it). What he states is that if p and q are made close enough, the geodesic between them, within some neighborhood containing p and q, is unique (the neighborhood is to rule out 'long way around' geodesics in closed spacetimes). Then, if it is a timelike geodesic, it is maximal in the neighborhood (again, for p and q made sufficiently close, the only exception is a long ctc).
 
  • Like
Likes bcrowell
  • #3
bcrowell said:
A timelike geodesic γ\gamma is defined as a timelike curve of minimal length

I assume you mean "maximal"?

bcrowell said:
Does this work?

It looks OK to me. The only question I have is how much detail you expect to have to go into regarding conjugate points, if someone asks why you're doing all this in small segments instead of just looking at the whole curve at once.
 
  • Like
Likes bcrowell
  • #4
The lay definition of geodesic that I find very intuitive is that it's the path you follow if you start off in a certain direction and then just 'follow your nose'. That's a folksy formulation of the 'parallel transport the tangent vector' definition, but I don't think it actually introduces any misconceptions.

A nice historical version of the more general notion of parallel transport is the (mythical?) ancient Chinese 'south-pointing chariot'. I don't think anybody knows whether one was ever built, but the idea was to put mechanisms on the chariot, connected to the wheels, so that it parallel transports the direction that a figure on top of the chariot is pointing, as it rolls along, including turning corners, going up and down hills etc. The chariot notion doesn't deal with the Earth's curvature. I suppose that's because the Ancient Chinese hadn't done enough differential geometry to work out that parallel transporting a vector around a loop on a sphere will make it point in a different direction when you get back to the start.

I think a modern version of that is the gyroscope, of the type used in aeroplanes. If we spin up a gyroscope that points in the direction you want to set off, and then we always follow the direction pointed by the gyroscope, we will be tracing out a geodesic, because the gyroscope's vector is being parallel-transported.

Another easily-graspable definition of a timelike geodesic is as a path with the property that an observer traveling along that path will feel perfectly weightless for the entire trip. ie it is a 'free-fall'. On reflection, this definition is superior to the above because it deals with spacetime geodesics, whereas the above only works for spatial geodesics. That is, the 'follow your nose' definition doesn't specify the spatial speed at which you traverse the path.

A nice example of the impact of curvature, that requires no calculus, is that if you start on the equator at longitude 90, spin up a gyroscope to point due North, travel to the North Pole, then down to the equator at longitude 0 (passing through Greenwich), then back to the starting point, the gyro will be pointing West. This is easily grasped using a pointing finger and moving one's hands around in the air. It's even easier if one has a large sphere, like a beach ball, handy.

The proposed definition in the OP looks like it will work. But I have no proof handy that there will only be a finite number of conjugate points between any two conjugate points. The nice thing about the parallel transport definition is that one can then just present the minimal property as a theorem stating that any curve of minimal length between two points will be a geodesic. There does not have to be a unique minimum, and not every geodesic between a given two points need be minimal.
 
Last edited:
  • #5
bcrowell said:
This is a tall order, given that I can't even assume that all the students will have had calculus.
I can't really say if your definition is correct, but I can tell that most people will have no idea what it means.
I can kind-of understand a pair of conjugate points as two focuses (foci?) of a gravitational lens. Even if I don't quite understand why these would have to be points rather than line segments.
But having a shortest path that has no conjugate points... ugh.
If you can spend 30+ minutes on the idea, it might make sense to some.
 
  • #6
bcrowell said:
The standard definition is that it's a curve that parallel-transports its own tangent vector, but this definition requires quite a bit of preliminary machinery.
I would stick with the standard definition. The preliminary machinery is not too onerous. I think it will be less difficult and more intuitive than the suggested alternative.
 
  • Like
Likes bcrowell
  • #7
The simplest thing I think one can say that doesn't have counterexamples (that I know of, at least) is that if we start out with the assumption there is no torsion - a standard assumption in GR (but not for instance a standard assumption in ECSK gravity), then when two points are "sufficiently close" together, a spatial geodesic is a curve of minimal proper length, while a time-like geodesic is a curve of maximal proper time.

This is already getting fairly complicated, and a mathemetician might possibly further object to the use of "minimum" rather than "greatest lower bound". It also doesn't really address what "sufficiently close" means in a rigorous manner, I believe it means "there exists a local convex neighborhood" around any point in which the statement is true.

That said, I find the basic idea of the curve of shortest length (or longest time) is much more intuitive and much more likely to be understood by a lay audience than a curve that parallel transports itself.

A useful examples of the need for the caveats with the "shortest/longes curve" definition is to consider two towns separated by a very tall hill. For a sufficiently tall hill, it will be shorter to go around the hill than straight over it. However, the longer path "over the hill" is still a geodesic path, even though it's not the shortest.

Good luck with your talk, whatever approach you decide is best.
 
  • #8
I would do parallel transport and tangent vectors graphically. Just show pictures of parallel transport and tangent vectors. People will understand the concept geometrically quite easily even if the symbols are a little intimidating.
 
  • #9
pervect said:
The simplest thing I think one can say that doesn't have counterexamples (that I know of, at least) is that if we start out with the assumption there is no torsion - a standard assumption in GR (but not for instance a standard assumption in ECSK gravity), then when two points are "sufficiently close" together, a spatial geodesic is a curve of minimal proper length, while a time-like geodesic is a curve of maximal proper time.

Actually this isn't true for the spacelike case. Example: In Minkowski space, pick a frame in which two events are simultaneous and lie at distance 1 from one another. Suppose we take the geodesic connecting them and deform it into an isosceles triangle with sides of slope b. If the triangle lies in the tx plane, then the length is ##\sqrt{1-b^2}##. If the triangle lies in the ty plane, then the length is ##\sqrt{1+b^2}##.

pervect said:
A useful examples of the need for the caveats with the "shortest/longes curve" definition is to consider two towns separated by a very tall hill. For a sufficiently tall hill, it will be shorter to go around the hill than straight over it. However, the longer path "over the hill" is still a geodesic path, even though it's not the shortest.

Right. An example I find helpful is the following one on the sphere. Let A, B, and C be three points that form an equilateral triangle on a great circle. Let AB be the shortest path between A and B, and BC the shortest path between B and C. Then ABC is a geodesic, but is not the shortest path between A and C.
 
Last edited:
  • #10
DaleSpam said:
I would do parallel transport and tangent vectors graphically. Just show pictures of parallel transport and tangent vectors. People will understand the concept geometrically quite easily even if the symbols are a little intimidating.

Yeah, that works, but it's sort of like sketching what rigor would look like without actually doing the rigor.
 
  • #11
Oh. I thought that is what you were going for.
 
  • #12
pervect said:
The simplest thing I think one can say that doesn't have counterexamples (that I know of, at least) is that if we start out with the assumption there is no torsion - a standard assumption in GR (but not for instance a standard assumption in ECSK gravity), then when two points are "sufficiently close" together, a spatial geodesic is a curve of minimal proper length, while a time-like geodesic is a curve of maximal proper time.

.
The first part is not true in pseudo-Riemannian manifold. Even for arbitrarily close points connected by a spacelike geodesic, there is a lightlike (non-geodesic) path between that is obviously of 0 interval (this path will have the feature that, in any 1x3 foliation, it will go backwards in time, but it is still a path). Then, a nearby spaceilike path of arbitrarily small proper length (again, it must go back in time in any local foliation). Nutshell: spacelike paths have no extremal properties whatsoever of the 4-manifild if it is Minkowskian. Only timelike paths have extremal properties. I believer Synge's result, that I quoted, is the tightest that can be made. For spacelike geodesic between sufficiently close points, there is a neighborhood uniqueness property, but no extremal property.
 
  • Like
Likes bcrowell
  • #13
bcrowell said:
A timelike geodesic ##\gamma## is defined as a timelike curve of minimal length, in the following sense. Given points ##a## and ##b## on ##\gamma##, one can always find a set of points ##h_0##,... ##h_n##, also on ##\gamma##, with the following properties:

(1) ##h_0=a## and ##h_n=b##. The points are laid out in order from 0 to n according to the parameter used to define ##\gamma##.

(2) The length of each segment, from ##h_i## to ##h_{i+1}##, is minimal in the sense defined above.

(3) The curve is twice differentiable.

Does this work? I'm thinking that if there are points conjugate to a or b, we can just put h's at them, and also fill in further h's between these, so there will never be two successive h's that are conjugate to each other.

PAllen said:
I think that works (or at least, Synge claims a result, without proof, in his GR book, that justifies it). What he states is that if p and q are made close enough, the geodesic between them, within some neighborhood containing p and q, is unique (the neighborhood is to rule out 'long way around' geodesics in closed spacetimes). Then, if it is a timelike geodesic, it is maximal in the neighborhood (again, for p and q made sufficiently close, the only exception is a long ctc).
Despite the discussion having moved away from the 'minimising' definition, I wanted to follow through on this to see what constraints were needed for it to be valid.

John Lee provides formal proofs of the necessary results in his 'Riemannian Manifolds' in Chapter 6, subsection 'Geodsesics and minimizing curves'. Proposition 6.10 states:
John Lee said:
Suppose ##p\in M## [##p## is a Riemannian manifold] and ##q## is contained in a geodesic ball around ##p##. Then (up to reparametrization) the radial geodesic from ##p## to ##q## is the unique minimizing curve from ##p## to ##q## in ##M##.​
[A geodesic ball around ##p## is a neighbourhood of ##p## that is diffeomorphic (say by diffeomorphism ##\phi##) to an open ball in ##\mathbb{R}^n##.
The radial geodesic from ##p## to ##q## is the pre-image under ##\phi## of the geodesic in ##\mathbb{R}^n## from ##\phi(p)## to ##\phi(q)##]

Although Lee's proofs are for Riemannian Manifolds, they do not appear to use any properties that are not also possessed by pseudo-Riemannian manifolds, ie spacetime. Essentially the proofs rely on (earlier proofs of) the existence of a neighbourhood of ##p## with normal coordinates, and the uniqueness of the exponential map. For this application we need to interpret 'minimising' as referring to minimising the curve length over the set of all qualifying timelike curves, and we replace each instance of ##\mathbb{R}^n## by 'Minkowski Space'.

The 'close enough' property to which Synge refers would be the necessity for ##q## to be in a geodesic ball around ##p##. The bijectiveness of the ball's diffeomorphism ##\phi## forbids it from including a point antipodal/conjugate to ##p##. So in particular, ##q## cannot be conjugate to ##p##.

With proposition 6.10 under our belt we can proceed as follows:

Let ##\gamma:I\to M## be a timelike geodesic such that ##\gamma(0)=a,\ \gamma(1)=b##. For each point ##\gamma(t)## on the geodesic there is a collection of diffeomorphisms of neighbourhoods of ##\gamma(t)## to open balls in ##\mathbb{R}^n##. Let ##S(t)## be the subset of those diffeomorphisms for which the image contains the unit open ball in ##\mathbb{R}^n##, and let ##D(t)## be the set of domains of diffeomorphisms in ##S(t)##

Then ##\bigcup_{t\in I}D(t)## is an open covering of the geodesic. Since the geodesic is compact, there exists a finite subcover of the geodesic by geodesic balls. If we choose our ##h_i##s to be the centres of those balls, then the geodesic is the unique differentiable curve that minimises the distance through ##a,b## and all the finite set of ##h_i##s.

EDIT: I realized we'd need to replace the open balls in Minkowski space by rectangles of the form ##\{\vec{x}\ : \ \forall \alpha\in\{0,1,2,3\}\ |x^\alpha-(\phi(p))^\alpha|<\epsilon\}##, because open balls in Minkowski space will contain unlimited expanses of spacelike and lightlike separated points. The geodesic balls in ##M## would become rectangles that are the pre-images under ##\phi## of those rectangles in Minkowski space.
 
Last edited:
  • Like
Likes bcrowell
  • #14
PAllen said:
I believer Synge's result, that I quoted, is the tightest that can be made. For spacelike geodesic between sufficiently close points, there is a neighborhood uniqueness property, but no extremal property.

I think this can be strengthened quite a bit. For points spacelike in relation to one another and sufficiently close together, I think the geodesic between them has vanishing variation. That is, if you move points on the geodesic by epsilon, the length changes by something on the order of epsilon squared. (Of course this is not a rigorous formulation, e.g., you need some machinery to define how you measure epsilon.)
 
  • #15
@andrewkirk: Thanks very much for posting the info from Lee and discussing how it could be generalized to a semi-Riemannian space. That's very helpful. Your post refers to a theorem, but I don't see where you ever state the actual content of the theorem...?

[quoted from Lee:] [A geodesic ball around ##p## is a neighbourhood of ##p## that is diffeomorphic (say by diffeomorphism ##\phi##) to an open ball in ##\mathbb{R}^n##.

andrewkirk said:
The 'close enough' property to which Synge refers would be the necessity for ##q## to be in a geodesic ball around ##p##.

It seems to me that this isn't going to work. As a counterexample, consider a spacetime describing a black hole forming by gravitational collapse. If you look at the Penrose diagram for this type of spacetime, you can see that the whole thing is diffeomorphic to an open ball in ##\mathbb{R}^4## (or in Minkowski space, since diffeomorphism doesn't depend on the metric). So by this definition, the geodesic ball around any point would be the whole spacetime. But it is certainly true that this spacetime contains conjugate points. (In fact, conjugate points occur generically in almost any physically realistic spacetime.) So I think maximal length within a geodesic ball may be a necessary condition for a timelike curve to be a geodesic, but it cannot be a sufficient one.

The definition I proposed in my OP was my attempt to make the "close enough" property rigorous. It is much more restrictive than the one you propose, and therefore I think it's more likely to allow us to state necessary and sufficient conditions for a curve to be a geodesic -- which is what we need for a definition.

andrewkirk said:
EDIT: I realized we'd need to replace the open balls in Minkowski space by rectangles of the form ##\{\vec{x}\ : \ \forall \alpha\in\{0,1,2,3\}\ |x^\alpha-(\phi(p))^\alpha|<\epsilon\}##, because open balls in Minkowski space will contain unlimited expanses of spacelike and lightlike separated points. The geodesic balls in ##M## would become rectangles that are the pre-images under ##\phi## of those rectangles in Minkowski space.

I don't think this matters, because the definition you give of a geodesic ball only depends on diffeomorphism, and a diffeomorphism on Euclidean space is also a diffeomorphism on Minkowski space. But are you sure you're not paraphrasing the definition of a geodesic ball incorrectly? If the definition only depends on diffeomorphism, then I don't see why it would be called a *geodesic* ball, since geodesics are a metrical notion.
 
Last edited:
  • #16
Googling for definitions of a geodesic ball in Riemannian spaces, what I seem to find is that it's the set of all points that you can reach from a given point by traveling along a geodesic for less than some fixed distance.
 
  • #17
bcrowell said:
I think this can be strengthened quite a bit. For points spacelike in relation to one another and sufficiently close together, I think the geodesic between them has vanishing variation. That is, if you move points on the geodesic by epsilon, the length changes by something on the order of epsilon squared. (Of course this is not a rigorous formulation, e.g., you need some machinery to define how you measure epsilon.)
Vanishing variation, yes, but still no extremal property. Pick any neighborhood of p, q, and the spacelike geodesic between them, however small. Then, imagine a zigzag (in time, in e.g. Minkowski space in standard coordinates) light path within the neighborhood, connecting them. Then a spacelike zigzag path can approach arbitrarily close to this. Note, the zig zags can be made arbitrarily small to stay within any tubular neighborhood of the geodesic. Thus I claim:

For any p,q, (however 'close'), and any neighborhood containing p,q and the spacelike geodesic between them (however small and tight to the geodesic), there exists a spacelike (non-geodesic) path with length arbitrarily close to zero between them, that remains in the neighborhood.
 
Last edited:
  • Like
Likes bcrowell
  • #18
PAllen said:
Vanishing variation, yes, but still no extremal property.

Actually, doesn't your zigzag example in #17 prove that I was completely wrong, and there's not even a vanishing variation?
 
  • #19
bcrowell said:
Actually, doesn't your zigzag example in #17 prove that I was completely wrong, and there's not even a vanishing variation?
As I understand it, the variation being zero means that:

Given any function h that is zero at p and q, and continuous, and the one parameter family curves formed by adding ah to the given curve, the rate of change of the integral with 'a' goes to zero as 'a' goes to zero. The generality is that it doesn't matter what h you choose. In general, this variation being zero is only a necessary condition for any kind extremal property, even a local one, for arbitrary integrals. Actions and positive definite metric distances have special properties that allow this condition to be sufficient.

Note that my example does not violate this. You can take h to be a function such that for 'a' = 1, it produces any particular small amplitude zig zag path. Still, as 'a' goes to zero, the variation with 'a' of the proper length will be quadratic in 'a', for a spacelike geodesic (and not for some arbitrary spacelike curve).
 
  • #20
PAllen said:
As I understand it, the variation being zero means that:

Given any function h that is zero at p and q, and continuous, and the one parameter family curves formed by adding ah to the given curve, the rate of change of the integral with 'a' goes to zero as 'a' goes to zero. The generality is that it doesn't matter what h you choose. In general, this variation being zero is only a necessary condition for any kind extremal property, even a local one, for arbitrary integrals. Actions and positive definite metric distances have special properties that allow this condition to be sufficient.

Note that my example does not violate this. You can take h to be a function such that for 'a' = 1, it produces any particular small amplitude zig zag path. Still, as 'a' goes to zero, the variation with 'a' of the proper length will be quadratic in 'a', for a spacelike geodesic (and not for some arbitrary spacelike curve).

Right, that makes sense. Equivalently, this definition requires both the displacement and the derivative of the displacement to be of order a.

But it would be perhaps equally natural to consider variations such that only the displacement has to be of order a, not its derivative. Then spacelike curves are not stationary.
 
  • #21
bcrowell said:
Your post refers to a theorem, but I don't see where you ever state the actual content of the theorem...?
[quoted from Lee:] [A geodesic ball around p is a neighbourhood of p that is diffeomorphic (say by diffeomorphism ϕ) to an open ball in Rn.]​
Sorry, I butchered Lee's definition of a geodesic ball. I left out the crucial detail that the diffeomorphism ##\phi## has to be the exponential map. That's how the metric comes into the picture. The relevant theorem is that every point in a Riemannian manifold has a geodesic ball around it. This is his Lemma 5.10 (Normal Neighbourhood Lemma):
Lee said:
For any ##p\in M## there is a neighbourhood ##\mathscr{V}## of the origin in ##T_pM## and a neighbourhood ##\mathscr{U}## of ##p## in ##M## such that ##exp_p:\mathscr{V}\to\mathscr{U}## is a diffeomorphism.
The geodesic ball around ##p## is ##\mathscr{U}##.
bcrowell said:
It seems to me that this isn't going to work. As a counterexample, consider a spacetime describing a black hole forming by gravitational collapse. If you look at the Penrose diagram for this type of spacetime, you can see that the whole thing is diffeomorphic to an open ball in R4 (or in Minkowski space, since diffeomorphism doesn't depend on the metric). So by this definition, the geodesic ball around any point would be the whole spacetime.
Hmm. Yes I'll have to go away and think about that.
 
Last edited:
  • #22
andrewkirk said:
Hmm. Yes I'll have to go away and think about that.

That example would only apply to the earlier definition of a geodesic ball that you gave, which had the mistake in it.
 
  • #23
A Lorentzian metric neighborhood definition I've used in some proofs I've done is the 'light cone diamond' suggested by Andrewkirk in an earlier post, with the 'size' defined by the longest timelike path contained in it.
 
  • #24
bcrowell said:
Googling for definitions of a geodesic ball in Riemannian spaces, what I seem to find is that it's the set of all points that you can reach from a given point by traveling along a geodesic for less than some fixed distance.
This definition doesn't work for Lorentzian manifolds because of null geodesics. Trying to use an affine parameter will fail to non-uniqueness. If you're willing to define a neighborhood via two points rather than one, the light cone diamond sidesteps these issues. Any such diamond has a maximal timelike path interval (perhaps an LUB). Further, it has a maximal spacelike geodesic length. [These statements may breakdown if your diamond is too big or 'tries' to include a curvature singularity - but don't do that ...]
 
  • #25
I think re-defining the Geodesic Ball for the Lorentzian case may not be a problem. The geo ball has to be diffeomorphic to an open ball in ##T_pM##, which is ##\mathbb{R}^4## with the Lorentzian metric. But I'm pretty sure ##\mathbb{R}^4## has the same topology and Riemannian Connection with the Lorentzian Metric as with the Riemannian Metric, because the Christoffel symbols are zero in both cases. So the same diffeomorphism will work for both cases. Hence we can define the geodesic ball in spacetime as the pre-image of a sufficiently small ball in Riemannian ##\mathbb{R}^4## under that diffeomorphism. That should get rid of the null geodesic problem.

I've been skimming through Lee's proofs of 5.10 and 6.10 mentioned above, and confirmed that he doesn't use the positive definite property of the Riemannian Metric anywhere, so the proofs should work for Lorentzian manifolds as well, as long as the various terms are well-defined. One has to change 'locally minimising' to 'locally extremising' because for spacelike and timelike curves the property will be that the geodesic locally maximises for one of them and locally minimises for the other, since the key to the proof is showing that the curve length is at a local extremum if the curve is a geodesic.

The difficulty that I haven't managed to work out yet is that for Riemannian manifolds the proof shows the geodesic is an extremum over all curves inside the geodesic ball, with the same endpoints, subject only to the curves being piecewise differentiable. But for Lorentzian manifolds we want to impose the additional constraint on the set of curves over which we are extremising that they must either be all spacelike or all timelike.
 
  • #26
andrewkirk said:
I think re-defining the Geodesic Ball for the Lorentzian case may not be a problem. The geo ball has to be diffeomorphic to an open ball in ##T_pM##, which is ##\mathbb{R}^4## with the Lorentzian metric. But I'm pretty sure ##\mathbb{R}^4## has the same topology and Riemannian Connection with the Lorentzian Metric as with the Riemannian Metric, because the Christoffel symbols are zero in both cases. So the same diffeomorphism will work for both cases. Hence we can define the geodesic ball in spacetime as the pre-image of a sufficiently small ball in Riemannian ##\mathbb{R}^4## under that diffeomorphism. That should get rid of the null geodesic problem.
I had a preference for defining a neighborhood without reference to charts, which are what set up the bijection to R4.
andrewkirk said:
I've been skimming through Lee's proofs of 5.10 and 6.10 mentioned above, and confirmed that he doesn't use the positive definite property of the Riemannian Metric anywhere, so the proofs should work for Lorentzian manifolds as well, as long as the various terms are well-defined. One has to change 'locally minimising' to 'locally extremising' because for spacelike and timelike curves the property will be that the geodesic locally maximises for one of them and locally minimises for the other, since the key to the proof is showing that the curve length is at a local extremum if the curve is a geodesic.
I've already established in several posts here, by explicit construction, that there is NO local extremal property at all for spacelike geodesics in a Lorentzin 4-manifold. See esp. my post #17 and following discussion with bcrowell. Synge only claimed a neighborhood uniqueness property for spacelike geodesics, for this reason (NOT an extremal property).

FYI - the whole basis of Synge's discussion was just to establish that his 'Two point World function' that he heavily uses in his book, was well defined for its intended purposes - to analyze geometry by coordinate independent power series with complete generality in small regions (but not just in the limit of a point, as is technically the case for 'local'). For the world function to be well defined, only neighborhood uniqueness is required, not any extremal property. It is unfortunate that he simply stated results as true with no proof. However, given that he was a leading mathematician researching in the area at the time, one presumes he was just avoiding boring physicists with the proofs.
andrewkirk said:
The difficulty that I haven't managed to work out yet is that for Riemannian manifolds the proof shows the geodesic is an extremum over all curves inside the geodesic ball, with the same endpoints, subject only to the curves being piecewise differentiable. But for Lorentzian manifolds we want to impose the additional constraint on the set of curves over which we are extremising that they must either be all spacelike or all timelike.
 
  • #27
PAllen said:
I've already established in several posts here, by explicit construction, that there is NO local extremal property at all for spacelike geodesics in a Lorentzin 4-manifold. See esp. my post #17 and following discussion with bcrowell
Maybe I'm misunderstanding your construction. But if I'm understanding it correctly, can't one make the analogous construction for a timelike path and make the conclusion, which we know to be incorrect, that timelike geodesics are not extremal?

For the sake of concreteness, here's how I am understanding #17.

Spacelike path from (0,0) to (0,4) in Minkowski (t,x) space, of length 4.
Lightlike zigzag traverses points (0,0), (1,1), (2,0), (3,1), (4,0), of length 0.
Spacelike zigzag path traverses points (0,0), (1,0.999), (2,0), (3,0.999), (4,0), of length unknown, but suggested it might be very short.

Is that the sort of thing you had in mind?

For the timelike analog we just switch x and y, and switch spacelike with timelike. The non-zigzag timelike path has length -4.
 
  • #28
Thought about it a bit more. I think both the timelike and spacelike counter-examples work. I think an extra condition on the minimizing property is needed that rules out zigzags. Something like:

The minimum absolute length timelike (spacelike), smooth curve between two timelike (spacelike) separated points is a geodesic.

The extra property is not needed in Riemannian manifolds but I think it is needed for Lorentzian ones.

EDIT: I originally wrote 'differentiable' above, but have now changed it to 'smooth', because I realized that one can make a differentiable curve arbitrarily close to the zigzag curves just by rounding off the corners.
 
Last edited:
  • #29
andrewkirk said:
Maybe I'm misunderstanding your construction. But if I'm understanding it correctly, can't one make the analogous construction for a timelike path and make the conclusion, which we know to be incorrect, that timelike geodesics are not extremal?

For the sake of concreteness, here's how I am understanding #17.

Spacelike path from (0,0) to (0,4) in Minkowski (t,x) space, of length 4.
Lightlike zigzag traverses points (0,0), (1,1), (2,0), (3,1), (4,0), of length 0.
Spacelike zigzag path traverses points (0,0), (1,0.999), (2,0), (3,0.999), (4,0), of length unknown, but suggested it might be very short.

Is that the sort of thing you had in mind?

For the timelike analog we just switch x and y, and switch spacelike with timelike. The non-zigzag timelike path has length -4.
No, it does not work for timelike. For timelike, the extremal property we seek is local maximum. Going 'near light' gives near zero, which is irrelevant to a local maximum. It is trivial to have near zero proper time between any two events connectable by any timelike path - just a tight nearly null, spiral around the given world line.
 
  • #30
andrewkirk said:
Thought about it a bit more. I think both the timelike and spacelike counter-examples work. I think an extra condition on the minimizing property is needed that rules out zigzags. Something like:

The minimum absolute length timelike (spacelike), smooth curve between two timelike (spacelike) separated points is a geodesic.

The extra property is not needed in Riemannian manifolds but I think it is needed for Lorentzian ones.

EDIT: I originally wrote 'differentiable' above, but have now changed it to 'smooth', because I realized that one can make a differentiable curve arbitrarily close to the zigzag curves just by rounding off the corners.
No, the construction is irreleivant for a timelike exstremal property, which is a maximum.

For spacelike, no amount of smoothness will prevent the construction. Spacelike geodesics are minimal within any spacelike 3-submanfold that contains them, using the induced metric from the 4-manifold. But they have no extremal property at all within the 4-manifold.

[note, it reference to the claims in my next post, that if a spacelike 3-submanifold contains a path with length shorter than the geodesic connecting p and q, it follows that it does not contain the geodesic. Thus, an arguably interesting statement that can be made is that a spacelike geodesic between sufficiently close p and q is neighborhood minimal within any spacelike 3-submanifold containing the geodesic, using the induced metric from the 4-mainifold.]
 
Last edited:
  • #31
Here is what is true within some small neighborhood of two sufficiently close points p and q within a Lorentzian 4-manifold:

1) Spacelike paths (not geodesics) connecting p and q may have any length > 0 and < infinity (yes, even in a tiny neighborhood).
2) If any timelike path can connect p and q, there is a maximal timelike path within the neighborhood and it is the unique geodesic.
3) If no timelike or null path can connect p and q, then there is a unique spacelike geodesic defined by stationary variation, but not by any extremal property. [obviously, it could also be defined by parallel transport, but the original spirit of this thread was to explore extremal or at least variational definitions)].
4) Otherwise, p and q are connected by a unique null geodesic. I have never been satisfied with variational definitions of null geodesics, for this case I would always use parallel transport definition.
 
Last edited:
  • #32
@bcrowell have you decided if you are going to go with the "straight line" definition or the "shortest distance between two points" definition?
 
  • #33
DaleSpam said:
@bcrowell have you decided if you are going to go with the "straight line" definition or the "shortest distance between two points" definition?

Shortest distance (maximal time). All I really need for my talk is timelike geodesics, so I think it works. I'll probably give out a handout with some of the technical details on it, so that I don't have to run through all of that in the talk itself. For my purposes, the difficult points that I really want to concentrate on and get at rigorously are the definition of a singularity in terms of geodesic completeness, and the distinction between a curvature singularity and a non-curvature singularity. The definition of a geodesic is just some preliminary apparatus, so I think relegating its details to a handout will be fine.

BTW, thanks, everyone, for a very helpful discussion!
 
  • #34
bcrowell said:
I think this can be strengthened quite a bit. For points spacelike in relation to one another and sufficiently close together, I think the geodesic between them has vanishing variation. That is, if you move points on the geodesic by epsilon, the length changes by something on the order of epsilon squared. (Of course this is not a rigorous formulation, e.g., you need some machinery to define how you measure epsilon.)
PAllen said:
Vanishing variation, yes, but still no extremal property.
I've been thinking about this. I had to brush up on my calculus of variations to do so.

I am doubtful that the variation functional is vanishing, in the sense that is used in calculus of variations to identify a necessary but not sufficient condition for a function being an extremum of a functional.

Assume for simplicity that, in a given coordinate system the two spacetime points differ only in a single spatial coordinate, say x. Then there is a basis for the vector space of paths between the two points, that is made up of basis functions each of which is confined to one of the x-t, x-y or x-z planes. The functions would use x as parameter and give a result (y,z,t). That basis wouldn't span curves that backtrack in the x dimension but for now I'm assuming that doesn't affect the argument. The result of applying the variation of the length functional to these basis functions will be positive for paths in the x-y and y-z planes and negative for paths in the x-t plane. So the variation functional must be nonzero.

In fact, I think that the spacelike geodesic will be the functional equivalent of a saddle point in ordinary calculus.

What we can say (I think) is that the geodesic is a minimum for paths for which the time coordinate is constant (provided the geodesic has constant time parameter. It may not, and if it doesn't then it will be more complex to characterise the set of paths over which it is a minimum), and a maximum for paths for which the y and z coordinates are constant. This is like how for a saddle point in calculus we can say that the surface gives a maximum value of z in the x direction and a minimum value of z in the y direction. For the saddle point, ##\frac{\partial z}{\partial x}=\frac{\partial z}{\partial y}=0;\ \frac{\partial^2 z}{\partial x^2}>0## and ##\frac{\partial^2 z}{\partial y^2}<0##.
 
  • #35
A saddle point has vanishing variation. In fact, a generalized saddle point is all that vanishing variation guarantees, at least in the terminology of my (math, not physics) book on calculus of variations.
 

Similar threads

Replies
1
Views
275
Replies
57
Views
3K
Replies
8
Views
2K
Replies
16
Views
2K
Replies
6
Views
2K
Replies
18
Views
5K
Back
Top