Assumption in the derivation of the Lorentz transformation

In summary: Minkowski spacetime model.Summary:: Justification for the sign-independence of the Lorentz factor in the Lorentz transformationIn summary, the sign-independence of the Lorentz factor in the Lorentz transformation can be justified through the assumption of isotropy of space and the reciprocity theorem, which states that the relative velocities between two inertial frames are equivalent regardless of the direction. This assumption is also supported by experimental evidence. Additionally, this assumption is necessary for the Lorentz transformation to satisfy the requirements of being a group transformation in the special theory of relativity.
  • #1
Kyouran
70
10
TL;DR Summary
Assumption in the derivation of the lorentz transformation
In the special theory of relativity, it seems impossible to derive the lorentz transformation without assuming that the lorentz factor is independent of the sign of the relative velocity. For some reason, I can't get my head around why this assumption is so easily made, as if it's trivial. Can anyone explain why this assumption can be made?
 
Physics news on Phys.org
  • #2
Can you try and articulate why you think the sign of the relative velocity would affect the Lorentz factor?
 
  • #3
It's not that I have a physical reason for it, but it is an assumption and assumptions must be justified. In particular, I am thinking about the following step:

## x' = (x-vt) \gamma(v) ##
## t' = (t-\frac{vx}{c^2}) \gamma(v) ##

Now, to obtain the reverse transformation, one would simply replace v by -v and swap the x and x' and t and t'. Then, substitution of the reverse transformation into this transformation (e.g. first equation) gives:

## x'= x' (1-\frac{v^2}{c^2}) \gamma(v) \gamma(-v) ##

So here's where I'm stuck. I need one more justification to exclude the possibilities that ##\gamma(v) = \frac{1}{1+\frac{v}{c}}## or ##\gamma = \frac{1}{1-\frac{v}{c}}##
 
  • Like
Likes PeroK
  • #4
Kyouran said:
It's not that I have a physical reason for it, but it is an assumption and assumptions must be justified.
Consider two cases:
The mathematical assumption as a postulate
: In mathematics, one can make any assumption as a postulate and determine the logical consequences of that postulate. There is no need to justify it.
The physical "assumption" due to experimental evidence: There is plenty of experimental evidence that the direction of velocity is not important in reality. So the conditions of the Lorentz transformation are satisfied and the transformation can be applied.
 
  • Like
Likes cianfa72 and robphy
  • #5
Trigonometrically, your question amounts to the question of
why the dot-product is independent of the sign of the angle-between them.

So, maybe one should consider the Euclidean analogue to derive
the cosine function, and worry about any needed constraints on the "angle between".
 
  • #6
Kyouran said:
Now, to obtain the reverse transformation, one would simply replace v by -v and swap the x and x' and t and t'. Then, substitution of the reverse transformation into this transformation (e.g. first equation) gives:

## x'= x' (1-\frac{v^2}{c^2}) \gamma(v) \gamma(-v) ##

So here's where I'm stuck. I need one more justification to exclude the possibilities that ##\gamma(v) = \frac{1}{1+\frac{v}{c}}## or ##\gamma = \frac{1}{1-\frac{v}{c}}##

The underlying assumption is that of the isotropy of space, meaning that a priori a relative velocity of ##v## is physically equivalent to a relative velocity of ##-v##.
 
  • Like
Likes jbergman and Dale
  • #7
Note, there are many different derivations of Lorentz transform, many with no room for such ambiguity. One variant considers only one direction of transform, with the requirement a spherical light pulse is spherical in the primed frame, with the same speed of light. You get gamma explicitly, as a result (actually, you get the complete forward Lorentz transform). Then, simply algebraically solving for the unprimed variables given the full forward transformation you have derived, shows the reverse transform is the same except for the sign of v.

For example, this is the derivation given in Peter Bergmann’s 1942 text on relativity.
 
  • Like
Likes bhobba
  • #8
Kyouran said:
Summary:: Assumption in the derivation of the lorentz transformation

In the special theory of relativity, it seems impossible to derive the lorentz transformation without assuming that the lorentz factor is independent of the sign of the relative velocity. For some reason, I can't get my head around why this assumption is so easily made, as if it's trivial. Can anyone explain why this assumption can be made?
Space is isotropic. So a transformation between inertial frames has to be the same under spatial rotation. That means that you can always rotate it to be in the positive direction. In other words, it can only depend on the speed because speed is the isotropic part of velocity.
 
  • Like
  • Love
Likes jbergman and bhobba
  • #9
Kyouran said:
Now, to obtain the reverse transformation, one would simply replace v by -v and swap the x and x' and t and t'. Then, substitution of the reverse transformation into this transformation (e.g. first equation) gives:

## x'= x' (1-\frac{v^2}{c^2}) \gamma(v) \gamma(-v) ##

So here's where I'm stuck. I need one more justification to exclude the possibilities that ##\gamma(v) = \frac{1}{1+\frac{v}{c}}## or ##\gamma = \frac{1}{1-\frac{v}{c}}##

But ##\gamma=\sqrt{\frac{1}{1-(v/c)^2}}##. When you reverse the sign of ##v## to form the inverse transformations, you don't reverse the sign of ##v## in the expression for ##\gamma##.
 
  • #10
@Mister T the op is following a particular derivation where the functional form of ##\gamma(v)## is yet to be determined
 
  • #11
Kyouran said:
It's not that I have a physical reason for it, but it is an assumption and assumptions must be justified. In particular, I am thinking about the following step:

## x' = (x-vt) \gamma(v) ##
## t' = (t-\frac{vx}{c^2}) \gamma(v) ##
Let's consider what the x-axis looks like in the primed frame at ##t' = 0##. Which means that ##t = \frac{vx}{c^2}## and: $$x' = \gamma(v)(x - vt) = \gamma(v)x(1 - \frac{v^2}{c^2})$$Let's assume that ##v > 0## and choose a point ##x > 0##.

Now, if we reverse the velocity so that the primed frame has velocity ##-v##, then the point ##-x## is physically the same as ##x## was previously. Let's use double prime in this case. We know from physical symmetry that: ##x'' = -x'##. It must be the same coordinate as before, with the sign reversed. In this case: ##t'' = 0##, ##t = \frac{vx}{c^2}## and: $$x'' = \gamma(-v)(-x + vt) = -\gamma(-v)x(1 - \frac{v^2}{c^2})$$
And, for ##x'' = -x'## we must have ##\gamma(-v) = \gamma(v)##.
 
  • Like
Likes Kyouran, Abhishek11235, vanhees71 and 1 other person
  • #12
That touches on the "reciprocity theorem", i.e., the apparently self-evident property of inertial reference frames that if ##\Sigma'## moves with constant velocity ##\vec{v}## wrt. ##\Sigma## then ##\Sigma## moves with velocity ##-\vec{v}## relative to ##\Sigma'##.

It depends a bit on the assumptions you take as "postulates". One derivation uses only the special principle of relativity (existence of a global inertial frame of reference), the Euclidicity of space for any inertial observer (implying homogeneity and isotropy of space) and the homogeneity of time for any inertial observer. Additionally it is assumed that the symmetry transformations of the spacetime model form a group. With these assumptions you find out that the reciprocity theorem holds and that the only two spacetime models are either the Galilei-Newton spacetime model (with no "limiting speed") or the Einstein-Minkowski spacetime model of special relativity (with a "limiting speed" which empirically coincides with the speed of light in vacuo in Maxwell's equations).

https://aip.scitation.org/doi/10.1063/1.1665000
 
  • Like
  • Informative
Likes Kyouran and PeroK
  • #13
Kyouran said:
## x' = (x-vt) \gamma(v) ##
...
Now, to obtain the reverse transformation, one would simply replace v by -v and swap the x and x' and t and t'.
From that follows for the spatial distance between two events:
## \Delta x' = ( \Delta x-v \Delta t) \gamma(v) \ \ \ \ \ \ \ \ \ (1)##
## \Delta x = ( \Delta x'+v \Delta t') \gamma(-v) \ \ \ \ \ (2)##

If you define in equation (1) ##L' := \Delta x'## and set ## \Delta t =0##, then you get the length contraction:
##\Delta x = \frac{1}{\gamma(v)} L'##

If you define in equation (2) ##M := \Delta x## and set ## \Delta t' =0##, then you get the length contraction:
##\Delta x' = \frac{1}{\gamma(-v)} M##

For symmetry reasons, both length contraction factors must be equal.

Kyouran said:
Then, substitution of the reverse transformation into this transformation (e.g. first equation) gives:

## x'= x' (1-\frac{v^2}{c^2}) \gamma(v) \gamma(-v) ##

With ## \gamma(v) = \gamma(-v) ## follows:
$$\gamma(v) = \frac{1}{\sqrt{1-v^2/c^2}}$$
 
Last edited:
  • Like
Likes Kyouran and vanhees71
  • #14
PeroK said:
Let's consider what the x-axis looks like in the primed frame at ##t' = 0##. Which means that ##t = \frac{vx}{c^2}## and: $$x' = \gamma(v)(x - vt) = \gamma(v)x(1 - \frac{v^2}{c^2})$$Let's assume that ##v > 0## and choose a point ##x > 0##.

Now, if we reverse the velocity so that the primed frame has velocity ##-v##, then the point ##-x## is physically the same as ##x## was previously.
Ok, I have some problem with that last statement. Just because you change the relative velocity of a frame doesn't mean you change the orientation of its axes. Therefore, I don't get how you can say -x is physically the same as x, since both frames keep the same orientation?
 
  • #15
Kyouran said:
Just because you change the relative velocity of a frame doesn't mean you change the orientation of its axes. Therefore, I don't get how you can say -x is physically the same as x, since both frames keep the same orientation?
All of the answers above boil down to using the fact that the laws of physics are isotropic. This means that if you have some experiment, and then you repeat the experiment with everything rotated by ##\theta## the two experiments are physically the same. Equivalently, you can take one experiment and rotate your axes by ##\theta## and use the same laws of physics.

So if you choose ##\theta=\pi## then you immediately have “-x is physically the same as x”
 
  • Like
Likes cianfa72, Kyouran, PeroK and 1 other person
  • #16
Kyouran said:
Ok, I have some problem with that last statement. Just because you change the relative velocity of a frame doesn't mean you change the orientation of its axes. Therefore, I don't get how you can say -x is physically the same as x, since both frames keep the same orientation?
You could imagine an observer in S' reaching the origin and the point ##x## is a certain distance in front of it. Then, an observer in S'' reaches the origin and the point ##-x## must be the same distance in front of it. These must be physically identical scenarios.

I chose the ##x''## axis to be in the same direction as the others. You don't need to do this, but the orientation of the positive ##x''## axis cannot be physically relevant.

If you want the ##x''## axis in the opposite direction you are going to have to mess around with all the formulas. That can't change the physical equivalence or the validity of my analysis.
 
  • Like
Likes Kyouran
  • #17
PeroK said:
We know from physical symmetry that: ##x'' = -x'##. It must be the same coordinate as before, with the sign reversed. In this case: ##t'' = 0##, ##t = \frac{vx}{c^2}## and: $$x'' = \gamma(-v)(-x + vt) = -\gamma(-v)x(1 - \frac{v^2}{c^2})$$
And, for ##x'' = -x'## we must have ##\gamma(-v) = \gamma(v)##.

Hmm...why is ##t" = 0## not ##t = -\frac{vx}{c^2}##? Srry for being so confused guys :D
 
  • #18
You are standing on the north embankment watching a train moving to the west (your right). I am on the south embankment watching the same train moving to the west (my left). If you and I each use coordinates where the +x direction is to our right then for you the train is moving at +v and for me it is moving at -v. Since we use the same laws of physics to determine the time dilation of the train it is therefore necessary that ##\gamma(v)=\gamma(-v)##
 
  • Like
Likes kith, Nugatory, Kyouran and 1 other person
  • #19
Kyouran said:
Hmm...why is ##t" = 0## not ##t = -\frac{vx}{c^2}##? Srry for being so confused guys :D
Kyouran said:
It's not that I have a physical reason for it, but it is an assumption and assumptions must be justified. In particular, I am thinking about the following step:

## x' = (x-vt) \gamma(v) ##
## t' = (t-\frac{vx}{c^2}) \gamma(v) ##
It might be clearer if we reserve ##v## for the generic velocity there and use ##v = u > 0## and ##v = -u < 0## for our two frames. Then we have:

## x' = (x-ut) \gamma(u) ##
## t' = (t-\frac{ux}{c^2}) \gamma(u) ##

## x'' = (x+ut) \gamma(-u) ##
## t'' = (t+\frac{ux}{c^2}) \gamma(-u) ##

Now, we consider points ##x = x_1 > 0## and ##x = x_2 = -x_1 < 0##:

## x'_1 = (x_1-ut_1) \gamma(u) ##
## t'_1 = (t_1-\frac{ux_1}{c^2}) \gamma(u) ##

## x''_2 = (x_2+ut_2) \gamma(-u) ##
## t''_2 = (t_2+\frac{ux_2}{c^2}) \gamma(-u) ##

At ##t' =0##, we have a relationship between the ##x_1## and ##t_1## coordinates:

##t_1 = \frac{ux_1}{c^2}##

And, at ##t'' = 0##, we have a relationship between the ##x_2## and ##t_2## coordinates.

##t_2 = -\frac{ux_2}{c^2} = +\frac{ux_1}{c^2}##

This gives us:

##x'_1 = (x_1-ut_1) \gamma(u) = x_1(1 - \frac{u^2}{c^2})\gamma(u)##

##x''_2 = (x_2+ut_2) \gamma(-u) = x_2(1 - \frac{u^2}{c^2})\gamma(-u) = -x_1(1 - \frac{u^2}{c^2})\gamma(-u)##

Now, we use the physical equivalence of ##(t_1, x_1)## in the S' frame and ##(t_2, x_2)## in the S'' frame and the chosen orientation of all axes to demand that:

##x''_2 = - x'_1##

And this requires ##\gamma(u) = \gamma(-u)##
 
  • Like
Likes Kyouran and vanhees71
  • #20
PeroK said:
At ##t' =0##, we have a relationship between the ##x_1## and ##t_1## coordinates:

##t_1 = \frac{ux_1}{c^2}##

And, at ##t'' = 0##, we have a relationship between the ##x_2## and ##t_2## coordinates.

##t_2 = -\frac{ux_2}{c^2} = +\frac{ux_1}{c^2}##
This is interesting as the time coordinates have sorted themselves out and are seen to be equal (as they must by a physical symmetry argument). In other words, a clock at ##x_1## measured in the S' frame is equivalent to a clock at ##-x_1## as measured in the S'' frame.
 
  • Like
Likes Kyouran and vanhees71
  • #21
Thanks guys, I think I got it now:smile:
 
  • #22
Kyouran said:
Summary:: Assumption in the derivation of the lorentz transformation

In the special theory of relativity, it seems impossible to derive the lorentz transformation without assuming that the lorentz factor is independent of the sign of the relative velocity. For some reason, I can't get my head around why this assumption is so easily made, as if it's trivial. Can anyone explain why this assumption can be made?
At random, point your finger in some direction. Now, is that direction any better of a random direction than any other? Of course not. Similarly, it does not matter what direction a photon is observed moving relative to the observer - and with that postulate and the one that a photon is always observed moving at the constant speed of light, the Lorentz transformation (and the transformations for General Relativity as well) must be location & orientation independent.
 
  • Like
  • Skeptical
Likes Dale and PeroK
  • #23
swampwiz said:
At random, point your finger in some direction. Now, is that direction any better of a random direction than any other?
I pointed my finger towards my lunch and I can confirm that this direction is indeed better than many other directions.
 
  • Haha
  • Like
Likes strangerep, vanhees71, Dale and 1 other person
  • #24
Dale said:
All of the answers above boil down to using the fact that the laws of physics are isotropic. This means that if you have some experiment, and then you repeat the experiment with everything rotated by ##\theta## the two experiments are physically the same. Equivalently, you can take one experiment and rotate your axes by ##\theta## and use the same laws of physics.

So if you choose ##\theta=\pi## then you immediately have “-x is physically the same as x”
Is the assumption of isotropic supported at relativistic speeds before the experiments like the Michelson-Morley experiment?
 
  • #25
FactChecker said:
Is the assumption of isotropic supported at relativistic speeds before the experiments like the Michelson-Morley experiment?
No previous experiment detected any relativistic anisotropy, but the MMX was the first specifically designed to detect it. However, the isotropy assumption was pretty widely held since it was part of non-relativistic mechanics and Galilean invariance.
 
  • Like
Likes swampwiz, vanhees71 and FactChecker
  • #26
Dale said:
No previous experiment detected any relativistic anisotropy, but the MMX was the first specifically designed to detect it. However, the isotropy assumption was pretty widely held since it was part of non-relativistic mechanics and Galilean invariance.
Actually, I don't think the aether theory was considered to imply anisotropy. It was considered to be a physically preferred 'frame' in the same way as air on earth. What its frame was would be considered an accident of initial condition (in modern parlance, a symmetry breaking in the origin of the universe). But there would be no difference between moving left, right, north, south etc. relative to the aether, nor would homogeneity be violated - different positions in the aether would be equivalent.
 
  • Like
Likes bhobba and vanhees71
  • #27
PAllen said:
Actually, I don't think the aether theory was considered to imply anisotropy. It was considered to be a physically preferred 'frame' in the same way as air on earth. What its frame was would be considered an accident of initial condition (in modern parlance, a symmetry breaking in the origin of the universe). But there would be no difference between moving left, right, north, south etc. relative to the aether, nor would homogeneity be violated - different positions in the aether would be equivalent.
Yes, I agree. It would be considered a practical sort of anisotropy rather than a fundamental one.
 
  • Like
Likes bhobba and vanhees71
  • #29
Dale said:
Yes, I agree. It would be considered a practical sort of anisotropy rather than a fundamental one.

Yes. But in any other frame moving relative to it at a constant velocity, you have an aether wind that breaks isotropy. That's why if you do classical mechanics using the beautiful treatment in Landau Mechanics, while not disproving an aether, you feel like Einstein did when told of Eddingtons results. The best version of that story I know was his assistant was given the results first. Excitedly they showed it to Einstein, who glanced at it then carried on with what he was doing. The assistant was shocked and asked him why he was so calm. Of course, the reply was famous. 'I would feel sorry for the good Lord; the theory is correct.

Thanks
Bill
 
  • Like
Likes vanhees71
  • #30
bhobba said:
That is why I always fall back on the derivation where symmetries are explicitly assumed (I know I have posted it many times before):
http://www2.physics.umd.edu/~yakovenk/teaching/Lorentz.pdf

Thanks
Bill
One has only to add, why the "Euclidean case", ##a>0## is excluded. That's, because we want to establish a causality structure with our spacetime model, i.e., there must be some temporal order for causally connected "events". That implies that the physical symmetry group should be restricted to an "orthochronous subgroup". Now for the Euclidean case for the here considered (1+1)-dimensional spacetime the transformation group is O(2), i.e., the transformation matrix is of the form
$$\hat{T}=\begin{pmatrix} \cos \alpha & \sin \alpha \\ -\sin \alpha & \cos \alpha \end{pmatrix}.$$
The restriction to orthochronous transformations, i.e., ##T_{00}>0## restricts ##\alpha## to the interval ##[-\pi/2,\pi/2]##, but with this restriction the transformation matrices do not form a group, which is why this possibility is ruled out, while ##a<0## leads to the Lorentz group and special-relativistic spacetime (Einstein-Minkowski spacetime) while ##a \rightarrow \infty## leads to the Galilei group and Galilei-Newton spacetime.
 
  • Informative
Likes bhobba
  • #31
PAllen said:
Actually, I don't think the aether theory was considered to imply anisotropy. It was considered to be a physically preferred 'frame' in the same way as air on earth. What its frame was would be considered an accident of initial condition (in modern parlance, a symmetry breaking in the origin of the universe). But there would be no difference between moving left, right, north, south etc. relative to the aether, nor would homogeneity be violated - different positions in the aether would be equivalent.
So you're saying that the idea is that the æther existed, just that no one really knew where the origin or how the axes were oriented?
 
  • #32
swampwiz said:
So you're saying that the idea is that the æther existed, just that no one really knew where the origin or how the axes were oriented?
No, that's the Lorentz aeither theory. The one before the Michelson Morley experiment, there was the expectation that there would be anisotropy in observed two way speed of light, but this was not considered fundamental, any more than such an observation for sound in air along two directions on a platform moving in air would.
 
  • Like
Likes bhobba
  • #33
Well, there was another episode around the Michelson-Morley (MM) experiment in the early 1920ies. Dayton and Miller repeated the MM experiment in 1925 at Mount Wilson with the result of having seen an "aether wind". The rationale behind this was the idea that the null result of the MM is not because of relativity but because
there is an "aether drag" close to the Earth becoming weaker higher up. Of course most of the physicists where very skeptical because of all the confirmations of special (and also some of general) relativity, but there were also some prominent proponents of the aether hypothesis. Thus the measurements where repeated again, some financed by the "Notgemeinschaft der Deutschen Wissenschaft", which shows how important the issue has been considered in those days, with the result that the Dayton and Miller experiment must have been flawed somehow, because indeed the original null result by MM has been confirmed.
 
  • Like
Likes bhobba
  • #34
vanhees71 said:
Dayton and Miller
One person. Dayton is hist first name. Not as confusing as Hanbury-Brown. But close.

The real reason this never went anywhere is that they were neither confirmed nor "prefirmed". (Past experiments disagreed) At the time, statistics was still being developed, and it wasn't really appreciated that millions of data point was only part of the story.

Tom Roberts re-analyzed the Miller experiment from a modern perspective here: https://arxiv.org/abs/physics/0608238 I dislike re-analysis papers in general, but this one is very good.
 
  • Informative
Likes bhobba and vanhees71

FAQ: Assumption in the derivation of the Lorentz transformation

What is the Lorentz transformation?

The Lorentz transformation is a mathematical equation that describes the relationship between space and time in special relativity. It was developed by Hendrik Lorentz and is a fundamental concept in Einstein's theory of relativity.

What are the assumptions made in the derivation of the Lorentz transformation?

The main assumptions made in the derivation of the Lorentz transformation are that the laws of physics are the same for all inertial reference frames and that the speed of light is constant in all inertial frames.

Why is the assumption of a constant speed of light important in the derivation of the Lorentz transformation?

The assumption of a constant speed of light is important because it leads to the concept of time dilation and length contraction, which are fundamental principles in special relativity. Without this assumption, the Lorentz transformation would not accurately describe the relationship between space and time in different inertial frames.

Are there any other assumptions that are necessary for the Lorentz transformation to hold?

Yes, there are a few other assumptions that are necessary for the Lorentz transformation to hold. These include the assumption of homogeneity and isotropy of space, as well as the principle of relativity, which states that the laws of physics are the same for all observers in uniform motion.

How does the Lorentz transformation affect our understanding of space and time?

The Lorentz transformation has significantly changed our understanding of space and time. It has shown that space and time are relative and that they are intertwined in a four-dimensional spacetime. It has also led to the concept of time dilation and length contraction, which have been experimentally verified and have important implications in fields such as particle physics and cosmology.

Similar threads

Replies
5
Views
2K
Replies
5
Views
1K
Replies
33
Views
2K
Replies
28
Views
3K
Replies
22
Views
1K
Replies
8
Views
1K
Replies
14
Views
3K
Back
Top