Could SR not be built from only one postulate?

  • Thread starter guitarphysics
  • Start date
  • Tags
    Sr
In summary, the two postulates of SR, according to Schutz, state that no experiment can measure the absolute velocity of an observer and that the speed of light is constant for any unaccelerated observer. While some formulations of SR drop the second postulate, it is still necessary to acknowledge that Maxwell's equations are a law of physics in order to derive all the facts given by the original two postulates.
  • #36
PAllen said:
If you include as the laws of physics Newtonian mechanics + Maxwell, you have a self contradictory system.
...or Newtonian mechanics is wrong, and not a law of physics. That is why it is better to say something like 'all experimental results'.
 
Physics news on Phys.org
  • #37
PAllen said:
For (2), the result you get depends on what laws of physics you include. If you use Newtonian machanics, you get the Galilean transform from these assumptions...

I'm trying to elaborate on Rindler's approach (I assume it is valid and recognized) as presented by strangerep in #32 according to which it seems possible to derive the Lorentz transformation of SR, referring neither to the invariance of the speed of light nor to Maxwell's equations, from constraints applying to the transformation between inertial frames of reference, in particular the need for transformations to “preserve inertial motion (i.e., that zero acceleration is mapped to zero acceleration), and that they form a Lie group”. I find this approach interesting and the purpose of my contribution is to challenge my own understanding of Rindler's approach. Actually I assume it entails as well that finite accelerations map onto finite (non-zero) accelerations, hence the first postulate I suggest:

First postulate: the difference between the inertial and non-inertial states of motion is absolute, so that any "7-dimensional abstract phase space" which matches the boundary between both classes of objects (i.e. which correctly assigns its inertial or non-inertial state of motion to any object) can be said providing an “inertial frame of reference”.

The advantage I see is that this postulate caters for a definition of an “inertial frame of reference” which avoids the circularities of usual definitions: on the one hand they refer to Newton's first law which refers to the absence of forces which refers to inertial motion (first circularity), and on the other hand they refer to a law of physics (Newton's first law) in order to define the framework into which laws of physics should be formulated (second circularity).

However your comment shows that my second postulate is inappropriate. So let me replace it with something less ambiguous, as follows:

Second postulate: the difference between the rest state and an inertial state of motion is arbitrary, so that all inertial frames of reference (as defined above) are physically equivalent. Therefore the laws of physics (i.e. the internal rules belonging to a physics theory) which hold in a given inertial frame of reference should also be valid in any other inertial frame of reference. This in turn implies that their mathematical formulation shall remain invariant through the transformation between inertial frames of reference.

Hopefully the above is only a rewording of the constraints presented in #32. Assuming this can lead to the Lorentz transformation (as per my limited understanding of Rindler's approach), I doubt it can equally lead to the galilean transformation unless an additional constraint is imposed whereby the time coordinate remains invariant through the transformation.
 
  • #38
Sugdub said:
I'm trying to elaborate on Rindler's approach (I assume it is valid and recognized) as presented by strangerep in #32 according to which it seems possible to derive the Lorentz transformation of SR, referring neither to the invariance of the speed of light nor to Maxwell's equations, from constraints applying to the transformation between inertial frames of reference, in particular the need for transformations to “preserve inertial motion (i.e., that zero acceleration is mapped to zero acceleration), and that they form a Lie group”. I find this approach interesting and the purpose of my contribution is to challenge my own understanding of Rindler's approach. Actually I assume it entails as well that finite accelerations map onto finite (non-zero) accelerations, hence the first postulate I suggest:

First postulate: the difference between the inertial and non-inertial states of motion is absolute, so that any "7-dimensional abstract phase space" which matches the boundary between both classes of objects (i.e. which correctly assigns its inertial or non-inertial state of motion to any object) can be said providing an “inertial frame of reference”.

The advantage I see is that this postulate caters for a definition of an “inertial frame of reference” which avoids the circularities of usual definitions: on the one hand they refer to Newton's first law which refers to the absence of forces which refers to inertial motion (first circularity), and on the other hand they refer to a law of physics (Newton's first law) in order to define the framework into which laws of physics should be formulated (second circularity).

However your comment shows that my second postulate is inappropriate. So let me replace it with something less ambiguous, as follows:

Second postulate: the difference between the rest state and an inertial state of motion is arbitrary, so that all inertial frames of reference (as defined above) are physically equivalent. Therefore the laws of physics (i.e. the internal rules belonging to a physics theory) which hold in a given inertial frame of reference should also be valid in any other inertial frame of reference. This in turn implies that their mathematical formulation shall remain invariant through the transformation between inertial frames of reference.

Hopefully the above is only a rewording of the constraints presented in #32. Assuming this can lead to the Lorentz transformation (as per my limited understanding of Rindler's approach), I doubt it can equally lead to the galilean transformation unless an additional constraint is imposed whereby the time coordinate remains invariant through the transformation.

No, the reasoning in #32 only establishes the either Galilean transform or Lorentz transform with some invariant speed TBD are the only possibilities. It is then either choice of what you consider to be laws (if you approach this axiomatically) or what experimental data you have that selects between Galilean and Lorentz, and also determines what the invariant speed is.

[edit: I think you fail to grasp the last sentence of #32:

"The possibility of an invariant limiting speed constant, and an invariant length constant, emerge as derived consequences. Only their values need be determined experimentally."

This says, based on the POR and the additional plausible assumptions and much math, you get some invariant speed as a possibility. The other possibility is no invariant speed = Galilean relativity.]
 
Last edited:
  • #39
Sugdub said:
1) the difference between the inertial and non-inertial states of motion is absolute, so that any "7-dimensional abstract phase space" which matches the boundary between both classes of objects (i.e. which correctly assigns its inertial or non-inertial state of motion to any object) can be said providing an “inertial frame of reference”;
That doesn't really make sense.

For present purposes, one only needs the physical input that an observer can distinguish between his own accelerated/unaccelerated state -- hence that an equation like $$\frac{d^2 x}{dt^2} ~=~ 0$$is applicable to all unaccelerated observers. One then seeks the maximal set of symmetries of that equation.

If one admits non-zero acceleration, then one must begin with a much larger abstract phase space. E.g., for constant acceleration, the phase must be enlarged to 10 dimensions, and the group of possible transformations is much larger than fractional-linear -- since one is now preserving an equation of motion like $$\frac{d^3 x}{dt^3} ~=~ 0 ~.$$The maximal symmetry group of this equation has been (partially) investigated, but I'm not sure if anything physically-useful has come out of that.

For non-constant accelerations, the phase space becomes ever larger, maybe even infinite-dimensional for truly arbitrary acceleration.

To understand this setting properly, one needs to know about continuous symmetries of differential equations. There's a significant body of theory about this -- I found the textbook of Stephani [1] quite useful. (The case of inertial motion symmetries is actually an exercise in one of his early chapters: the generators derived in that exercise correspond to fractional-linear transformations when integrated).

I'm trying to elaborate on Rindler's approach (I assume it is valid and recognized) as presented by strangerep in #32 according to which it seems possible to derive the Lorentz transformation of SR, referring neither to the invariance of the speed of light nor to Maxwell's equations, from constraints applying to the transformation between inertial frames of reference, in particular the need for transformations to “preserve inertial motion (i.e., that zero acceleration is mapped to zero acceleration), and that they form a Lie group”.
Note that I only quoted Rindler insofar as his 1st postulate and definition of inertial frame. Although he does perform a group-theoretic derivation of the Lorentz transformation, he adds an extra postulate about homogeneity, which restricts fractional-linear transformations down to linear transformations. The more general case is discussed in the other references I gave.

Assuming [#32] can lead to the Lorentz transformation (as per my limited understanding of Rindler's approach), I doubt it can equally lead to the galilean transformation unless an additional constraint is imposed whereby the time coordinate remains invariant through the transformation.
Actually, all these possibilities are contained in the most general (fractional-linear) approach. In the limit where spatial distances are small compared to the universal length constant, one recovers the Poincare transformations, i.e., standard special relativity. Then, in the limit where relative speed is much smaller than ##c##, one recovers Galilean transformations.

---------------
References:

[1] H. Stephani,
Differential Equations -- Their solution using symmetries,
Cambridge University Press, 1989C, ISBN 0-521-36689-5.
 
Last edited:
  • #40
Meir Achuz said:
...or Newtonian mechanics is wrong, and not a law of physics. That is why it is better to say something like 'all experimental results'.

I prefer to state specifically that Maxwell's equations are a "law of physics", or to assume that Poincare symmetry is a symmetry of the "laws of physics". If one says all experimental results (to date), that includes GR with accelerated expansion of the universe, in which case SR does not hold.
 
  • #41
atyy said:
If one says all experimental results (to date), that includes GR with accelerated expansion of the universe, in which case SR does not hold.
That is excluded by the word "constant in the postulate:
"any two observers moving at constant speed and direction with respect to one another will obtain the same results for all experiments"
If gravity were included the phrase "at the same location" would have to be included, but I think that is implied in the postulate, which gives only relative velocity in the difference beteen the two observers.
 
  • #42
Meir Achuz said:
...or Newtonian mechanics is wrong, and not a law of physics.
All theories are wrong. Some are just less wrong than others. So if a statement that's part of a "wrong" theory of physics can't be a law of physics, there are no laws of physics.
 
  • #43
strangerep said:
That doesn't really make sense.

For present purposes, one only needs the physical input that an observer can distinguish between his own accelerated/unaccelerated state ... One then seeks the maximal set of symmetries of that equation. ...If one admits non-zero acceleration, then one must begin with a much larger abstract phase space.

Thanks for your answer. I have the strong feeling that your rejection of my first postulate is due to a misunderstanding since it deals with physical objects, not with observers.

Although I agree with the statements quoted above, I'm trying to eliminate any direct reference to “observers” performing measurements or experiments. Yes, only an “observer” who feels unaccelerated can imagine being attached to an inertial frame of reference and the transformation between inertial frames of reference will map zero-accelerated observers to other zero-accelerated observers. But assuming one of these non-accelerated observers observes a non-zero-accelerated object, the said transformation between inertial frames of reference will map this non-zero-accelerated object onto a non-zero-accelerated object: if an object is accelerated when represented in one inertial frame of reference, it must have an accelerated motion in any other inertial frame of reference, irrespective of the presence of any “observer”. This is what I tried to express in my first postulate which deals with objects and not with observers. Therefore there is no need to involve complex maths.


strangerep said:
Note that I only quoted Rindler insofar as his 1st postulate and definition of inertial frame. Although he does perform a group-theoretic derivation of the Lorentz transformation, he adds an extra postulate about homogeneity, which restricts fractional-linear transformations down to linear transformations...

Yes, homogeneity and isotropy of space, as well as homogeneity of time merely reflect the absence of good reasons to inject asymmetries in our representations of space and time. Any alternative would require a justification counteracting the empirical evidence. The two postulates I formulated are very general in their nature: since we do not sense any difference between velocity and rest, it would be irrational to assume a priori an absolute difference between uniform motion and rest. Conversely, our sense of acceleration suggests the opposite a priori assumption about the difference between inertial and non-inertial state of motion.

It is noticeable that the second postulate proposed by Einstein in 1905 about the invariance of the speed of light is of a less general nature, so that the set of postulates and assumptions from which he derived the Lorentz transformation lacks homogeneity. However the main effect of his second postulate is to inject a dependency between space and time quantities, a parameter (c) homogeneous to a velocity which is left invariant through changes of the inertial frame of reference: this imposes correlated changes in the transformation between space (x coordinate) and time physical quantities. Therefore the most general transformation compatible with all constraints cannot be squeezed down to a mere transformation of space coordinates. De facto it deals with space-time and this rules out the galilean transformation.

No doubt, injecting Maxwell's equations as a constraint has the same effect: the transformation will induce correlated changes in space (x coordinate) and time physical quantities and this also rules out the galilean transformation. However, injecting Maxwell's equations leads to the same pattern as Einstein's second postulate insofar the set of conditions which leads to the Lorentz transformation lacks homogeneity: Maxwell's equations relate to a specific range of phenomena whereas the postulate on relativity of motion and the homogeneity / isotropy symmetries encompass all phenomena.

On the other hand, one may decide to inject laws of the Newtonian mechanics as a constraint, which are incompatible with the perspective of a parameter homogeneous to a velocity remaining invariant. This would also lack homogeneity but more importantly it rules out the perspective of an invariant correlation between changes in space (x coordinate) and changes in time. The most general transformation between inertial frames of reference falls down to a mere transformation of space coordinates, not of space-time. This will lead to the galilean transformation.


strangerep said:
Actually, all these possibilities are contained in the most general (fractional-linear) approach. ..

As a conclusion, I think the above shows that:

i) the two postulates I proposed, complemented with space and time symmetries established empirically lead to either the Lorentz transformation or the galilean transformation, which are exclusive;

ii) the Lorentz transformation which embraces space-time is more general than the galilean transformation insofar the former reduces to the latter proviso the addition of one constraint, e.g. imposing that time is not affected by the transformation between inertial frames of reference, or imposing that c is infinite, or imposing that simultaneity at a distance makes sense, etc... in which case the Lorentz transformation is ruled out, leaving the galilean transformation as the only possible outcome.

Overall, I believe that the Lorentz transformation is the most general solution that can be derived from the two postulates I proposed, maximising the impact of empirical symmetries without injecting any additional constraint. This should not come as a surprise since the universality of time is somehow a hidden postulate of the Newtonian mechanics.
 
  • #44
Sugdub said:
[...] I'm trying to eliminate any direct reference to “observers” performing measurements or experiments.
Then you're not doing physics. A frame of reference is an abstract construction of a particular observer.

[...] a parameter (c) homogeneous to a velocity [...]
That phrase doesn't make sense. I guess you mean "a parameter (c) with dimensions of speed".

You also seem to have used the word "homogeneous" in a distorted way elsewhere in your post. Have you actually studied the Rindler textbook reference I gave? He explains it reasonable detail. (I sense that you are uncomfortable with math, but without math it's all just hand-waving.)
 
  • #45
strangerep said:
Then you're not doing physics. A frame of reference is an abstract construction of a particular observer.

I can't see any way to adapt my “first postulate” and the subsequent definition of an “inertial frame of reference” in order to replace “physical objects” with “observers”. However nothing prevents attaching an hypothetical “observer” to each inertial frame of reference as I defined it, but what he/she will actually “observe” will be “distorted” by the Doppler effect over the signals transporting the information about remote events. It depends on what you wish to represent... But you're right, I'm not a physicist.

strangerep said:
That phrase doesn't make sense. I guess you mean "a parameter (c) with dimensions of speed".

Yes, my command of English is rather limited. Does my statement make sense once properly worded?

strangerep said:
You also seem to have used the word "homogeneous" in a distorted way elsewhere in your post.

I meant that some assumptions whereby Maxwell's equations are valid, or whereby Newton's first law is valid, are less general than other assumptions like the principle of relativity of motion or the isotropy and homogeneity of space. May be non- “homogeneous” was inappropriate to qualify the association of different categories ... But my main reservation in respect to such assumptions is that they appear to bring circularity: they invoke some specific “laws of physics” in order to set / define the mathematical framework into which “laws of physics” should be stated.

strangerep said:
Have you actually studied the Rindler textbook reference I gave? He explains it reasonable detail. (I sense that you are uncomfortable with math, but without math it's all just hand-waving.)

No, I'm quite convinced that the Rindler textbook will fly far above my head, for both maths and physics. But I found your presentation interesting, because it seems to indicate that the above “circular” references can be overcome, the only one which remains according to your post #32 being the definition of the “Einsteinian inertial frame” which still refers to Newton's first law, so that I suggested a different approach for that specific definition through a “first postulate” leading to a new definition of an inertial frame of reference. That's all, I'm afraid.
 
  • #46
Sugdub said:
I meant that some assumptions whereby Maxwell's equations are valid, or whereby Newton's first law is valid, are less general than other assumptions like the principle of relativity of motion or the isotropy and homogeneity of space.

[...] my main reservation in respect to such assumptions is that they appear to bring circularity: they invoke some specific “laws of physics” in order to set / define the mathematical framework into which “laws of physics” should be stated.

I found your presentation interesting, because it seems to indicate that the above “circular” references can be overcome, the only one which remains according to your post #32 being the definition of the “Einsteinian inertial frame” which still refers to Newton's first law,
Yes, in the most general treatment, one need not appeal to other laws of physics to restrict the possibilities. A remarkable amount can be derived just from the concept of preservation of "zero acceleration".

BTW, the Rindler definition of "Einsteinian inertial frame" I quoted previously mentions "universal time". This is sufficient for ordinary special relativity, but in fact this is relaxed even further in the more general fractional--linear approach. Instead, one works with a weaker assumption: that spacetime (in the absence of gravitation) seems the same to inertial observers anywhere and anywhen.

[...] I'm quite convinced that the Rindler textbook will fly far above my head, for both maths and physics.
Well, I would encourage you to at least look at some of his early chapters before adopting such a defeatist attitude.
 
  • #47
strangerep said:
Yes, in the most general treatment, one need not appeal to other laws of physics to restrict the possibilities. A remarkable amount can be derived just from the concept of preservation of "zero acceleration".

BTW, the Rindler definition of "Einsteinian inertial frame" I quoted previously mentions "universal time". This is sufficient for ordinary special relativity, but in fact this is relaxed even further in the more general fractional--linear approach. Instead, one works with a weaker assumption: that spacetime (in the absence of gravitation) seems the same to inertial observers anywhere and anywhen...

As long as the definition of an "Einsteinian inertial frame" refers to “inertial observers”, one still needs to provide an acceptable definition for the word “inertial”. The issue at stake is whether this can be done without invoking “laws of physics” such as "Newton's first law" or circular definitions like the "absence of forces".
 
  • #48
I'm still at a complete loss as to why we need to talk about the "laws of physics" or any of the other verbiage.

An inertial frame is simply one which has both zero 4-rotation and 4-acceleration; every excellent textbook on relativity I have ever seen defines it in this way, in terms of kinematical concepts. What's the issue?
 
  • #49
WannabeNewton said:
I'm still at a complete loss as to why we need to talk about the "laws of physics" or any of the other verbiage.

An inertial frame is simply one which has both zero 4-rotation and 4-acceleration; every excellent textbook on relativity I have ever seen defines it in this way, in terms of kinematical concepts. What's the issue?

Couldn't it be possible that spacetime is Lorentzian and flat, but the laws of physics do not have Poincare symmetry?
 
  • #50
WannabeNewton said:
I'm still at a complete loss as to why we need to talk about the "laws of physics" or any of the other verbiage.

An inertial frame is simply one which has both zero 4-rotation and 4-acceleration; every excellent textbook on relativity I have ever seen defines it in this way, in terms of kinematical concepts. What's the issue?

Because the topic is to derive SR, not assume it. The concept of 4-rotation and 4-acceleration presumes it. A physical definition of inertial frame needs some further physical assumption or experimental finding to select between Galilean spacetime and Minkowski spacetime.
 
  • #51
@atyy and PAllen, the distinction between Galilean space-time and Minkowski space-time is the latter presumes a space-like foliation with separate Euclidean temporal and spatial metrics whereas the latter contains no such foliation and presumes the Minkowski metric. From here one can demand that equations (or "the laws of physics") be Lorentz (or Poincare) covariant and get SR. Why are inertial frames required for any of this? Equations will be Lorentz covariant as long as they are written in terms of spinor or tensor representations of the Lorentz group which is a completely frame-independent condition. After the dust settles we can simply define an inertial frame in terms of zero rotation and acceleration. I honestly don't see any need to talk about inertial frames before the dust settles.
 
  • #52
WannabeNewton said:
@atyy and PAllen, the distinction between Galilean space-time and Minkowski space-time is the latter presumes a space-like foliation with separate Euclidean temporal and spatial metrics whereas the latter contains no such foliation and presumes the Minkowski metric. From here one can demand that equations (or "the laws of physics") be Lorentz (or Poincare) covariant and get SR. Why are inertial frames required for any of this? Equations will be Lorentz covariant as long as they are written in terms of spinor or tensor representations of the Lorentz group which is a completely frame-independent condition. After the dust settles we can simply define an inertial frame in terms of zero rotation and acceleration. I honestly don't see any need to talk about inertial frames before the dust settles.

The discussion was do you need anything other than a physical principle of relativity+homgeneity+isotropy, to get SR. To apply POR, you need a non-circular physical definition of inertial frames. Then you need something physical (experiment or law) to select SR vs. GR (Galilean relativity, not General relativity).
 
Last edited:
  • #53
WannabeNewton said:
@atyy and PAllen, the distinction between Galilean space-time and Minkowski space-time is the latter presumes a space-like foliation with separate Euclidean temporal and spatial metrics whereas the latter contains no such foliation and presumes the Minkowski metric. From here one can demand that equations (or "the laws of physics") be Lorentz (or Poincare) covariant and get SR. Why are inertial frames required for any of this? Equations will be Lorentz covariant as long as they are written in terms of spinor or tensor representations of the Lorentz group which is a completely frame-independent condition. After the dust settles we can simply define an inertial frame in terms of zero rotation and acceleration. I honestly don't see any need to talk about inertial frames before the dust settles.

Yes, for defining SR, that's the modern way. But the old way using the Principle of Relativity and the speed of light still works.

Incidentally, were you actually commenting on Sugdub's question whether an inertial frame can be determined without reference to the laws of physics, assuming SR is true? In theory, yes. In practice, no, since one has to build some instruments to measure acceleration, rotation etc. And in calibrating them, the laws of physics will be used.
 
  • #54
WannabeNewton said:
I'm still at a complete loss as to why we need to talk about the "laws of physics" or any of the other verbiage.

An inertial frame is simply one which has both zero 4-rotation and 4-acceleration; every excellent textbook on relativity I have ever seen defines it in this way, in terms of kinematical concepts. What's the issue?

In which way can a mathematical concept such as a coordinate system be physically zero-accelerated? May be you assume its origin remains collocated with a zero-accelerated physical body? Then how can one characterize a zero-accelerated body unless a postulate states that its accelerated or non-accelerated state of motion is an objective property of this body?

Apart from setting a postulate as suggested above, one necessarily comes back to invoking physical laws, leading to circular statements as per the Wikipedia dedicated article:

Within the realm of Newtonian mechanics, an inertial frame of reference, or inertial reference frame, is one in which Newton's first law of motion is valid. However, the principle of special relativity generalizes the notion of inertial frame to include all physical laws, not simply Newton's first law... According to the first postulate of special relativity, all physical laws take their simplest form in an inertial frame, ...
 
  • #55
Sugdub said:
In which way can a mathematical concept such as a coordinate system be physically zero-accelerated?

Who said anything about a coordinate system having zero acceleration? All I said was the frame has zero acceleration. All this means is the object of interest at rest in the frame has zero acceleration.

Sugdub said:
Then how can one characterize a zero-accelerated body unless a postulate states that its accelerated or non-accelerated state of motion is an objective property of this body?

There is no need for such a postulate. No such postulate exists in SR. It is simply a consequence of the definition in both Newtonian and relativistic mechanics.

Sugdub said:
Apart from setting a postulate as suggested above, one necessarily comes back to invoking physical laws, leading to circular statements as per the Wikipedia dedicated article

http://articles.adsabs.harvard.edu//full/1967QJRAS...8..252D/0000252.000.html

Also just because inertial frames are defined in a certain way in Newtonian mechanics doesn't mean we need to follow the same tired route in relativity. As atyy mentioned there is a much more coherent and fundamental way to approach SR, as opposed to the antiquated approach taken by Einstein and some of his contemporaries.
 
  • #56
PAllen said:
The discussion was do you need anything other than a physical principle of relativity+homgeneity+isotropy, to get SR. To apply POR, you need a non-circular physical definition of inertial frames. Then you need something physical (experiment or law) to select SR vs. GR (Galilean relativity, not General relativity).

Ah I see; I probably should have read the entire discourse.

atyy said:
In theory, yes. In practice, no, since one has to build some instruments to measure acceleration, rotation etc. And in calibrating them, the laws of physics will be used.

I don't disagree there.
 
  • #57
(Not sure whether I should stay involved with this, but... maybe one more post...)

First, let's replace the phrase:

"The laws of physics are identical in all inertial frames."

by the equivalent:

"The outcome of any physical experiment is the same when performed with identical initial conditions relative to any inertial frame."

Then we can seek a physical definition of "inertial frame"...

The task is not to express an observer's local experiences without referring to "laws of physics" (or, equivalently, the "outcome of any physical experiment [...]"). Rather, the task is to relate one observer's experiences to those of others. That's why it's called "relativity". :wink:

Of course each observer already possesses some physical concepts such as (local) position, time, and devices for measuring such things locally, and hence also a concept of differential ratios thereof (velocity, acceleration, etc). An observer equipped with suitable accelerometers, gyroscopes, etc, can tell whether heshe is accelerating or not. If heshe detects no acceleration, then heshe is an inertial observer. (In this sense, non-acceleration is indeed a property that an observer can ascribe to hisherself.)

The "inertial reference frame" imagined by an inertial observer is simply an intuitively natural extrapolation of locally performable operations, e.g., moving 1 step to the right, waiting until 1 minute has elapsed according to hisher clock, etc. To be an "inertial motion", such operations must be non-accelerative once completed, meaning that (e.g.,) after the spatial translation of moving 1 step to the right heshe still detects no acceleration.

Then we assume that (the mathematical expressions of) these operations form a Lie group, since that seems to be the case for strictly local operations, at least as far as they can reach.

Such local experience can be extended a bit further by the "radar" method, if the observer has a light emitting source (e.g., a torch) and a device for receiving light and noting its direction (e.g., a pair of eyes that implement binocular vision). Such parallax methods allow an observer to relate remote events to hisher imagined reference frame.

(I'll skip the additional complications/ambiguities that arise beyond the useful range of the radar method or more sophisticated parallax techniques.)
 
Last edited:
  • #58
WannabeNewton said:
I probably should have read the entire discourse.
Tsk, tsk. :wink:
 
  • #59
strangerep said:
Such local experience can be extended a bit further by the "radar" method, if the observer has a light emitting source (e.g., a torch) and a device for receiving light and noting its direction (e.g., a pair of eyes that implement binocular vision). Such parallax methods allow an observer to relate remote events to hisher imagined reference frame.

Well, if you are talking about drawing conclusions from the POR + experiments, before using the Radar method you first have to establish the constancy of light speed (no need to worry about one way / two way if we are assuming isotropy). Having done such an experiment, you already find SR selected rather than Galilean relativity.
 
  • #60
PAllen said:
Well, if you are talking about drawing conclusions from the POR + experiments, [...]
Actually, I was trying to describe how one might reach the concept of an inertial frame, beginning at a physically plausible starting point. Probably, I should have ditched the radar stuff in my previous post, since it confuses the logic -- as you pointed out.

[Edit: ... and thank you for pointing it out, btw. :biggrin: ]
 
Last edited:
  • #61
strangerep said:
An observer equipped with suitable accelerometers, gyroscopes, etc, can tell whether heshe is accelerating or not.

Yes, however one cannot demonstrate that the devices called “accelerometer”and “gyroscope” actually measure “accelerations” and “changes in orientation” without invoking somehow the laws of physics. Such devices can provide a more accurate assessment of our state of motion than our senses, once it has been demonstrated that they are fit to purpose, but beforehand the need remains for an independent definition of “inertial” (see below).

I agree with many of your statements which I find better than mine, in particular the need to refer to a consensus between observers. However I think a more logic presentation can be proposed if the postulates leading to SR are set at a deeper level, as follows.

strangerep said:
… non-acceleration is indeed a property that an observer can ascribe to hisherself.

This is not derived from your assumptions, it is a postulate, the first postulate from which everything will flow: my sensation reveals an objective property, a qualification upon which all observers will agree. Then, assuming I do not sense any acceleration or rotation in my body, my state of motion can be defined as “inertial”. Hence a 4-coordinate system attached to my body and reflecting my time flow provides an inertial frame of reference. I think this definition is immune from contradictions whilst removing the indirect reference to laws of physics (via "accelerometers").

A second postulate is required in order to derive the Lorentz transformation (assuming no further constraint will reduce the generality of the development alongside Rindler's approach), whereby the difference between representing oneself at rest or in constant motion is non-objective, conventional.

Thanks to the first postulate, the equivalence relationship “to be in constant relative motion” structures the family of all possible frames of reference in such a way that the associated transformation will map an inertial frame onto an inertial frame. Thanks to the arbitrariness set by the second postulate, one may conclude that:

strangerep said:
"The outcome of any physical experiment is the same when performed with identical initial conditions relative to any inertial frame."

In conclusion two independent postulates are required for developing SR, which are of a more general nature that those often proposed.
 
  • #62
Sugdub said:
the first postulate from which everything will flow: my sensation reveals an objective property, a qualification upon which all observers will agree. Then, assuming I do not sense any acceleration or rotation in my body, my state of motion can be defined as “inertial”. Hence a 4-coordinate system attached to my body and reflecting my time flow provides an inertial frame of reference. I think this definition is immune from contradictions whilst removing the indirect reference to laws of physics (via "accelerometers").
There's no significant difference between specifying that human senses should be used and specifying that an accelerometer (defined by instructions on how to build one) should be used. All you have done is to use a different device to detect acceleration.
 
  • #63
Sugdub said:
In conclusion two independent postulates are required for developing SR, which are of a more general nature that those often proposed.

That's fine so far. But to conclude SR is true rather than Galilean relativity, you must do some experiments that distinguish them. Your distinguishing experiment need not have anything to do with light, which you may consider a virtue of this approach (e.g. it could be time dilation).
 
  • #64
PAllen said:
That's fine so far. But to conclude SR is true rather than Galilean relativity, you must do some experiments that distinguish them. Your distinguishing experiment need not have anything to do with light, which you may consider a virtue of this approach (e.g. it could be time dilation).

I'm not sure how to interpret your statement. In the following I will assume you mean that a reference to actual experimental results is necessary before one can state that the postulates we are dealing with lead to the Lorentz transformation of SR as opposed to the Galilean transformation of the Newtonian mechanics.

If this is the case I disagree. I already explained why in a previous input but I welcome comments.

The Lorentz- or the galilean- transformation are the only possible outcomes but they are exclusive. So the choice between both must be sorted out before reaching any conclusion. On the one hand, the Lorentz transformation necessarily deals with a 4-dimensions space-time coordinate system since it entails a dependency between space coordinates (typically x) and the time coordinate. This dependency is constitutive of SR: it cannot be eliminated without abandoning the theory.

On the other hand, the Lorentz transformation gets squeezed down to the galilean transformation by negating this dependency between space and time so that only a 3-dimensions space coordinate system is required, complemented with a separate, independent, invariant 1-dimension time coordinate system. This reduction from 4 to 3+1 dimensions corresponds to a loss of generality. This loss can be triggered by various hypotheses or constraints acting on top of the postulates, for example by imposing that the time coordinate is left invariant.

The Lorentz transformation appears to be the most general solution for the required transformation, the only one which deals with a 4-dimensions integrated coordinate system. Since both possible outcomes are exclusive, a conclusion in favor of the galilean transformation could only be reached following the conscious acceptance of a reduction of generality, e.g. by adding that the time coordinate remains invariant or equivalently by adding the possibility for a signal propagating at an infinite speed, or by adding that simultaneity at a distance is a reality. The addition of any of the above constraints on top of the postulates will trigger the derivation of the galilean transformation. Otherwise, the Lorentz transformation will be arrived at.
 
  • #65
Sugdub said:
I'm not sure how to interpret your statement. In the following I will assume you mean that a reference to actual experimental results is necessary before one can state that the postulates we are dealing with lead to the Lorentz transformation of SR as opposed to the Galilean transformation of the Newtonian mechanics.

If this is the case I disagree. I already explained why in a previous input but I welcome comments.

The Lorentz- or the galilean- transformation are the only possible outcomes but they are exclusive. So the choice between both must be sorted out before reaching any conclusion. On the one hand, the Lorentz transformation necessarily deals with a 4-dimensions space-time coordinate system since it entails a dependency between space coordinates (typically x) and the time coordinate. This dependency is constitutive of SR: it cannot be eliminated without abandoning the theory.

On the other hand, the Lorentz transformation gets squeezed down to the galilean transformation by negating this dependency between space and time so that only a 3-dimensions space coordinate system is required, complemented with a separate, independent, invariant 1-dimension time coordinate system. This reduction from 4 to 3+1 dimensions corresponds to a loss of generality. This loss can be triggered by various hypotheses or constraints acting on top of the postulates, for example by imposing that the time coordinate is left invariant.

The Lorentz transformation appears to be the most general solution for the required transformation, the only one which deals with a 4-dimensions integrated coordinate system. Since both possible outcomes are exclusive, a conclusion in favor of the galilean transformation could only be reached following the conscious acceptance of a reduction of generality, e.g. by adding that the time coordinate remains invariant or equivalently by adding the possibility for a signal propagating at an infinite speed, or by adding that simultaneity at a distance is a reality. The addition of any of the above constraints on top of the postulates will trigger the derivation of the galilean transformation. Otherwise, the Lorentz transformation will be arrived at.

That just becomes another assumption. In addition to POR, isotropy, homogeneity you add: of the two remaining possibilities, pick the one you like. Any 'natural philosopher' of circa 1800, brought up to speed on the math, would say you obviously want to add a postulate of simultaneity to rule out the nonsensical alternative (what we call SR). In fact, Newton had such postulate: time flows equably and consistently for all observers. Unlike for symmetries, I don't see any convincing argument for one conclusion over another except experiment.
 
Last edited:
  • #66
PAllen said:
...In fact, Newton had such postulate: time flows equably and consistently for all observers.

Indeed it was this a priori assumption which prevented physicists looking for a transformation of 4-coordinate events. Once Einstein understood that his postulate on the invariance of the speed of light was incompatible with this a priori assumption, he could set more general 4-dimensions equations in view of producing a genuine transformation of space-time events, … and later on he understood that a further generalization could encompass the non-uniform gravity field... This is the way science is progressing.

PAllen said:
Unlike for symmetries, I don't see any convincing argument for one conclusion over another except experiment.

Imposing the invariance of the time coordinate also acts as an asymmetry since it restricts the transformation to 3 out of 4 coordinates of any event. There is no fundamental difference.
 
  • #67
Sugdub said:
Imposing the invariance of the time coordinate also acts as an asymmetry since it restricts the transformation to 3 out of 4 coordinates of any event. There is no fundamental difference.

How is "no time invariance" fundamentally more natural "time invariance"? Note, time is distinguishable in SR (the geometry is pseudo-Riemannian, not Riemannian). Or you can say, manifolds are preferable to fiber bundles (?).

Also, note that the value of c only chooses units not physics. Thus, there exactly two physically distinguishable choices to make (there is or is not an invariant speed), not infinite versus one. You can choose "time invariance" or "c invariance". You are not making headway convincing me that anything other than experiment chooses SR over Galilean relativity.
 
  • #68
PAllen said:
You [Sugdub] are not making headway convincing me that anything other than experiment chooses SR over Galilean relativity.
:smile:
 
  • #69
strangerep said:
Tsk, tsk. :wink:

:cry:
 
  • #70
PAllen said:
How is "no time invariance" fundamentally more natural "time invariance"? Note, time is distinguishable in SR (the geometry is pseudo-Riemannian, not Riemannian). Or you can say, manifolds are preferable to fiber bundles (?).

Also, note that the value of c only chooses units not physics. Thus, there exactly two physically distinguishable choices to make (there is or is not an invariant speed), not infinite versus one. You can choose "time invariance" or "c invariance". You are not making headway convincing me that anything other than experiment chooses SR over Galilean relativity.


Yes, I agree that in order to derive the Lorentz transformation a finite constant (c) with the dimension of a speed must somehow be injected according to which “instantaneous actions at a distance” get excluded. Obviously my previous inputs overlooked it. Thanks for this lesson.

Still I wish to challenge the rationale for invoking either a “law of physics” (such as Maxwell's equations or the “invariance of the speed of light”) or some experimental results (such as the Michelson and Morley experiment) as a valid foundation for the formal derivation of the Lorentz transformation. In my view, statements about the world, how it is, how it works, what happens there,... are just inappropriate. We should not accept any a priori statements in this range.

SR provides a formal framework into which a model / description / simulation of the world and associated phenomena will get hosted. The purpose of that framework is to specify how our formal description of the same phenomena should be evolved when we change the perspective from which this description is proposed. The SR framework should be physically neutral, it should provide an empty structure, in the same way as in GR the actual curvature of space-time relies upon the effective presence of energy or mass.

The two postulates I have proposed for SR are not about the world, they are about us: we sense accelerations and rotations whereas we do not sense speed or rest. The symmetries we have discussed are not about the world itself, they deal with our a priori concepts of space and time: we only grasp differences in position, in orientation, in time, not their absolute values ... But the addition of c as an external constraint, somehow linked to a belief in the existence of a “law of nature”, in order to complement this set of postulates and symmetry rules does not fit well. It has no bearing to the meta-rules which the SR formal framework must comply with.

Actually we all know why c is necessary, why “no signal can travel at an infinite speed”, why we must impose this constraint on our formal framework. It is not a postulate about the world and neither an external constraint derived from experiments. It reflects the causal structure we impose to any abstract construction deserving to be labelled as a “physics theory”. We can't accept that our theories claim “explaining” phenomena through “instantaneous actions at a distance”. Causes and effects must be ordered in time otherwise they can't be distinguished from each other. As long as our physics theories abide to some concept of causality, they must fit within a formal framework imposing a maximum limit for the speed of any signal invoked in a causal explanation.

My conclusion is that we impose the existence of c as a consequence of our own internal mental structure, it is not imposed to us by external experiments and neither by some miraculous knowledge about the world. Comments are, of course, welcome.
 
Back
Top