The Relativistic Force-Norm and the Proof of E=mc2

  • Thread starter velo city
  • Start date
  • Tags
    Proof
In summary, the proof of Einstein's famous equation, E = mc2, is based on a derivation that goes something like this: - Assume that the energy of a particle is proportional to the mass and is a function of the magnitude of the velocity: E = g(v) m (By "mass", I mean rest mass). - Assume that the momentum of a particle is proportional to the mass and is in the same direction as the velocity: \vec{p} = f(v) m \vec{v}. - Assume that the expression for momentum approaches the Newtonian result, \vec{p} = m \vec{v} for low speeds. That implies
  • #71
DrStupid said:
That's not a matter of taste. According to Newton's second law this is a force.

Yes, it is purely a matter of taste. The equation of motion:

[itex]F_{physical} = m \dfrac{dv}{dt}[/itex]

and the equation of motion

[itex]F_{total} = m \dfrac{dv}{dt} + \dfrac{dm}{dt} v[/itex]

are EXACTLY equivalent, provided that

[itex]F_{total} = F_{physical} + \dfrac{dm}{dt} v[/itex]

Adding the same quantity, [itex]\dfrac{dm}{dt} v[/itex], to both sides of an equation cannot possibly change the physical content of the equation.

If you had some independent way of computing [itex]F_{total}[/itex], then it would make sense to use the equation [itex]F_{total} = \dfrac{dp}{dt}[/itex] even when the mass is variable. But there is no way to compute [itex]F_{total}[/itex] other than adding up the known interaction forces and then adding a term [itex]\dfrac{dm}{dt} v[/itex]. The only point of adding this term on the left side (the forces) is to cancel the same term appearing on the right side. Why not just leave it out, since it always cancels?
 
Physics news on Phys.org
  • #72
stevendaryl said:
Dr. Stupid's approach is to say that both the left-hand side and the right-hand side should be augmented:

[itex]\vec{F}_{total} = \vec{F}_{interaction} + \vec{F}_{boundary}[/itex]

where [itex]\vec{F}_{boundary}[/itex] reflects the rate at which momentum is passing across the object boundary.

In classical mechanics I agree with this interpretation but this thread is primary about special relativity and that makes it different. In order to avoid confusions I better return to Newton's term "quantity of matter". In classical mechanics this property is identical with mass and can change in open systems only. But in special relativity it can also change in closed systems because it is frame dependent under Lorentz transformation. In that case both terms of the equation result from interactions.
 
  • #73
stevendaryl said:
I think I've gotten to the bottom of this dispute, and I'm fairly sure that it's just a terminological difference. Dr. Stupid's position simply amounts to calling the expression [itex]\dfrac{dm}{dt}\ \vec{v}[/itex] a "force". There are no consequences to this choice. It's not a different theory, it's just a different way of accounting for things.
I think you have analyzed the situation pretty beautifully and it's in the spirit of what i said in the middle of #46 and the end of #48 (although i wasn't aware of the fact that the difference between the two situations is entirely due to the boundary term). But I also think that this (together with the fact that ##F=\dot p## spoils Galilean invariance) is even more evidence in favour of the idea that the real fundamental second law is ##F=ma## instead of ##F=\dot p##, since force is a real physical thing that can be measured and thus shouldn't depend on the boundaries that we draw mentally. Thus ##F=\dot p## seems like some kind of ensemble description really (much like the grand canonical ensemble in statistical mechanics).

But I'm still not convinced that ##F=\dot p## works everywhere. Think for example of a fuel consisting of ferromagnetic particles that are confined to a tank while inside the rocket whereas they can move freely as soon as they are exhausted. If there are any external magnetic fields, then it should make a real physical difference whether we count the fuel particles to the rocket tank or the freely moving fuel and it's not just a matter of drawing boundaries. I might be wrong of course, but it seems very plausible at last.
 
  • #74
stevendaryl said:
Yes, it is purely a matter of taste. The equation of motion:

[itex]F_{physical} = m \dfrac{dv}{dt}[/itex]

You can define such a property but you should not call it force in order to avoid confusions. This term is reserved for dp/dt. Your [itex]F_{physical}[/itex] and force can be equal (e.g. for all closed systems in classical mechanics) but they are not identical.
 
  • #75
rubi said:
Well, obviously if you treat the system as a many-particle system (consisting of the empty balloon and the individual water molecules) with each of these "particles" evolving according to ##F_i=m_i a_i##, then the sum of all momenta (the total momentum) is conserved (##\dot p_\mathrm{total} = 0##) or more generally, it is the sum of all the forces that act on the individual "particles". This just confirms the idea that in classical mechanics, only ##F=ma## is valid and every corresponding formula for variable-mass systems is just an effective version of ##F=ma##.

This is exactly right.

rubi said:
You can't solve such problems without taking the relative exhaust velocity of the ejected matter into account. It should be obvious that it matters, because the resulting motion of the balloon/rocket/.. must depend heavily on both the direction and the magnitude of the relative exhaust velocity. The equation ##F=\dot p## can't take the relative exhaust velocity into account, since the term that shows up if you apply the product rule (##\dot m v##) doesn't depend on the relative exhaust velocity at all! Thus ##F=\dot p## must necessarily be wrong in almost all cases where ##\dot m \neq 0##, except those in which ##F_\mathrm{thrust}= \dot m v_\mathrm{rocket}## happens to be true by coincidence.

Yes, you're right again.


There's really no way to save ##F=\dot p##. It is just inconsistent.


The first law doesn't even matter. It follows already from a many-particle description of the situation (empty balloon + water molecules) that there is no force acting on the balloon and the water molecules that are still contained in the balloon, since they are moving at constant velocity and the fundamental description ##F=ma## of the situation really implies ##F=0##. If ##F=\dot p## were consistent for a variable-mass system, then it would have to reproduce this fact.


Unfortunately there are some coincidental situations in which it works. That's the reason for why it is such a wide spread misconception that even some textbook authors get it wrong.

I know. People should be more careful when using "effective theories" (they should derive their formulas, in this case, from Newtonian mechanics of a system of many particles of constant mass).


It should be clear, that if we have a microscopic theory consisting only of particles of constant mass, then any effective description of a composite system like a variable-mass system requires a derivation from first principles, i.e it requires to be derived from a theory with only particles of constant mass. So even if it were true, we couldn't simply postulate ##F=\dot p## for ##\dot m\neq 0##.

Totally agree with this.
 
  • #76
stevendaryl said:
Ah! Okay, I think I understand what you're saying.

It's the old continuity equation business. If you have a surface representing the boundary of an "object", there are two ways that momentum and energy can change:

(1) The particles within the boundary might change their energy or momentum.
(2) Particles may enter or leave through the boundary (taking energy and momentum with them).

So I think that this is just a terminological difference. I would not consider effect (2) to be a "force", at all. Effect (2) is present even for noninteracting particles. For example, suppose that I have a collection of noninteracting particles, initially confined within a region shaped like a cube (whose faces point in the x, y, and z directions). Half the particles have velocity zero, and the other have a nonzero velocity [itex]v[/itex] in the x-direction. Initially, the stationary particles and the moving particles are both distributed uniformly within the cube.

If we consider the cube to be the object, then the object will be losing mass at a rate of [itex]\dfrac{dm}{dt} = -\rho v A[/itex] where [itex]\rho[/itex] is the density of the moving particles near the face pointing in the x-direction, [itex]v[/itex] is the speed normal to the face, and [itex]A[/itex] is the area of the face.

The particles that the cube is losing all have velocity [itex]v[/itex] in the x-direction, so the momentum of the cube is changing with time according to

[itex]\dfrac{dp}{dt} = \dfrac{dm}{dt}\ v[/itex]

I would not call this rate of change of momentum a "force" at all. To me, noninteracting particles by definition cannot exert forces.

Exactly. You have described the issue perfectly here. It is just "another terminology" that, in my opinion, only complicate things (after all, there is no cube; in reality there are only particles in your example).

It can be harmless or it can be dangerous (it is harmless if you know what is going on in reality, but if someone really thinks that in your example there is "a force" acting on your "vanishing cube"...that can be dangerous, for his undertanding).
 
  • #77
Note, that if you define all systems as consisting or particles, or all systems as describe by continuous elements of ρdv, then F=dp/dt holds precisely, with no anomalies or confusions. As has been noted, confusions originate in defining composite opjects, for which dm/dt is presumed to apply.

Note, that modern approaches to relativity do not introduce dm/dt. Instead, mass/density are invariant scalars, and momentum is 4-vector (covector if you care about the Lagrangian), with appropriate transformation properties.
 
  • #78
PAllen said:
Note, that modern approaches to relativity do not introduce dm/dt. Instead, mass/density are invariant scalars, and momentum is 4-vector (covector if you care about the Lagrangian), with appropriate transformation properties.

That's why it's better to start with Newton's "quantity of matter" as a parameter that is not specified and than derive the properties of this parameter from basic definitions and the corresponding transformation.
 
  • #79
PAllen said:
Note, that modern approaches to relativity do not introduce dm/dt. Instead, mass/density are invariant scalars, and momentum is 4-vector (covector if you care about the Lagrangian), with appropriate transformation properties.
This is right. And in relativity, the time parameter in derivatives should really be replaced by the proper time in order to make things Lorentz invariant and in this case, the equation of motion in relativity is ##f^\mu = m a^\mu## as well, where ##m## is the (constant) rest mass of the particle, much in accordance to what happens in classical mechanics. It also generalizes beautifully to non-inertial frames, where this equation becomes ##m\left(\frac{\mathrm d^2 x^\mu}{\mathrm d \tau^2} + \Gamma^\mu_{\rho\sigma} \frac{\mathrm d x^\rho}{\mathrm d \tau} \frac{\mathrm d x^\sigma}{\mathrm d \tau}\right) = f^\mu##. Making the ##m## parameter dependent on the velocity would spoil pretty much all of the coordinate-invariant notions of relativity.
 
  • #80
rubi said:
This is right. And in relativity, the time parameter in derivatives should really be replaced by the proper time in order to make things Lorentz invariant and in this case, the equation of motion in relativity is ##f^\mu = m a^\mu## as well, where ##m## is the (constant) rest mass of the particle, much in accordance to what happens in classical mechanics. It also generalizes beautifully to non-inertial frames, where this equation becomes ##m\left(\frac{\mathrm d^2 x^\mu}{\mathrm d \tau^2} + \Gamma^\mu_{\rho\sigma} \frac{\mathrm d x^\rho}{\mathrm d \tau} \frac{\mathrm d x^\sigma}{\mathrm d \tau}\right) = f^\mu##. Making the ##m## parameter dependent on the velocity would spoil pretty much all of the coordinate-invariant notions of relativity.

Well, except that in SR, p=mU, so F=dp/d[itex]\tau[/itex] is exactly equivalent. Further, it is more accurate because you really want 4-velocity and 4-acceleration to be be vectors, while momentum and force are covectors, for a Lagrangian treatment. Thus, p = mU<lower index with metric>, and force is dp/d[itex]\tau[/itex]
 
Last edited:
  • #81
PAllen said:
Well, except that in SR, p=mU, so F=dp/dt is exactly equivalent. Further, it is more accurate because you really want 4-velocity and 4-acceleration to be be vectors, while momentum and force are covectors, for a Lagrangian treatment. Thus, p = mU<lower index with metric>, and force is dp/d[itex]\tau[/itex]
This is right and I didn't want to deny that. Of course ##F=ma## and ##F=\dot p## are always equivalent if ##m## is a constant and ##p=mv##. I'm just saying that ##m=\mathrm{const}## is conceptually better, because it makes things Lorentz invariant and even allows for coordinate-free descriptions. (Of course you're also right about the vector/covector distinction that needs to be taken seriously in general. I have just neglected it here in order to not be too technical.)
 
  • #82
rubi said:
##m=\mathrm{const}## is conceptually better, because it makes things Lorentz invariant

##m## can be a function of proper time and still be a Lorentz scalar and preserve covariance and transformation properties.
 
  • #83
PeterDonis said:
##m## can be a function of proper time and still be a Lorentz scalar and preserve covariance and transformation properties.
This is correct as well, but i was talking about velocity dependence in that post. I should have made that clear.

I'm not sure what the effects of a ##\tau##-dependent mass would be in relativity. I suspect it would introduce the same difficulties with boundary terms (this time 4-dimensional) that stevendaryl explained in his posts. But point-particle interactions in relativity are problematic anyway, so I'm not sure whether that situation is relevant anyway.
 
  • #84
PeterDonis said:
##m## can be a function of proper time and still be a Lorentz scalar and preserve covariance and transformation properties.

Except then, if you take force as dp/d[itex]\tau[/itex], you get the result that the 4-force does not transform properly - its magnitude is frame dependent. I think for simplicity and consistency you need to treat either a system of particles where m is constant for each, or a continuous system based on [itex]\rho[/itex]dv.
 
  • #85
PAllen said:
Except then, if you take force as dp/d[itex]\tau[/itex], you get the result that the 4-force does not transform properly - its magnitude is frame dependent.

Can you give an example? Bear in mind I'm not really thinking of something like a rocket; for that example I agree that the system should be modeled as a bunch of objects each with a constant rest mass: the payload plus the individual packets of fuel/reaction mass that get ejected. Each fuel/reaction mass packet just gets ejected at a slightly different proper time along the payload's worldline (i.e., it follows the same worldline as the payload up until some proper time ##\tau##, and then gets ejected).

I'm thinking of something more like this: I have an object that gets heated by laser beams coming at it isotropically, so the net 3-force on it is zero in its rest frame; the only thing that happens to it in its rest frame is that it gets heated, meaning its rest mass increases. That means the 4-force on the object is *not* zero, even though the 3-force in the rest frame is; the 4-force ##dp_{\mu} / d \tau## includes a "timelike" term for the heating, which then transforms into "spacelike" terms in other frames (corresponding to the fact that the 3-force in those other frames is not zero even though the object's velocity in an inertial frame in which it is moving does not change--its 3-acceleration is zero in any frame).
 
  • #86
PeterDonis said:
Can you give an example? Bear in mind I'm not really thinking of something like a rocket; for that example I agree that the system should be modeled as a bunch of objects each with a constant rest mass: the payload plus the individual packets of fuel/reaction mass that get ejected. Each fuel/reaction mass packet just gets ejected at a slightly different proper time along the payload's worldline (i.e., it follows the same worldline as the payload up until some proper time ##\tau##, and then gets ejected).
This is what I was thinking of.
PeterDonis said:
I'm thinking of something more like this: I have an object that gets heated by laser beams coming at it isotropically, so the net 3-force on it is zero in its rest frame; the only thing that happens to it in its rest frame is that it gets heated, meaning its rest mass increases. That means the 4-force on the object is *not* zero, even though the 3-force in the rest frame is; the 4-force ##dp_{\mu} / d \tau## includes a "timelike" term for the heating, which then transforms into "spacelike" terms in other frames (corresponding to the fact that the 3-force in those other frames is not zero even though the object's velocity in an inertial frame in which it is moving does not change--its 3-acceleration is zero in any frame).

I agree, an this is a great example of why dp/d[itex]\tau[/itex] is fundamental. You have 4-force on the body, with the time component nonzero in its rest frame. This leads to change in the norm=invariant mass of the body.
 
  • #87
PAllen said:
You have 4-force on the body, with the time component nonzero in its rest frame. This leads to change in the norm=invariant mass of the body.

Yes, and the magnitude of the force--meaning the norm of the 4-force vector (or covector if we're being precise) is a Lorentz scalar, just as it should be. It's not frame-dependent. The magnitude of the *3-force* is frame-dependent, of course; but we would expect it to be since it's not a 4-vector to begin with.
 

Similar threads

Replies
67
Views
7K
Replies
12
Views
1K
Replies
7
Views
2K
Replies
3
Views
1K
Replies
5
Views
5K
Replies
14
Views
2K
Replies
7
Views
2K
Replies
14
Views
5K
Replies
5
Views
7K
Back
Top