How Does the Lagrangian Transform in Polar Coordinates for Orbital Motion?

In summary: IIn summary, the conversation is discussing the Lagrangian in terms of polar coordinates for a particle moving in a potential. One person is struggling to understand the second term of the Lagrangian, which represents tangential velocity. Another person explains that the first term is equivalent to the kinetic energy in cartesian coordinates. The second term can be derived by assuming the vectors for radial and tangential velocity are orthogonal or by using a coordinate transformation. The first method is simpler. The conversation also touches on the use of differentials and infinitesimals in calculus, with some disagreement about their validity in mathematical rigor.
  • #1
cepheid
Staff Emeritus
Science Advisor
Gold Member
5,199
38
In describing the motion (orbit) of a particle of mass m moving in a plane in a potential V(r), my prof wrote the Lagrangian as:

[tex] \mathcal{L} = \frac{1}{2}m|\dot{\vec{r}}|^2 - V(r) [/tex]

Without further ado (or explanation of any kind), he wrote this as:

[tex] \mathcal{L} = \frac{1}{2}m(\dot{r}^2 + r^2\dot{\phi}^2) - V(r) [/tex]

Where [itex] (r,\phi) [/itex] are the coordinates in the orbital plane.

I understand that he is expressing the magnitude of r'(t) in terms of its components, and that [itex] \dot{r} [/itex] is the velocity in the radial direction, whereas the second term should represent tangential velocity. But that part bothers me. After all...how did he arrive at this second term? It seems to me that he assumed r was constant, otherwise the second term is invalid. Yet if r were constant, then [itex] \dot{r} [/itex] would be zero!

Shouldn't it be done as follows?

[tex] |\dot{\vec{r}}|^2 = \left(\frac{ds}{dt}\right)^2 = \left(\frac{d(r \phi)}{dt}\right)^2 = (\dot{r}\phi + r\dot{\phi})^2 [/tex]

What am I doing wrong, and how did he get his formula?
 
Physics news on Phys.org
  • #2
cepheid said:
Shouldn't it be done as follows?

[tex] |\dot{\vec{r}}|^2 = \left(\frac{ds}{dt}\right)^2 = \left(\frac{d(r \phi)}{dt}\right)^2 = (\dot{r}\phi + r\dot{\phi})^2 [/tex]

What am I doing wrong, and how did he get his formula?

s is not equal to phi*r unless it is a circular motion (constant r). In polar coordinates,

[tex]ds=\sqrt{(dr)^2+(rd\phi)^2} [/tex]. See attached picture. The yellow thing is a rectangular triangle when the displacement is very small.

ehild
 
Last edited:
  • #3
He's computing .5*m*v^2 to get the kinetic energy for the first term.

In cartesian coordinates, it's .5*m*(vx^2 + vy^2).

To duplicate his result, you can either

1) note that the vectors [tex]\frac{d \hat r}{dt}[/tex] and [tex] \frac {d \phi}{dt}[/tex] are orthogonal, so you just add the squares of the two velocites, or

2) you can make the coordinate transformation

x = r(t)*cos(phi(t))
y = r(t)*sin(phi(t))

use the chain rule to compute dx/dt and dy/dt

square the results and add them together

but the first way is much easier. (The second is not too bad if you have a symbolic algebra program).
 
  • #4
Hmm...I think I've got it. However, it is somewhat confusing when two people give slightly different answers. However, am I right in saying that pervect's method number 1 is the same as ehild's method? The only difference is that pervect assumed that I would be able to get the two expressions for radial and tangential velocity components just like that, whereas ehild showed me how to do it on a diagram. Another thing that bothers me: to get the desired answer from what ehild showed on the diagram, you have to do this:


[tex]ds=\sqrt{(dr)^2+(rd\phi)^2} [/tex]

[tex] \frac{ds}{dt} = \frac{\sqrt{(dr)^2+(rd\phi)^2}}{dt} [/tex]

[tex] = \sqrt{\frac{(dr)^2+(rd\phi)^2}{(dt)^2}} [/tex]

[tex] = \sqrt{\frac{(dr)^2}{(dt)^2}+ \frac{(rd\phi)^2}{(dt)^2}} [/tex]

[tex] = \sqrt{\left(\frac{dr}{dt}\right)^2 + r^2\left(\frac{d\phi}{dt}\right)^2} [/tex]

[tex] = \sqrt{\dot{r}^2 + r^2\dot{\phi}^2} [/tex]

So, we get the right answer, but I'm wondering about the mathematical validity of the manipulations I performed. I understand that there must be some validity to algebraic manipulations of differentials (i.e. infinitesimals), because it is done all the time in integral calculus. But it seems to me that physicists are a lot more free and easy with differntials than mathematicians would say is permissible.

My question is, are we allowed to interpret them the way we do, as actual infinitesimal quantities, that can be squared, grouped together etc. Can we say that the derivative is a ratio of infinitesimal changes in (i.e. "elements" of or "little bits" of) two variables? This is the method of presenting calculus in Silvanus P. Thompson's Calculus Made Easy, and it is totally intuitive to me. He even gives reference to a mathematician (I can't remember who) who wrote a paper asserting (with some proof) that infinitesimals are legitimate quantities in mathematics. However, I have seen people on this site freaking out about that. My calculus prof would also have none of it. "You could interpret the derivative (etc.) that way," he said, "but you would merely be restating the same thing a different way. A limit is a well defined concept, and the definition of a derivative is given in terms of one."

In fact, isn't the definition of a differential a limit as well? I haven't looked it up, but I think it goes something like, for a differentiable function [itex] y = f(x) [/itex], on an interval [itex] \Delta x [/itex], we can make the following definition:

[tex] dy = \lim_{\substack{\Delta x \rightarrow 0}}f'(x) \Delta x [/tex]

I guess I could have derived the equation above more rigorously with that in mind. Still, clarification on this very confusing issue would be appreciated.
 
  • #5
This sort of thing is a real quagmire, because a great many number of concepts all get similar notation because they all "act like infintiessimals".


Infinitessimals can, in fact, be used rigorously, but you have to learn non-standard analysis. For example, the derivative is defined by:

[tex]
\frac{d}{dx} f(x) := \mathrm{Std} \frac{f^*(x^* + \Delta x) - f^*(x^*)}{\Delta x}
[tex]

Where [itex]\Delta x[/itex] is any nonzero infinitessimal, Std means to round to the nearest standard number, and the superscript * means to use the nonstandard version of that symbol. (It's sort of like distinguishing between 1 the integer and 1 the rational number)


However, you're probably better off translating your intuition to the standard way of doing things than trying to do nonstandard stuff intuitively.



Differentials, like f(x) dx, are simply functions on paths. That is, given a differential, f(x) dx, you can apply it to a path, [itex]\gamma[/itex], to get a number: [itex]\int_{\gamma} f(x) \, dx[/itex].


However, I don't know if the "arithmetic" used in computing ds is an actual arithmetic, or just a formal symbol that means "do the right thing".
 
  • #6
cepheid said:
Hmm...I think I've got it. However, it is somewhat confusing when two people give slightly different answers. However, am I right in saying that pervect's method number 1 is the same as ehild's method? The only difference is that pervect assumed that I would be able to get the two expressions for radial and tangential velocity components just like that, whereas ehild showed me how to do it on a diagram.

Yes, it is. In a polar coordinate system the base vectors (angular and azimuthal) can be written in terms of Cartesian coordinates as

[tex] \vec e_{r} = \cos\phi \vec i + \sin \phi \vec j \mbox { and }\vec e_\phi= -\sin \phi \vec i + \cos \phi \vec j[/tex].

The position of a point in vector form is

[tex] \vec r = r \vec e_r[/tex]

The velocity is the time derivative of the position vector.

[tex] \vec v = \dot \vec r = \dot r \vec e _r +r \dot \vec e_r[/tex]

We have to find the time derivative of the radial unit vector.

First, as

[tex]\vec e \cdot \vec e = 1\rightarrow \vec e \cdot \dot \vec e =0[/tex]

the time derivative of an unit vector is normal to the original vector, which means it is parallel to [tex] \vec e _\phi[/tex].

The magnitude of an unit vector [tex]\dot \vec e [/tex] can be found from a geometrical approach. In the limit when

[tex]\Delta\phi \rightarrow 0[/tex]

[tex] \frac{\Delta |\vec e|}{\Delta t}=\frac{\Delta \phi}{\Delta t} \rightarrow \dot \phi[/tex]

If you do not like this method, you can derive the radial unit vector written in Cartesian components

[tex]\dot \vec e_{r} = -\sin\phi *\dot\phi *\vec i + \cos \phi *\dot\phi*\vec j =\dot \phi * \vec e_\phi[/tex]

We get the velocity now in polar coordinates :

[tex] \vec v = \dot r \vec e _r +r\dot \phi \vec e_\phi[/tex].

Take the square :

[tex] (\vec v) ^2 = (\dot r )^2 + (r \dot \phi)^2 [/tex]

As for your problem with ds, how did you learn the line integral? How do you calculate the length of a line y=f(x) from A to B?

[tex] \int _A^B ds [/tex]

You replace ds with

[tex]\sqrt{dx^2+dy^2}=\sqrt{ 1+(f'(x))^2}dx[/tex],

don't you? This is the same method with infinitesimal quantities I used to get ds from dr and d(phi).

ehild
 
  • #7
How did we get this step? Sorry, probably something really easy I'm missing.

ehild said:
[tex]\vec e \cdot \vec e = 1\rightarrow \vec e \cdot \dot \vec e =0[/tex]
 
  • #8
cepheid said:
How did we get this step? Sorry, probably something really easy I'm missing.

It is differentiation of a product.
If you do not like vectors,

[tex]( \vec e )^2= \vec e \cdot \vec e = = e_x^2+e_y^2=1 [/tex]
[tex]\frac{d(\vec e)^2}{dt}= 2 (e_x*\dot e_x+e_y*\dot e_y) = 0[/tex]
Notice that
[tex](e_x*\dot e_x+e_y*\dot e_y)=\vec e \cdot \dot \vec e[/tex]

dot product of he unit vector and its derivative, and this is zero, which means that they are orthogonal.


ehild
 

FAQ: How Does the Lagrangian Transform in Polar Coordinates for Orbital Motion?

What is "Motion in a Central Field"?

"Motion in a Central Field" refers to the motion of an object or particle that is acted upon by a central force, meaning that the force acts towards or away from a fixed point.

What are some examples of central forces?

Examples of central forces include gravitational force, electrostatic force, and nuclear force.

How is the motion of an object in a central field described?

The motion of an object in a central field can be described using Newton's laws of motion and the law of universal gravitation, as well as other equations and principles in classical mechanics.

What is the difference between circular and elliptical orbits in a central field?

In a central field, a circular orbit has a constant radius and a constant speed, while an elliptical orbit has a changing radius and a changing speed. The shape of an elliptical orbit is determined by the eccentricity of the orbit, which is a measure of how elongated the orbit is.

What factors affect the motion of an object in a central field?

The motion of an object in a central field is affected by the mass of the object, the mass of the central body, the distance between the two objects, and the strength of the central force. Other factors such as air resistance and external forces may also play a role.

Similar threads

Replies
3
Views
1K
Replies
2
Views
1K
Replies
2
Views
822
Replies
2
Views
5K
Replies
2
Views
476
Replies
76
Views
5K
Replies
2
Views
782
Replies
3
Views
2K
Back
Top