Simple Derivation of Euler-Lagrange Equations

In summary, the thread starter wants to deduce the equations of motion in the form [itex]\frac{d}{dt} \frac{\partial \cal L}{\partial \dot{q}} - \frac{\partial \cal L}{\partial q} = 0[/itex] with little algebra, directly from Hamilton's principle, in the same way that Snell's law can be derived geometrically from the principle of least time. The replies outline the standard strategy of finding a stationary point of an integral by differentiating with respect to a variation parameter, and show how the Euler-Lagrange equations follow from this more general problem.
  • #1
dx
I'm trying to deduce the equations of motion in the form

[tex] \frac{d}{dt} \frac{\partial \cal L}{\partial \dot{q}} - \frac{\partial \cal L}{\partial q} = 0[/tex]

with little algebra directly from Hamilton's principle, like the geometric derivation of Snell's law from the principle of least time. It should be possible, since the equations are simple enough to write down. I've tried for about an hour with little luck. Does anyone know how to do it?
 
  • #2
First try letting [itex]q_i[/itex] be the usual Cartesian coordinates, [itex](x, y, z)[/itex]. Then you should be able to recover

[tex]F_x = m \ddot x,[/tex]

etc. Once you have that, show that the form of the Euler-Lagrange equation is invariant under coordinate transformations of the form

[tex]s_i = f_i(q_1, q_2, \dots q_n)[/tex]

where [itex]s_i[/itex] are the new coordinates and [itex]q_i[/itex] are the old.
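
For reference, here is a sketch of the chain-rule identity behind that last step, writing the same Lagrangian as [itex]\mathcal L[/itex] in the old coordinates and [itex]\tilde{\mathcal L}[/itex] in the new ones, and assuming the transformation is invertible:

[tex]\frac{d}{dt}\frac{\partial \mathcal L}{\partial \dot q_j} - \frac{\partial \mathcal L}{\partial q_j} = \sum_i \left( \frac{d}{dt}\frac{\partial \tilde{\mathcal L}}{\partial \dot s_i} - \frac{\partial \tilde{\mathcal L}}{\partial s_i} \right) \frac{\partial s_i}{\partial q_j}[/tex]

The two small facts needed are [itex]\frac{\partial \dot s_i}{\partial \dot q_j} = \frac{\partial s_i}{\partial q_j}[/itex] and [itex]\frac{d}{dt}\frac{\partial s_i}{\partial q_j} = \frac{\partial \dot s_i}{\partial q_j}[/itex]. Since the Jacobian [itex]\partial s_i / \partial q_j[/itex] is invertible, the left-hand side vanishes for every j exactly when the bracketed expressions vanish for every i, so the Euler-Lagrange equations keep their form.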
 
  • #3
Actually I was looking for something more deductive, i.e. starting out without already knowing the form of the E-L eqns.
 
  • #4
Wait, I think I read your post wrong. My post above describes how to prove that Lagrange's and Newton's methods are equivalent. What you want to do is derive the Euler-Lagrange equation from the principle of least action, right?

It's easiest to consider a more general problem (because the Euler-Lagrange equations are, in fact, more general, and they can be used to find stationary points of any integrals). You have some integral J expressed by

[tex]J = \int_A^B F(y, \dot y; x) \, dx[/tex]

for some function F, unknown function y, and independent variable x. (J itself is the functional: it maps a whole function y to a number.) Your goal is to find the function y that minimizes (or maximizes, or in general gives a stationary value for) the integral J on the interval [A,B], for the given F. So here is what you do:

Denote the "correct" function as [itex]y(x)[/itex]. Consider some functions [itex]y(x, \alpha)[/itex] which are "close to" y(x) in the sense that

[tex]y(x, \alpha) = y(x) + \alpha \eta(x)[/tex]

for some (arbitrary) function [itex]\eta(x)[/itex], which is chosen such that [itex]\eta(A) = \eta(B) = 0[/itex]. This way, [itex]y(x, \alpha)[/itex] is equal to [itex]y(x)[/itex] on the endpoints of [A,B], and they differ only in the middle.

Now, we can express J in terms of [itex]\alpha[/itex] as

[tex]J(\alpha) = \int_A^B F\left(y(x, \alpha), \frac{d y(x, \alpha)}{dx}; x\right) \, dx[/tex]

Note the total derivative on y; this is important. What we see is that a stationary point (with respect to variations in y) occurs when, for every admissible [itex]\eta[/itex],

[tex]\left. \frac{dJ}{d \alpha} \right|_{\alpha = 0} = 0[/tex]

So, carry out the differentiation. What's left is a little algebra and a little cleverness, but you should be able to arrive at the Euler-Lagrange equations:

[tex]\frac{d}{dx} \frac{\partial F}{\partial \dot y} - \frac{\partial F}{\partial y} = 0[/tex]

where [itex]\dot y = dy / dx[/itex] is treated as an independent variable.
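
In case it helps, here is a sketch of that differentiation (assuming F is smooth enough to differentiate under the integral sign). The chain rule gives

[tex]\frac{dJ}{d\alpha} = \int_A^B \left( \frac{\partial F}{\partial y}\,\frac{\partial y}{\partial \alpha} + \frac{\partial F}{\partial \dot y}\,\frac{\partial \dot y}{\partial \alpha} \right) dx = \int_A^B \left( \frac{\partial F}{\partial y}\,\eta + \frac{\partial F}{\partial \dot y}\,\frac{d\eta}{dx} \right) dx[/tex]

Integrating the second term by parts and using [itex]\eta(A) = \eta(B) = 0[/itex] to kill the boundary term,

[tex]\left. \frac{dJ}{d\alpha} \right|_{\alpha = 0} = \int_A^B \left( \frac{\partial F}{\partial y} - \frac{d}{dx}\frac{\partial F}{\partial \dot y} \right) \eta(x)\, dx = 0[/tex]

Since [itex]\eta[/itex] is arbitrary, the bracketed expression must vanish everywhere, which is exactly the equation above.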

Look up "calculus of variations" if you need more help. :)
 
  • #5
I know the usual derivation using the calculus of variations! I was looking for a simpler way to see it without using the calculus of variations, possibly geometric.
 
  • #6
Oh, why didn't you say so in the first place? I wrote all that out for nothin'... :P

My strategy would be to go to one dimension first. From Newton's laws, you know

[tex]F = m\ddot x[/tex]

[tex]T = \frac12 m \dot x^2[/tex]

If you consider that F is some function of time and space, F(x,t), then you can get U(x,t) by first defining an arbitrary zero point. It might be easiest to consider a force field which is constant in time, which gives

[tex]U(x) = - \int_{\mathcal O}^x F(x') \, dx'[/tex]

So then the Lagrangian is

[tex]\mathcal L = T - U = \frac12 m\dot x^2 + \int_{\mathcal O}^x F(x') \, dx'[/tex]

Then we find

[tex]\frac{\partial \mathcal L}{\partial \dot x} = m \dot x[/tex]

and

[tex]\frac{\partial \mathcal L}{\partial x} = F(x)[/tex]

which lead directly to

[tex]\frac{d}{dt} \frac{\partial \mathcal L}{\partial \dot x} - \frac{\partial \mathcal L}{\partial x} = m \ddot x - F(x) = 0[/tex]

This works for the static case, where F does not change in time. You can probably figure out how to prove it in the general case.

Once you get it for Cartesian coordinates, it's easy to show that the form of the EL equations is invariant under a change of coordinates.
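
If you want to check that algebra symbolically, here is a minimal sketch using sympy's euler_equations helper (assuming sympy is available; the potential is left as a generic function U(x), and the printed equation should be equivalent to [itex]m \ddot x = -dU/dx[/itex]):

[code]
import sympy as sp
from sympy.calculus.euler import euler_equations

t = sp.symbols('t')
m = sp.symbols('m', positive=True)
x = sp.Function('x')
U = sp.Function('U')

# One-dimensional Lagrangian L = T - U for a static potential U(x)
L = sp.Rational(1, 2) * m * sp.diff(x(t), t)**2 - U(x(t))

# euler_equations builds dL/dx - d/dt(dL/dx') = 0 for us;
# the result is equivalent to m*x'' = -dU/dx, i.e. Newton's second law.
print(euler_equations(L, x(t), t))
[/code]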
 
  • #7
Of course, this is a deduction from Newton's laws, and the definitions of potential and kinetic energy. How to get it from the least action principle, without calculus of variations, I don't know.

But the least action principle is pretty ad hoc anyway. Newton's laws are grounded in experiment.
 
  • #8
I don't think you're going to get them from the principle of least action without using COV. Well, the only other way I've seen is from the Feynman path integral, taking the saddle-point approximation, but this is essentially [tex]\frac{\delta S}{\delta x} = 0[/tex].

Oh, and another way you can derive EOMs is to use the theory of canonical transformations and derive the Hamilton-Jacobi equations of motion, which are equivalent to the E-L equations. There are two ways of deriving these, one from COV and one from canonical transformations, as I said. However, I suspect that you use the E-L implicitly at some point.

Do you have a prejudice against COV?
 
  • #9
No, I don't have any prejudice against COV :) I just like having simpler ways to see things, which can possibly help my intuition. Do you know the geometric derivation of Snell's law from the principle of least time? I was looking for something of that sort. I think one should be able to do something like that by looking at a differential element of the path and applying the stationary action condition to it.
 
  • #10
I'm not sure about Snell's law. The law of reflection can be done purely geometrically, but doesn't Snell's law require a single minimization? If you could point me at a reference I could think about the E-L case.
 
  • #11
Snell's law can be done purely geometrically too. Just draw the path from point A in the first medium to point B in the second medium, with angles [tex] \theta_{1} [/tex] and [tex] \theta_{2} [/tex] respectively to the normal. Now move the point at which the path crosses the interface between the media by a small amount [tex] \Delta x [/tex]. You will see that if the length of one segment increases, the other decreases. From the geometry it's easy to see that the first-order changes in length are [tex] \Delta x \sin \theta_{1} [/tex] and [tex] \Delta x \sin \theta_{2} [/tex] (with opposite signs). The corresponding changes in travel time are these path-length changes divided by the speed of light in each medium, i.e. proportional to the respective refractive index. So for the total first-order change in time to be zero, we need

[tex] n_{1}\Delta x \sin \theta_{1} = n_{2}\Delta x \sin \theta_{2} [/tex].

Canceling the [tex]\Delta x[/tex]s we get Snell's law.
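
For comparison, the same condition can be written with an explicit derivative. Picking labels for the geometry (a and b for the perpendicular distances of A and B from the interface, d for their horizontal separation, and x for the crossing point), the travel time is

[tex] T(x) = \frac{n_{1}}{c}\sqrt{a^2 + x^2} + \frac{n_{2}}{c}\sqrt{b^2 + (d - x)^2} [/tex]

and setting the derivative to zero gives

[tex] \frac{dT}{dx} = \frac{n_{1}}{c}\,\frac{x}{\sqrt{a^2 + x^2}} - \frac{n_{2}}{c}\,\frac{d - x}{\sqrt{b^2 + (d - x)^2}} = \frac{n_{1} \sin \theta_{1} - n_{2} \sin \theta_{2}}{c} = 0, [/tex]

which is the same statement.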
 
  • #12
I don't have the books with me right now, but I remember this derivation is given in Feynman's Lectures, Vol. 1, and also in Mach's "Science of Mechanics".
 
  • #13
dx said:
Snell's law can be done purely geometrically too. Just draw the path from point A in the first medium to point B in the second medium, with angles [tex] \theta_{1} [/tex] and [tex] \theta_{2} [/tex] respectively to the normal. Now move the point at which the path crosses the interface between the media by a small amount [tex] \Delta x [/tex]. You will see that if the length of one segment increases, the other decreases. From the geometry it's easy to see that the first-order changes in length are [tex] \Delta x \sin \theta_{1} [/tex] and [tex] \Delta x \sin \theta_{2} [/tex] (with opposite signs). The corresponding changes in travel time are these path-length changes divided by the speed of light in each medium, i.e. proportional to the respective refractive index. So for the total first-order change in time to be zero, we need

[tex] n_{1}\Delta x \sin \theta_{1} = n_{2}\Delta x \sin \theta_{2} [/tex].

Canceling the [tex]\Delta x[/tex]s we get Snell's law.

But that's just doing Calculus of Variations. It just so happens that the only functions you need to consider are piecewise linear. Varying the "point at which the ray crosses the boundary" is the same as varying [itex]\alpha[/itex] in [itex]y(x, \alpha) = y(x) + \alpha \eta(x)[/itex]. It just so happens that you can visualize this variation, because the path is a physical path in 3-dimensional space; whereas, in the Lagrangian formulation of mechanics, the paths being considered are paths in N-dimensional configuration space.

The only purely geometrical way of deriving Snell's Law that I know is by means of wave optics. Consider a plane wave incident on medium 2 at an angle [itex]\theta_1[/itex] moving at velocity [itex]c / n_1[/itex]. In medium 2, it moves with a velocity [itex]c / n_2[/itex]. Using Huygens' Principle, one can see that the wave fronts must emerge at an angle [itex]\theta_2[/itex] given by Snell's Law.
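
In symbols, a quick sketch of that construction (writing s for the distance along the interface between two neighbouring rays and [itex]\Delta t[/itex] for the time the incident wavefront takes to sweep across that stretch, just as bookkeeping):

[tex]\sin\theta_1 = \frac{(c/n_1)\,\Delta t}{s}, \qquad \sin\theta_2 = \frac{(c/n_2)\,\Delta t}{s} \quad \Longrightarrow \quad n_1 \sin\theta_1 = n_2 \sin\theta_2[/tex]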

The idea of "shortest time" doesn't enter into it; and in fact, the entire concept of minimizing time is suspect in the first place. After all, why should the light know where it is going? The geometrical argument shows that refraction is an entirely local effect, and the math just happens to work out such that the path of each ray of light minimizes time along itself.

In a similar vein, my derivation of the EL equations from Newton's laws shows that they are an entirely local effect, and the math "just works out" so that the path of the system through configuration space minimizes some quantity known as "action".

P.S.: You can get pretty-looking sines, cosines, logarithms, etc., in LaTeX by preceding them with a backslash, such as "\sin \theta_1": [itex]\sin \theta_1[/itex].
 
  • #14
I think the E-L equations are secretly buried in Huygens' principle, which is essentially a Green's function solution of the wave equation.
 
  • #15
Ben Niehoff said:
But that's just doing Calculus of Variations. It just so happens that the only functions you need to consider are piecewise linear. Varying the "point at which the ray crosses the boundary" is the same as varying [itex]\alpha[/itex] in [itex]y(x, \alpha) = y(x) + \alpha \eta(x)[/itex]. It just so happens that you can visualize this variation, because the path is a physical path in 3-dimensional space; whereas, in the Lagrangian formulation of mechanics, the paths being considered are paths in N-dimensional configuration space.

Of course we have to do some kind of variation, because it's a variational principle (which should really be called the principle of stationary time, to emphasize that it is not a teleological thing). But in this case we were able to see what the variation was geometrically. And in the case of E-L, we don't have to consider all N dimensions at once: we can derive the differential equation for each of the [tex] q_i(t) [/tex] independently, and each of the q's can be plotted vs. time in a plane.

Ben Niehoff said:
The idea of "shortest time" doesn't enter into it; and in fact, the entire concept of minimizing time is suspect in the first place. After all, why should the light know where it is going? The geometrical argument shows that refraction is an entirely local effect, and the math just happens to work out such that the path of each ray of light minimizes time along itself.In a similar vein, my derivation of the EL equations from Newton's laws shows that they are an entirely local effect, and the math "just works out" so that the path of the system through configuration space minimizes some quantity known as "action".

Actually it's not true that the action is minimized. It is well known that the action is only stationary and that it is a local effect. And since it is a local effect, it should be possible to derive the E-L equations by considering only a small element of the path, instead of using the calculus of variations on the entire path.

Even in the case of light, it's not true that the time is minimum. It is only stationary.

Thanks for the tex tip :)
 
  • #16
Whether it is stationary or minimal is immaterial to the point I was making.

And besides, I know that. "Least action principle" is what it's called, by convention, and it's easier to say than "stationary action principle".
 
  • #17
Ben Niehoff said:
So, carry out the differentiation. What's left is a little algebra and a little cleverness, but you should be able to arrive at the Euler-Lagrange equations:

[tex]\frac{d}{dx} \frac{\partial F}{\partial \dot y} - \frac{\partial F}{\partial y} = 0[/tex]

where [itex]\dot y = dy / dx[/itex] is treated as an independent variable.

Why is it that one can take y to be independent of its derivative, dy/dx?
 
  • #18
Someone must know this! :-) Every book I've seen just says that they are independent, but offers no explanation.
 
  • #19
For a functional like F[x, y(x), y'(x)], it's F that depends on x, y(x), and y'(x). These can be regarded as independent variables, since we can define F in many different ways using all or some of them. For example, F = x + y'/y defines a particular F and therefore a particular class of "curves" (or functions) over some range of x and some definition of the function y. The functional F = x(y + y') defines an entirely different class of curves, even for identical specifications of x and y. Both instances of F depend on how the variables x, y(x), and y'(x) are used in F's definition, so in that sense x, y, and y' are the independent variables of F.

[At least that's the way I've come to see it. If I'm wrong maybe someone can straighten us both out]
 
  • #20
My last post sort of sux. For a functional F=F(x,y,y'), where y=y(x) and y'=dy/dx, F "totally" depends on x and "partially" depends on y and y'. That is, we can find the total derivative of F with respect to x, dF/dx. However, we can also find the partial derivative of F with respect to y or y'. The nature of functionals like F is a matter of the mathematics of "function spaces". The function space for F(x,y,y') is the set of all functions that have a continuous first derivative; the function space for F(x,y,y',y'') is the set of all functions having continuous second derivatives, and so on. Understanding functionals requires a little knowledge of the math of function spaces, normed linear spaces, and so on. An important part of this is how "distance" is defined between functions, and therefore what it means to add an infinitesimal increment to a function.
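
To make the total-versus-partial distinction concrete, take the F = x + y'/y example from the previous post:

[tex]\frac{\partial F}{\partial y} = -\frac{y'}{y^2}, \qquad \frac{\partial F}{\partial y'} = \frac{1}{y}, \qquad \frac{dF}{dx} = \frac{\partial F}{\partial x} + \frac{\partial F}{\partial y}\,y' + \frac{\partial F}{\partial y'}\,y'' = 1 - \frac{(y')^2}{y^2} + \frac{y''}{y}.[/tex]

The partial derivatives treat x, y, and y' as independent slots in F's definition; the total derivative then feeds the x-dependence of y and y' back in through the chain rule.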

A very readable treatment of this is in the first chapter of the very inexpensive Dover publication, Calculus of Variations, I.M. Gelfand and S.V. Fomin. See https://www.amazon.com/dp/B001G4RG0G/?tag=pfamazon01-20
 
  • #21
Ah, now it's starting to make a little sense. I will try and get a hold of that book.
 
  • #22
Here's a fairly common and informative example - consider the following functional J[y]:

First, let

[tex]y = y(x)[/tex]

[tex]y' = \frac{{dy}}{{dx}}[/tex]

Then define F:
[tex]F(x,y,y') = \sqrt{1 + y'^2}[/tex]

And then define J:
[tex]J[y] = \int_a^b F(x,y,y')\, dx = \int_a^b \sqrt{1 + y'^2}\, dx[/tex]

First off, F is defined for all (and only) those functions y = y(x) that have continuous first derivatives (we'll call them "valid functions") over the closed interval from x = a to x = b. Obviously, the same goes for J[y]. Geometrically speaking, for any valid function y = y(x), J[y] represents the length of that function's curve between the two points P1 = [a, y(a)] and P2 = [b, y(b)]. Notice that y = y(x) on this closed interval isn't just one function: it's an independent variable that ranges over an infinity of possible valid functions or curves. Also, for this example, P1 and P2 are more than likely NOT the same points for different y functions. Getting a particular value for J would require us to specify values for a and b and to pick a particular valid function y = y(x). So you can see how the function y acts as an independent variable here for both F and J. And even though y' isn't specified directly and always depends on our choice of y, it is still a variable in the sense that F depends on it directly.
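
As a sanity check, plug this F into the Euler-Lagrange equation. Since F has no explicit dependence on y,

[tex]\frac{d}{dx}\frac{\partial F}{\partial y'} - \frac{\partial F}{\partial y} = \frac{d}{dx}\left( \frac{y'}{\sqrt{1 + y'^2}} \right) = 0,[/tex]

so y'/\sqrt{1 + y'^2} is constant, hence y' is constant, and the stationary curves (with the endpoints held fixed) are straight lines, exactly the curves of shortest length between P1 and P2.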
 

FAQ: Simple Derivation of Euler-Lagrange Equations

1. What are Euler-Lagrange equations?

Euler-Lagrange equations are a set of mathematical equations that are used to describe the motion of a system in classical mechanics. They are derived from the principle of least action, which states that the path taken by a system between two points is one that makes the action stationary (often loosely described as minimizing it); the action is a quantity built from the system's kinetic and potential energy.

2. How are Euler-Lagrange equations derived?

Euler-Lagrange equations are derived by applying the principle of least action to a Lagrangian function, which is the difference between the system's kinetic and potential energy. This results in a set of differential equations that describe the motion of the system.
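
In formulas, with the action defined as the time integral of the Lagrangian [itex]\mathcal L = T - U[/itex]:

[tex]S[q] = \int_{t_1}^{t_2} \mathcal L(q, \dot q, t)\, dt, \qquad \delta S = 0 \;\Longrightarrow\; \frac{d}{dt}\frac{\partial \mathcal L}{\partial \dot q} - \frac{\partial \mathcal L}{\partial q} = 0.[/tex]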

3. What is the significance of Euler-Lagrange equations?

Euler-Lagrange equations are significant because they provide a powerful mathematical tool for analyzing the motion of a system in classical mechanics. They can be used to derive the equations of motion for a wide range of physical systems and can help predict the behavior of these systems.

4. What is the difference between Euler-Lagrange equations and Newton's laws of motion?

The main difference is that the Euler-Lagrange equations are derived from the principle of least action, while Newton's laws are stated directly in terms of forces. Newton's laws are often more intuitive to apply, but the Euler-Lagrange equations work in arbitrary generalized coordinates and handle constraints more conveniently, so they can be used to describe a wider range of systems.

5. How are Euler-Lagrange equations used in real-world applications?

Euler-Lagrange equations have many practical applications in physics and engineering. They are used in the study of classical mechanics, as well as in fields such as robotics, fluid dynamics, and control theory. They are also used in advanced topics in physics, such as quantum mechanics and general relativity.
