Prove Euler's Equation for Functional $J(y)$

In summary, the conversation discusses the theorem that states if y is a local extremum for the functional J(y)=∫a^b L(x,y,y') dx with y∈C2([a,b]), y(a)=y0, y(b)=y1 then the extremum y satisfies the ordinary differential equation of second order L_y(x,y,y')-d/dxL_y'(x,y,y')=0 (Euler's equation). The conversation also explores how to prove that Euler's equation can be written in the form L_t-d/dt(L-y'L_y')=0. The individual provides a hint on how to show this using the chain rule and Euler-Lagrange equation, and also provides an example
  • #1
evinda
Gold Member
MHB
3,836
0
Hello! (Wave)

According to my notes, the following theorem holds:

If $y$ is a local extremum for the functional $J(y)= \int_a^b L(x,y,y') dx$ with $y \in C^2([a,b]), \ y(a)=y_0, \ y(b)=y_1$ then the extremum $y$ satisfies the ordinary differential equation of second order $L_y(x,y,y')- \frac{d}{dx}L_{y'}(x,y,y')=0$ (Euler's equation).

I want to prove that Euler's equation of the problem $J(y)=\int L(t,y,y') dt$ can be written in the form $L_t- \frac{d}{dt}(L-y' L_{y'})=0$.

Could you give me a hint how we could show this? (Thinking)
 
Physics news on Phys.org
  • #2
Hi evinda,

We compute via the chain rule:

\(\displaystyle
\begin{align*}
L_{t}-\frac{d}{dt}(L-y'L_{y'})&=L_{t}-\frac{d}{dt}L+\frac{d}{dt}y'L_{y'}\\
&=L_{t}-L_{t}-y'L_{y}-y''L_{y'}\ldots
\end{align*}\)

I haven't finished the above computation of \(\displaystyle \frac{d}{dt}y'L_{y'},\) because I think you can probably get there yourself using the product and chain rules. If done correctly, you should be able to cancel a few terms and factor out a \(\displaystyle y'\) term. From there, you will want to recognize that three of the remaining terms are precisely \(\displaystyle \frac{d}{dt}L_{y'}.\) After noticing this, you can use the Euler-Lagrange equation to show that

\(\displaystyle L_{t}-\frac{d}{dt}(L-y'L_{y'})=0\)

Let me know if you're still stuck, or if anything is unclear/not quite right.
 
Last edited:
  • #3
Isn't it as follows?

$$L_t-\frac{d}{dt}(L-y' L_{y'})=L_t-L_t+\frac{d}{dt}(y' L_{y'})=y''(t) L_{y'}(t,y,y')+y' \frac{d}{dt} L_{y'}(t,y,y')$$

If so, then how do we deduce that the latter is equal to $0$?

Or am I wrong? (Thinking)
 
  • #4
Hi evinda,

I think the point of confusion is that we need to remember $y=y(t),$ so that when we write $L$, we really mean

$L=L(t,y(t),y'(t))$

With this in mind,

$\frac{d}{dt}L(t,y(t),y'(t))\neq L_{t}(t,y(t),y'(t)),$

because $\frac{d}{dt}$ is the so-called total derivative <--- which is basically a fancy way of saying the multivariable chain rule(see Total derivative - Wikipedia, the free encyclopedia for a reference)

I think an example to show why the previous equation doesn't hold may prove useful: Let's pretend that $L(t,y,y')$ is given by

$L(t,y,y') = t + y^{2}+3y'$

and that $y(t)$ is given by $y(t) = e^{t}$ (To be clear, I'm simply making up an example here, I am not claiming $L$ comes from a specific variational problem, nor am I claiming that $y$ solves the Euler-Lagrange equation).

Now, in this example, $L_{t}$ really means $L_{t}(t,y(t),y'(t)),$ which means to take the partial derivative of $L$ with respect to $t$ first, then plug in for $y(t)$. Doing so gives

$L_{t}(t,y(t),y'(t)) = 1$

On the other hand, the total-derivative (i.e. chain rule) $\frac{d}{dt}$ gives

$
\begin{align*}
\frac{d}{dt}L(t,y(t),y'(t))&=L_{t}(t,y(t),y'(t))\frac{dt}{dt}+L_{y}(t,y(t),y'(t))\frac{dy}{dt}+L_{y'}(t,y(t),y'(t))\frac{dy'}{dt}\\
&=1+2y(t)y'(t)+3y''(t)\\
&=1+2e^{2t}+3e^{t}
\end{align*}
$

From this example we see that $\frac{d}{dt}L\neq L_{t}.$ Does this example help shed some light on how the computation works out?

Returning to my original post, I began the computation as follows:

GJA said:
Hi evinda,

We compute via the chain rule:

\(\displaystyle
\begin{align*}
L_{t}-\frac{d}{dt}(L-y'L_{y'})&=L_{t}-\frac{d}{dt}L+\frac{d}{dt}y'L_{y'}\\
&=L_{t}-L_{t}-y'L_{y}-y''L_{y'}\ldots
\end{align*}\)

The $-L_{t}-y'L_{y}-y''L_{y'}$ terms in the above come from computing $-\frac{d}{dt}L.$ I did not finish the part of the computation involving the $\frac{d}{dt}y'L_{y'},$ which is what the ellipses represented. Do you think after reading this new post you can try to compute $\frac{d}{dt}y'L_{y'}$ using the product and chain rules again? If you're able to do this, my original post outlines what should happen from there. I'm sure you can get this, so I'm trying to avoid just giving you the full details right now. However, if you get stuck again, let me know and I will provide more help. Let me know if anything is unclear/not quite right.
 
  • #5
GJA said:
Hi evinda,

I think the point of confusion is that we need to remember $y=y(t),$ so that when we write $L$, we really mean

$L=L(t,y(t),y'(t))$

With this in mind,

$\frac{d}{dt}L(t,y(t),y'(t))\neq L_{t}(t,y(t),y'(t)),$

because $\frac{d}{dt}$ is the so-called total derivative <--- which is basically a fancy way of saying the multivariable chain rule(see Total derivative - Wikipedia, the free encyclopedia for a reference)

I think an example to show why the previous equation doesn't hold may prove useful: Let's pretend that $L(t,y,y')$ is given by

$L(t,y,y') = t + y^{2}+3y'$

and that $y(t)$ is given by $y(t) = e^{t}$ (To be clear, I'm simply making up an example here, I am not claiming $L$ comes from a specific variational problem, nor am I claiming that $y$ solves the Euler-Lagrange equation).

Now, in this example, $L_{t}$ really means $L_{t}(t,y(t),y'(t)),$ which means to take the partial derivative of $L$ with respect to $t$ first, then plug in for $y(t)$. Doing so gives

$L_{t}(t,y(t),y'(t)) = 1$

On the other hand, the total-derivative (i.e. chain rule) $\frac{d}{dt}$ gives

$
\begin{align*}
\frac{d}{dt}L(t,y(t),y'(t))&=L_{t}(t,y(t),y'(t))\frac{dt}{dt}+L_{y}(t,y(t),y'(t))\frac{dy}{dt}+L_{y'}(t,y(t),y'(t))\frac{dy'}{dt}\\
&=1+2y(t)y'(t)+3y''(t)\\
&=1+2e^{2t}+3e^{t}
\end{align*}
$

From this example we see that $\frac{d}{dt}L\neq L_{t}.$ Does this example help shed some light on how the computation works out?

I think that I got it... (Thinking)

GJA said:
Returning to my original post, I began the computation as follows:
The $-L_{t}-y'L_{y}-y''L_{y'}$ terms in the above come from computing $-\frac{d}{dt}L.$ I did not finish the part of the computation involving the $\frac{d}{dt}y'L_{y'},$ which is what the ellipses represented. Do you think after reading this new post you can try to compute $\frac{d}{dt}y'L_{y'}$ using the product and chain rules again? If you're able to do this, my original post outlines what should happen from there. I'm sure you can get this, so I'm trying to avoid just giving you the full details right now. However, if you get stuck again, let me know and I will provide more help. Let me know if anything is unclear/not quite right.

So is it as follows?

$L_t-\frac{d}{dt}\left( L(t,y,y')-y' L_{y'}(t,y,y')\right)=L_t(t,y,y')-\frac{d}{dt} L(t,y,y')+\frac{d}{dt}(y' L_{y'}(t,y,y')) \\ =L_t(t,y,y')-L_t(t,y,y')-L_y(t,y,y')y'(t)-L_{y'}(t,y,y')y''+y''L_{y'}(t,y,y')+y' \left( L_{y't}+L_{y'y}y'+L_{y'y'}y'' \right)$Or have I done something wrong? (Thinking)
 
  • #6
evinda said:
$L_t-\frac{d}{dt}\left( L(t,y,y')-y' L_{y'}(t,y,y')\right)=L_t(t,y,y')-\frac{d}{dt} L(t,y,y')+\frac{d}{dt}(y' L_{y'}(t,y,y')) \\ =L_t(t,y,y')-L_t(t,y,y')-L_y(t,y,y')y'(t)-L_{y'}(t,y,y')y''+y''L_{y'}(t,y,y')+y' \left( L_{y't}+L_{y'y}y'+L_{y'y'}y'' \right)$

Looks good!
 
  • #7
GJA said:
Looks good!

Do we continue as follows?

$$L_t(t,y,y')-L_t(t,y,y')-L_y(t,y,y')y'(t)-L_{y'}(t,y,y')y''+y''L_{y'}(t,y,y')+y' \left( L_{y't}+L_{y'y}y'+L_{y'y'}y'' \right)=y'(t) (-L_y(t,y,y')+L_{y't}(t,y,y')+y' L_{y'y}+y'' L_{y'y'}) \\ =-y'(t) \left( L_{y}(t,y,y')\right)-\frac{\partial}{\partial{t}}L_{y'}(t,y,y') \frac{dt}{dt}-\frac{\partial}{\partial{y}}L_{y'}(t,y,y') \frac{dy}{dt}-\frac{\partial}{\partial{y'}}L_{y'} \frac{dy'}{dt}\\=-y'(t)(L_{y}(t,y,y')-\frac{d}{dt}L_{y'}(t,y,y')) \overset{\text{ Euler-Lagrange}}{=}0$$

Or have I done something wrong? (Thinking)

Also, have we shown in that way that the Euler equation of the problem $J(y)= \int L(t,y,y')dt$ can be written in the form $L_t-\frac{d}{dt} (L-y'L_{y'})=0$?
If so could you explain me why it is like that?
Because of the fact that we deduced that $L_t-\frac{d}{dt} (L-y'L_{y'})=0$ using the Euler-Lagrange equation?
 
  • #8
evinda said:
Do we continue as follows?

$$L_t(t,y,y')-L_t(t,y,y')-L_y(t,y,y')y'(t)-L_{y'}(t,y,y')y''+y''L_{y'}(t,y,y')+y' \left( L_{y't}+L_{y'y}y'+L_{y'y'}y'' \right)=y'(t) (-L_y(t,y,y')+L_{y't}(t,y,y')+y' L_{y'y}+y'' L_{y'y'}) \\ =-y'(t) \left( L_{y}(t,y,y')\right)-\frac{\partial}{\partial{t}}L_{y'}(t,y,y') \frac{dt}{dt}-\frac{\partial}{\partial{y}}L_{y'}(t,y,y') \frac{dy}{dt}-\frac{\partial}{\partial{y'}}L_{y'} \frac{dy'}{dt}\\=-y'(t)(L_{y}(t,y,y')-\frac{d}{dt}L_{y'}(t,y,y')) \overset{\text{ Euler-Lagrange}}{=}0$$

Or have I done something wrong? (Thinking)

You got it! Nicely done :D

evinda said:
Also, have we shown in that way that the Euler equation of the problem $J(y)= \int L(t,y,y')dt$ can be written in the form $L_t-\frac{d}{dt} (L-y'L_{y'})=0$?
If so could you explain me why it is like that?
Because of the fact that we deduced that $L_t-\frac{d}{dt} (L-y'L_{y'})=0$ using the Euler-Lagrange equation?

We have actually shown this, because there are equal signs throughout. If you really want to convince yourself that the two equations are equivalent, you can START with the Euler-Lagrange equation

$L_{y}-\frac{d}{dt}L_{y'}=0$

multiply both sides by $-y'$ then follow the string of equations you wrote in your last post back to the form you want in the problem. Let me know if anything is unclear/not quite right.
 
  • #9
GJA said:
You got it! Nicely done :D

(Smile)

GJA said:
We have actually shown this, because there are equal signs throughout. If you really want to convince yourself that the two equations are equivalent, you can START with the Euler-Lagrange equation

$L_{y}-\frac{d}{dt}L_{y'}=0$

multiply both sides by $-y'$ then follow the string of equations you wrote in your last post back to the form you want in the problem. Let me know if anything is unclear/not quite right.

So you mean that we could also justify that the Euler-Lagrange equation can be written in the form $L_t-\frac{d}{dt}(L-y' L_{y'})$ as follows, right?$$L_y-\frac{d}{dt}L_{y'}=0 \\ \Leftrightarrow -y' L_y+y' \frac{d}{dt}L_{y'}=0 \\ \Leftrightarrow -y'L_{y}+y'(L_{y't}+L_{y'y} y'+ L_{y'y'} y'')=0 \\ \Leftrightarrow L_t-L_t-y'L_y+y'L_{y't}+(y')^2 L_{y'y}+y' y'' L_{y'y'}=0 \\ \Leftrightarrow L_t-(L_t+y' L_y+y'' L_{y'})+y'' L_{y'}+y' L_{y't}+(y')^2 L_{y'y}+y'y'' L_{y'y'}=0 \\ \Leftrightarrow L_t-\frac{d}{dt}L+\frac{d}{dt} y'L_{y'}=0 \\ \Leftrightarrow L_t-\frac{d}{dt}(L-y'L_{y'})=0$$

Or have I done somethig wrong? (Thinking)

Thinking about it again, couldn't we also do it as previously and just start with the equality $ L_t-\frac{d}{dt}(L-y'L_{y'})=0$ and then use equivalences? (Thinking)
 

FAQ: Prove Euler's Equation for Functional $J(y)$

What is Euler's equation for functional J(y)?

Euler's equation for functional J(y) is a fundamental result in mathematical analysis that relates the behavior of a functional to its derivative. It is expressed as J'(y) = 0, where J'(y) is the derivative of the functional J with respect to the function y.

Why is Euler's equation important in science?

Euler's equation is important in science because it allows us to optimize and find the most efficient solutions to a wide range of problems. It is used in fields such as physics, engineering, economics, and more to find the best possible solution to a given problem.

How do you prove Euler's equation for functional J(y)?

Euler's equation can be proven using the calculus of variations, which is a branch of mathematics that deals with finding the optimal path or function between two endpoints. This involves setting up an integral expression for the functional and then using the Euler-Lagrange equation to find the stationary points, which correspond to the optimal solution.

Can Euler's equation be extended to higher dimensions?

Yes, Euler's equation can be extended to higher dimensions, such as for functionals of multiple variables. The equation takes the form of a system of partial differential equations, known as the Euler-Lagrange equations, which must be solved simultaneously to find the optimal solution.

What are some practical applications of Euler's equation for functional J(y)?

Euler's equation has many practical applications, such as in optimizing the shape of structures, minimizing energy consumption, and finding the shortest path between two points. It is also used in the field of control theory to find the optimal control input for a system. Additionally, it has applications in quantum mechanics, where it is known as the Euler-Lagrange equation of motion.

Similar threads

Back
Top