Understanding Stationary Points in Multivariable Functions

In summary: That's great. But it is not always the case. So what? What is important is to clearly understand the method. We learn more from counter-examples than we do from examples.In summary, the conversation discusses the problem of finding stationary points in the functions G(t) and F(x,y). It is stated that G(t) has stationary points at t = 0, t = -1/2, and t = -4, while F(x,y) only has a suitable stationary point at (0,0). The use of Lagrange multipliers is also explored, and it is noted that while it may seem intuitive for any maximum/minimum on g(x,y) to also lie on f
  • #1
unscientific
1,734
13

Homework Statement



The problem as attached in the picture.

The Attempt at a Solution



So here's the problem that I was facing:

G(t) has stationary points at t = 0, t = -1/2, t = -4

They correspond to (0,0), (a/4, -a) and (16a,-8a) in the function G(t).

However, only (0,0) is a suitable stationary point in F(x,y).

Here are some of my conclusions:

Before the "condition" x = at2, y = 2at was applied, (x,y) was free to have any possible value.

However, with the restriction implemented, again let's imagine F(x,y) as a terrain with many mountains, pits and saddles. By implementing this restriction we only see what lies along the "track" of h(x,y), taking the highest mountain and the lowest pit as the maximum and minimum. But this is not the case, as in the whole map of F(x,y) there may very well be even higher mountains and even deeper pits that just lie outside the track..
 

Attachments

  • stationary pts.jpg
    stationary pts.jpg
    52.1 KB · Views: 455
Last edited:
Physics news on Phys.org
  • #2
Part 2: Lagrange multipliers

I attempted to use Lagrange multipliers to solve the problem above:

The problem is summed up as such:

F(x,y) = x2 + y2 +3xy
subject to the condition h(x,y) = y2 -4ax = 0

dF = (2x+3y) dx + (2y + 3x) dy = 0

λdh = (-4a)λ dx + (2y)λ dy = 0d(F-λdh) = (2x + 3y + 4aλ) dx + (3xλ) dx = 0

Which reduces to these 3 equations:

y2 - 4ax = 0 -----------(1)

3xλ = 0 ---------------------------(2)

2x + 3y + 4aλ = 0 -----------(3)

Solving (1) and (2) together, you get x = 0, y = 0. But plugging them into (3), you get λ = 0 which you sub back in, you get (x=0,y=0) or the solution (y=-4a), which is wrong since it implies x = 4a.
 
Last edited:
  • #3
But come to think about it, is any stationary point along "track" h(x,y) also a stationary point in F(x,y).

I know it may not be the highest mountain (maximum) or the deepest pit (minimum), but we have to agree that if a stationary point exists on "track" h(x,y), it got to exist on terrain F(x,y) right?
 
  • #4
unscientific said:
But come to think about it, is any stationary point along "track" h(x,y) also a stationary point in F(x,y).

I know it may not be the highest mountain (maximum) or the deepest pit (minimum), but we have to agree that if a stationary point exists on "track" h(x,y), it got to exist on terrain F(x,y) right?

Absolutely NOT! in the problem max/min f(x,y), subject to g(x,y) = c, the solution usually is at a NON-stationary point of f(x,y) in the 2-dimensional plane. In fact, f may not have any stationary points at all. For example, max or min (x+y), subject to x^2+y^2=2 has the obvious (and easily found) solutions: max at (1,1), min at (-1,-1). However, f(x,y) = x+y has no stationary points. For another example: min f(x,y) = (x-2)^2+(y-2)^2, subject to g(x,y) = x+y=1. The solution is at (x,y) = (1/2,1/2), but the only stationary point of f is at (2,2).

Along the curve g(x,y) = c we have, at (x0,y0) on the curve, that the "differentials" must satisfy [itex]dg = g_x dx + g_y dy = 0, [/itex] where [itex] g_x = \partial f/\partial x|_{x=x_0,y=y_0}, [/itex] etc. The the "differential" of f is [itex] df = f_x dx + f_y dy.[/itex] If, for example, [itex] g_x \neq 0[/itex] we can solve dg = 0 to find dx: [itex] dx = -g_y/g_x \; dy,[/itex] so along the curve we have [tex] df = f_x dx + f_y dy = \left( f_y - \frac{g_y}{g_x}f_x \right) dy.[/tex] So, we need [tex] f_y - \frac{g_y}{g_x} f_x = 0.[/tex]
Note that if we write [itex] \lambda = f_x/g_x,[/itex] the condition df=0 becomes [itex] f_y - \lambda g_y = 0[/itex]. This is just the Lagrange equation in the y-direction. Note also that the defintion of [itex] \lambda[/itex] can be written as [itex] f_x - \lambda g_x = 0,[/itex] which is the other Lagrange condition!

RGV
 
Last edited:
  • #5
Ray Vickson said:
Absolutely NOT! in the problem max/min f(x,y), subject to g(x,y) = c, the solution usually is at a NON-stationary point of f(x,y) in the 2-dimensional plane. In fact, f may not have any stationary points at all. For example, max or min (x+y), subject to x^2+y^2=2 has the obvious (and easily found) solutions: max at (1,1), min at (-1,-1). However, f(x,y) = x+y has no stationary points. For another example: min f(x,y) = (x-2)^2+(y-2)^2, subject to g(x,y) = x+y=1. The solution is at (x,y) = (1/2,1/2), but the only stationary point of f is at (2,2).

Along the curve g(x,y) = c we have, at (x0,y0) on the curve, that the "differentials" must satisfy [itex]dg = g_x dx + g_y dy = 0, [/itex] where [itex] g_x = \partial f/\partial x|_{x=x_0,y=y_0}, [/itex] etc. The the "differential" of f is [itex] df = f_x dx + f_y dy.[/itex] If, for example, [itex] g_x \neq 0[/itex] we can solve dg = 0 to find dx: [itex] dx = -g_y/g_x \; dy,[/itex] so along the curve we have [tex] df = f_x dx + f_y dy = \left( f_y - \frac{g_y}{g_x}f_x \right) dy.[/tex] So, we need [tex] f_y - \frac{g_y}{g_x} f_x = 0.[/tex]
Note that if we write [itex] \lambda = f_x/g_x,[/itex] the condition df=0 becomes [itex] f_y - \lambda g_y = 0[/itex]. This is just the Lagrange equation in the y-direction. Note also that the defintion of [itex] \lambda[/itex] can be written as [itex] f_x - \lambda g_x = 0,[/itex] which is the other Lagrange condition!

RGV

Thank you for the very thorough exaplanation with examples! However, using the "terrain and track" analogy, it seems rather intuitive to me that any maximum/minimum on g(x,y) must also lie on f(x,y)...

Any way to explain why it's not this case rather than using counter-examples?
 
  • #6
You say " it seems rather intuitive to me that any maximum/minimum on g(x,y) must also lie on f(x,y)...", so I can't tell whether you are trying to maximize or minimize f or g. If your problem is max or min f(x,y), subject to g(x,y) = c, then, of course, we don't look for any max or min of g(x,y)---we are only interested in those points where g(x,y) = c, but if you are trying to maximize or minimize g(x,y), subject to f(x,y) = constant, then I have no idea what you are talking about. I don't know what you could possibly mean when you say "must also lie on f(x,y)". What does "lie on f(x,y)" mean? My previous response assumed (maybe incorrectly) that you thought a constrained optimum must also be a stationary point in the unconstrained problem, and that is not true. I can't figure out what you think IS true.

RGV
 
  • #7
Sorry for the confusion... what I meant was given a question to maximize f(x,y) given the constraint g(x,y) = c.

Treating f(x,y) as a height as a function of x,y for a terrain, won't every point on g(x,y) also be a point in f(x,y)? Given that f(x,y) alone is true for all x and for all y, so any point on g(x,y) be also a point in f(x,y)?

Therefore, any maximum detected in g(x,y) would also be a maximum on f(x,y) given the constraint.

I'm using the "terrain and track" analogy, where f(x,y) is the terrain and g(x,y) = c is a track that only can be taken.

I've attached a picture to help my explanation..
dh6iu.jpg
 
  • #8
unscientific said:
Thank you for the very thorough exaplanation with examples! However, using the "terrain and track" analogy, it seems rather intuitive to me that any maximum/minimum on g(x,y) must also lie on f(x,y)...

Any way to explain why it's not this case rather than using counter-examples?

Then you are thinking wrong. The (x, y, x) 'terrain' defined by z= f(x, y) is NOT the same as the 'terrain' defined by z= g(x,y). Of course, you can take the same x and y, but then z is different. In particular, a max or min for f(x,y) may not be as max or min for g(x,y).
 
  • #9
HallsofIvy said:
Then you are thinking wrong. The (x, y, x) 'terrain' defined by z= f(x, y) is NOT the same as the 'terrain' defined by z= g(x,y). Of course, you can take the same x and y, but then z is different. In particular, a max or min for f(x,y) may not be as max or min for g(x,y).

Yes I understand what you mean; you're saying even though in the terrain of g(x,y) a minimum appears at say (1,1) but this doesn't mean that this is a minimum at f(x,y) as you don't get the complete picture by just looking at g(x,y). Like, it may be possible that just with a small Δx you may find that the height goes deeper.

Secondly, for a constraint of g(x,y) = c, won't this imply that this is simply a flat terrain with a constant height of c?
 
  • #10
unscientific said:
Yes I understand what you mean; you're saying even though in the terrain of g(x,y) a minimum appears at say (1,1) but this doesn't mean that this is a minimum at f(x,y) as you don't get the complete picture by just looking at g(x,y). Like, it may be possible that just with a small Δx you may find that the height goes deeper.

Secondly, for a constraint of g(x,y) = c, won't this imply that this is simply a flat terrain with a constant height of c?

I think I might finally understand what you are attempting to say. Let me give it a try. You have a curve C in the XY-plane; it happens to be defined by an equation of the form g(x,y) = c (so is on a horizontal contour of the surface z= g(x,y)). Think of the curve C as a trail marked on a contour map of a region, but as viewed from above. The point (x,y) on C essentially gives you the latitude and longitude of a point on the trail, but the altitude of that point lies on the surface z = f(x,y). Viewed from above, the curve C might be a straight line (corresponding to a linear function g), but in 3 dimensions the trail is not straight: it goes up and down according to the local height of the land. Maybe that is what you were trying to say (but not in enough detail to be understandable by others), and if so, it is, of course, absolutely true, but not terribly useful: it does not really help you to solve the problem.

RGV
 
  • #11
I think I have better understood the problem now so let me conclude the points:

1. if h = f(x,y) is defined to be the height of a terrain, g(x,y) = c does not imply a flat plane of height c, since g(x,y) ≠ f(x,y) ≠ h.

2. Rather, g(x,y) = c tells you where you can go and where you cannot go by fixing a constraint between x and y, for example x + y = 1 or y2 = 4ax.

3. This sort of determines the "path" or "trail" you can take along f(x,y) which at any point would satisfy the constraint and give h = f(x,y).

4. However, any maxima/minima along the trail does NOT imply that it is a maxima/minima of f(x,y). Think of a global minima in f(x,y) where the trail crosses the sloping-down-side but does not cross the bottom of the minima. It appears as a minima in g(x,y) but we know that it is not "yet" the minima in f(x,y).
 
  • #12
unscientific said:
I think I have better understood the problem now so let me conclude the points:

1. if h = f(x,y) is defined to be the height of a terrain, g(x,y) = c does not imply a flat plane of height c, since g(x,y) ≠ f(x,y) ≠ h.

2. Rather, g(x,y) = c tells you where you can go and where you cannot go by fixing a constraint between x and y, for example x + y = 1 or y2 = 4ax.

3. This sort of determines the "path" or "trail" you can take along f(x,y) which at any point would satisfy the constraint and give h = f(x,y).

4. However, any maxima/minima along the trail does NOT imply that it is a maxima/minima of f(x,y). Think of a global minima in f(x,y) where the trail crosses the sloping-down-side but does not cross the bottom of the minima. It appears as a minima in g(x,y) but we know that it is not "yet" the minima in f(x,y).

No, No , No! Minima, or maxima, or anything else about g(x,y) are 100% irrelevant! The constraint g(x,y) = c only tells you what (x,y) values you are allowed to use. Any minima or maxima of g (if they even exist---which they did NOT in one of my previous examples) have nothing to do with the problem. Forevermore, *forget about maxima or minimal in g*; it is f you need to examine (but of course, the solution may not be a max of f either---but at least looking at f is looking at the correct function).

RGV
 
  • #13
Ray Vickson said:
No, No , No! Minima, or maxima, or anything else about g(x,y) are 100% irrelevant! The constraint g(x,y) = c only tells you what (x,y) values you are allowed to use. Any minima or maxima of g (if they even exist---which they did NOT in one of my previous examples) have nothing to do with the problem. Forevermore, *forget about maxima or minimal in g*; it is f you need to examine (but of course, the solution may not be a max of f either---but at least looking at f is looking at the correct function).

RGV

I see. Is it because by the constraint g(x,y) -c = 0 it only tells you what 'x' and what 'y' you are allowed to use and by that feature any maxima or minima of g(x,y) - c is due to the individual mechanics of g(x,y) - c not related to f(x,y) at all?
 

FAQ: Understanding Stationary Points in Multivariable Functions

What are stationary points of a function?

Stationary points of a function are points on the graph where the slope (or derivative) is equal to zero. This means that the function is neither increasing nor decreasing at that point.

How are stationary points different from critical points?

Stationary points and critical points are often used interchangeably, but there is a slight difference. Critical points are points where the derivative is either zero or undefined, while stationary points are points where the derivative is specifically equal to zero.

How do you find stationary points of a function?

To find stationary points, you must first take the derivative of the function and set it equal to zero. Then, you can solve for the x-values that make the derivative equal to zero. These x-values are the stationary points of the function.

What do stationary points tell us about a function?

Stationary points can give us information about the behavior of a function. For example, a stationary point can tell us if the function has a local maximum, a local minimum, or a point of inflection.

Can a function have more than one stationary point?

Yes, a function can have multiple stationary points. This can occur when the function has multiple local maxima or minima, or when it has a point of inflection.

Back
Top