In derivative as linear approximation why does E(h)/h->0 as h->0

In summary: I am asking is not obvious that E(h)/h->0 as h->0. In summary, when learning Multivariable calculus, the concept of derivative is developed as a linear approximation. The equation F(a+h)=F(a)+mh+E(h) is derived, where m represents the derivative and E(h) is the error term. There is a requirement that E(h)/h->0 as h->0, which may not be obvious when discovering the derivative for the first time using only line approximation. This requirement is necessary in order to extend basic single-variable results to multiple dimensions and to ensure the existence of directional derivatives in every direction.
  • #1
Melac12
6
0
When I started learning Multivariable calc first we went back and developed a new notion of derivative as a linear approximation. And what we came up with was

F(a+h)=F(a)+mh+E(h) *

where m is the derivative. Basically the function minus the line tangent to point a. However there is a requirement that E(h)/h->0 as h->0. In the book and my proff say that we want E(h) to go faster to 0 then h, but that's not very mathematical! I asked my proff and he said that we don't have to divide by h it just makes approx better. The reason I am heaving trouble is because if I were to discovering the derivative for the 1st time using line approx (without knowing limit way) the condition for E(h)/h->0 as h->0 would not be obvious. It seems out of no wear. Why can't
E(h)->0 as h->0 on its own without heaving to worry what happens when to E(h)/h as h->0. Could some guide me through the logic for the requirement of E(h)/h->0 as h->0 Thank you.

PS: I know that if we rearrange * then E(h)/h->0 as h->0 works out but I am asking if we just focus in this linear approx way only!
 
Physics news on Phys.org
  • #2
Melac12 said:
When I started learning Multivariable calc first we went back and developed a new notion of derivative as a linear approximation. And what we came up with was

F(a+h)=F(a)+mh+E(h) *

where m is the derivative. Basically the function minus the line tangent to point a. However there is a requirement that E(h)/h->0 as h->0. In the book and my proff say that we want E(h) to go faster to 0 then h, but that's not very mathematical! I asked my proff and he said that we don't have to divide by h it just makes approx better. The reason I am heaving trouble is because if I were to discovering the derivative for the 1st time using line approx (without knowing limit way) the condition for E(h)/h->0 as h->0 would not be obvious. It seems out of no wear. Why can't
E(h)->0 as h->0 on its own without heaving to worry what happens when to E(h)/h as h->0. Could some guide me through the logic for the requirement of E(h)/h->0 as h->0 Thank you.

PS: I know that if we rearrange * then E(h)/h->0 as h->0 works out but I am asking if we just focus in this linear approx way only!
I suppose E(h) is meant to be the error term.

Solve
F(a+h)=F(a)+mh+E(h)​
for
(F(a+h) - F(a))/h​
and take the limit as h → 0 .
 
  • #3
Taking the single variable definition of a derivative as our motivation we have that

[tex] f'(a) = \lim_{h \rightarrow 0} \frac{f(a + h) - f(a)}{h}[/tex]

We can remove the limits from this definition (explicitly) if we re-write the above as

[tex] f'(a) \ = \ \frac{f(a + h) - f(a)}{h} \ - \ e(h) [/tex]

[tex] f'(a)h \ = \ f(a + h) - \ f(a) \ - \ e(h)h[/tex]

[tex] f(a + h) \ = \ f(a) \ + \ f'(a)h \ + \ e(h)h[/tex]

Where [tex] \lim_{h \rightarrow 0} e(h) = 0[/tex]

thus you see the idea of an error term with a limit going to zero is a natural consequence of the classical definition of a derivative. Note the minus beside the error was only included to make things look nice on re-arrangement. Note that we could have called the error term in the last line something else, i.e.


[tex] f(a + h) \ = \ f(a) \ + \ f'(a)h \ + \ \nu (h) [/tex]

so long as we specified that [tex] \lim_{h \rightarrow 0} \frac{\nu (h)}{h} = 0[/tex]

Something we need to do because we want to be able to re-derive the classical form of the derivative as we'd originally defined it.

Now, for the multi-variable case you are following the same principles, you want to be able to re-derive the difference quotient idea so that you can define the directional derivative.

If we defined the derivative as

[tex]f(a + h) = f(a) + f'(a)h + E(h)[/tex]

where E(h) goes to zero as h goes to zero then if you tried to derive the directional derivative using h = tv you'd end up with an error term E(tv)/t which makes no sense as far as I can see, whereas if you mimic the classical theory you naturally end up with the directional derivative as a theorem you can prove. In fact, some books (such as Apostol's Mathematical Analysis) start by defining the derivative as the directional derivative, which is a very very natural extension of the single variable derivative (probably the first thing you'd define yourself if you tried to define derivatives of several variables) & then he shows how this definition leads to the possibility of directional derivatives existing in every direction while the function not being continuous, in other words differentiability implies continuity does not hold in the multivariable case (the reason being the error term E(h) going to zero slower than the h in E(h)/h !) thus if you want to extend other basic single-variable results to multiple dimensions you need to look at derivatives from a different perspective, namely as a linear approximation & the above counter-example needs to be incorporated in whatever you do via the linear approximation methods!
 
  • #4
Ok the explanation is good and I do realize that this linear approx. works really well with the original definition of derivative. But I am asking if we were formulating the derivative for the first time. If we were just discovering the derivative using line approximation, We have our

F(a)=L(a) L(x)=mx+b and since b=F(a)-ma so L(x)=F(a)+m(x-a)

now take the function and minus the line F(x)-L(x)=F(x)-F(a)-m(x-a) make x-a=h

so F(x)-L(x)=F(a+h)-F(a)-m(h) and we say that F(x)-L(x) is a function of h E(h) so

E(h)=F(a+h)-F(a)-m(h) so F(a+h)=F(a)+m(h)+E(h) is the derivative as the line

approximates the function. Wouldn't E(h)->0 as h->0 be enough. what would be my train of thought from here that would logically lead to hey we need to make sure E(h)/h->0 as h->0.
 
  • #5
Ah I see, well if you send h to zero in your second last line:

"E(h)=F(a+h)-F(a)-m(h) so F(a+h)=F(a)+m(h)+E(h) is the derivative as the line"

what you get is:

0 = lim E(h) = lim [F(a+h)-F(a)-m(h)] = lim F(a + h) - lim F(a) - lim m(h) = F(a) - F(a) - m(0) = m(0)

Thus all this sentence amounts to is showing that m(0) = 0, it doesn't give us any information about what m actually is when h goes to zero (i.e. what m does at the point a) so we need a way to explicitly give more information about m at a. Therefore if you divide by h & take a limit (where E(h)/h goes to zero) you have

0 = lim [E(h)/h] = lim [F(a + h) - F(a)]/h - lim m(h)/h = lim [F(a + h) - F(a)]/h - lim m(h)/h = lim [F(a + h) - F(a)]/h - lim m

Thus m = lim [F(a + h) - F(a)]/h

This way we can actually establish what m is at the point a. Obviously if lim [E(h)/h] =/= 0 then you can't end up with m = lim [F(a + h) - F(a)]/h or a uniquely defined derivative. Hopefully that's more helpful!
 
  • #6
I think that's very nice. So your saying that we make the E(h)/h->0 as h->0 rule because it actually helps us see what the derivative has to be. My proff said that its just make approximation even better then if its just E(h)->0 as h->0. I think this is what he meant and extra condition that helps us actually see what the derivative is dong and makes approximation better.
Thank you very much that question bothered, but I think I get it now.
 
  • #7
If we do NOT require that, the "derivative" would not be unique.
 

FAQ: In derivative as linear approximation why does E(h)/h->0 as h->0

Why is the limit of E(h)/h going to 0 as h approaches 0?

As h approaches 0, the value of E(h) also approaches 0. This is because E(h) is a linear approximation of a function, and as h gets smaller, the linear approximation becomes more accurate and approaches the true value of the function. Therefore, the ratio of E(h)/h also approaches 0.

How does the derivative relate to E(h)/h?

E(h)/h is the slope of the linear approximation of a function at a specific point. The derivative, on the other hand, is the slope of the tangent line at that same point. As h approaches 0, the linear approximation becomes more accurate and approaches the true tangent line, which is why the limit of E(h)/h goes to 0.

Is E(h)/h the same as the derivative?

No, E(h)/h is not the same as the derivative. The derivative is the exact slope of the tangent line at a specific point, while E(h)/h is an approximation of that slope using a linear function. However, as h approaches 0, the linear approximation becomes more accurate and approaches the true derivative.

How is E(h)/h used in calculus?

E(h)/h is used in calculus to approximate the derivative of a function at a specific point. This can be helpful in situations where the derivative cannot be easily calculated or when a more simple and approximate solution is needed.

Can E(h)/h be used for any function?

No, E(h)/h can only be used for differentiable functions, meaning functions that have a well-defined derivative at a specific point. It is important to note that not all functions are differentiable, and in those cases, E(h)/h cannot be used as a linear approximation.

Back
Top