What is the interpretation of dx in calculus?

In summary: You can't add a number to a differential form.In summary, the conversation is discussing the interpretation of dx in calculus and how it relates to infinitesimals and differentials. The idea of promoting infinitesimal quantities to functions of tangent vectors is brought up, along with the concept of using the gradient to construct a new function. The conversation also touches on L'Hospital's rule and the use of differentials in single variable calculus. There is some confusion about the notation and interpretation of dx, but ultimately, the discussion is centered around the use and understanding of differentials in calculus.
  • #36
"Don't panic!" said:
Now, I understand that the notion of an infinitesimally small number is nonsense, as if a number is infinitesimally close to zero then it should be equal to zero according to the definition of the limit, but I'm struggling to understand what [itex] dx^{i} [/itex] means intuitively in this new (more rigorous approach), and also what the intuition behind the definition [itex] df(v) =v(f) [/itex]?

[itex] dx^{i} [/itex] is the differential of the coordinate function [itex] x^{i} [/itex]. It is no different than the differential of any function.

The differential of a function,df, is a linear function defined on vectors. The value of df on a vector v is df(v). One can also think of vectors as operators on functions. The value of v on a function is called v(f) and is evaluated as df(v). Operators on functions can be defined abstractly. One can show that if an operator satisfies certain conditions it is in fact a tangent vector.
 
Physics news on Phys.org
  • #37
lavinia said:
dxi dx^{i} is the differential of the coordinate function xi x^{i} . It is no different than the differential of any function.

I'm confused about one should interpret a differential now though (especially after reading that paragraph given in Spivak's book). I'm confused as to whether one can interpret it as an infinitesimal change in a function anymore, or whether now [itex] df[/itex] denotes a new function which encodes how [itex] f[/itex] changes locally. As [itex] df_{p} [/itex] is defined at a given point then we can interpret it as describing an infinitesimal change in [itex] f[/itex] along all possible directions passing through that point. As such infinitesimal changes are induced by tangent vectors at that point, [itex] df[/itex] must be a function of tangent vectors?!

Should [itex] df(v) =v(f) [/itex] be interpreted as a differential change in [itex] f[/itex] along [itex] v[/itex] at a point, which is equal to evaluating the directional derivative of [itex] f[/itex] along [itex] v[/itex] (although how is it possible to equate a differential to a derivative)?
 
Last edited:
  • #38
"Don't panic!" said:
I'm confused about one should interpret a differential now though (especially after reading that paragraph given in Spivak's book). I'm confused as to whether one can interpret it as an infinitesimal change in a function anymore, or whether now [itex] df[/itex] denotes a new function which encodes how [itex] f[/itex] changes locally. As [itex] df_{p} [/itex] is defined at a given point then we can interpret it as describing an infinitesimal change in [itex] f[/itex] along all possible directions passing through that point. As such infinitesimal changes are induced by tangent vectors at that point, [itex] df[/itex] must be a function of tangent vectors?!

As has been explained, today in mathematics, df is considered to be a function on tangent vectors.

Also as as been explained small increments in f along a curve with velocity vector,v, satisfy the equation

Δf = Δtdf(v) + small error term that goes to zero faster than the square of Δt.

If Δt is chosen small enough so that the error term is negligible -e.g. too small to be measured -then Δf is often written as df and is called an infinitesimal increment and Δt is written as dt. In this case one could write df = df(v)dt. Notice that the notation is being used in two ways. I think you are confusing the two. The notation df and dt is common in Physics books.

I don't know the history but I think that df was called an infinitesimal change in f because it represented an increment that could only be perfectly measured in the limit of no increment at all. I guess that Leibniz and others thought of this as an infinitesimal increment. Maybe they thought of df(v) as something real. Today we reformulate this idea by saying that v is a tangent vector and that df is a cotangent vector. Just as the infinitesimal increment of Leibniz could only be inferred from measurement and never directly observed, so tangent and cotangent vectors can not be directly observed since they exist in a separate space, the tangent space of the manifold.

I suppose one could use language that says that df is the infinitesimal tendency of f to change because it exists on its own ( as a cotangent vector in modern terms) but is not realized as a finite increment until f is evaluated along a curve.
 
Last edited:
  • #39
lavinia said:
If Δt is chosen small enough so that the error term is negligible -e.g. too small to be measured -then Δf is often written as df and is called an infinitesimal increment and Δt is written as dt. In this case one writes df = df(v)dt. Notice that the notation is being used in two ways. I think you are confusing the two

But isn't this appealing to the notion of an infinitesimal quantity again (by letting [itex] \Delta t[/itex] become really small)? Is the point that [itex] df[/itex] represents the first order change in a function near a point, and therefore, as when we are near that point this can exactly describe the change in the function, we can consider the linear change in [itex] f[/itex] as the infinitesimal change in [itex] f[/itex] at that point?!

Is Spivak basically saying that we can no longer consider infinitesimal changes in functions directly, but can do so indirectly by considering them as functions of infinitesimal changes (i. e. tangent vectors) along all possible directions at a particular point. In doing so the differentials of the coordinate functions [itex] dx^{i} [/itex] themselves become functions of tangent vectors, describing how the coordinate maps [itex] x^{i} [/itex] change along all possible directions at that point?!
 
  • #40
Is Spivak basically saying that we can no longer consider infinitesimal changes in functions directly, but can do so indirectly by considering them as functions of infinitesimal changes (i. e. tangent vectors) along all possible directions at a particular point. In doing so the differentials of the coordinate functions [itex] dx^{i} [/itex] themselves become functions of tangent vectors, describing how the coordinate maps [itex] x^{i} [/itex] change along all possible directions at that point?!

yes.

Along any curve one can ask what the coordinates are at any time. Thus one can ask what the change in coordinates is over a small time interval. There is no difference between this and asking what the change in any measurement is over a small time interval - e.g. the change in gravitational potential or the change in altitude - whatever.
 
  • #41
"Don't panic!" said:
But isn't this appealing to the notion of an infinitesimal quantity again (by letting [itex] \Delta t[/itex] become really small)? Is the point that [itex] df[/itex] represents the first order change in a function near a point, and therefore, as when we are near that point this can exactly describe the change in the function, we can consider the linear change in [itex] f[/itex] as the infinitesimal change in [itex] f[/itex] at that point?!

In Physics books I have seen this. In this way of looking at things,df and dx are small increments- so small that the error term above the first order approximation can be ignored. I am not sure that Leibniz thought of it this way. I have not read his works but I do know that he was a philosopher and may have thought of df as an infinitesimal increment in a metaphysical way. No idea. I have read that non-standard analysis models his idea of infinitesimal but I know nothing about it.

We often talk of an object as having a speed. We say " The car was going 50 miles per hour when it crashed into the truck." This contains the idea that at any point in time the car actually has something called a speed. But what is this speed really? How can something have a speed in a instant of time? Don't we need to watch it move over a small time interval and calculate its average speed? In Mathematics, the answer is the the speed is a tangent vector - not directly observable because it doesn't exist in the manifold but exists in a separate space,in the tangent space.
 
Last edited:
  • #42
Thanks for your help (and patience). I think I'm starting to understand it a bit better.
 
  • #43
"Don't panic!" said:
Thanks for your help (and patience). I think I'm starting to understand it a bit better.

No Problem. It is good to struggle with these difficult ideas.
 
  • #44
So, is the reason [itex]df[/itex] is called the differential of [itex]f[/itex] because it describes the linear change in [itex]f[/itex] near a point that has been obtained by differentiating [itex] f[/itex] at that point along a given vector, and thus measuring the local change in [itex]f[/itex] along that vector at that point. We can use this to approximate [itex]f[/itex] near this point as [itex]f(x)\simeq f(p)+d_{p}f(v)[/itex]. As we get "nearer" to the point [itex]p[/itex] this approximation becomes more and more exact?!
Would it be correct to say that [itex]dx^{i}_{p}(v)[/itex] represents a finite quantity, which is the component of the tangent vector [itex]v[/itex] along the direction [itex]\frac{\partial}{\partial x^{i}}[/itex] (i.e. the amount of change that occurs in the coordinate map [itex]x^{i}[/itex] along [itex]v[/itex] at the point [itex]p[/itex])?!
Is the reason why such quantities can be considered differential changes in the corresponding function because that describe how it is changing at a specific point and not over some region (or interval)?Restricting to one-dimension for simplicity, is the function [itex]df[/itex] the change in the function describing the tangent line to the function [itex]f[/itex] at a given point; thus the differential is a function describing the change along the tangent line to [itex]f[/itex] at a given point (as we can no longer consider infinitesimals, the closest we can come to this notion is to describe the change along a particular direction at a given point, i.e. along the tangent line at that point)? Thus, locally to this point we can obtain a good linear approximation of [itex]f[/itex] by using this change along the tangent line to [itex]f[/itex].
 
Last edited:
  • #45
In one dimension df(∂/∂x) = df/dx. It is the ordinary derivative. It is the rate of change of f with respect to x.

When you evaluate df on a vector it is a rate of change - just like in one variable calculus.

An increment in f as a function along a curve, is close to Δtdf(v) for small Δt and Δf/Δt is close to df(v) so df(v) is just the limit of Δf/Δt as Δt goes to zero.
That is the whole story.

But all of this has already been explained.
 
Last edited:
  • #46
lavinia said:
But all of this has already been explained.

Yes, sorry I was trying to summarize what I'd gleamed from the discussion and to see if I'd understood it correctly. I think the block I'm struggling to get around in my brain, is that (in the past) I've always associated [itex]df[/itex] as describing an actual change in the function, but in differential geometry it seems to be that the derivative (rate of change of [itex]f[/itex]) are merged into the same concept?!

"Don't panic!" said:
Restricting to one-dimension for simplicity, is the function dfdf the change in the function describing the tangent line to the function ff at a given point; thus the differential is a function describing the change along the tangent line to ff at a given point (as we can no longer consider infinitesimals, the closest we can come to this notion is to describe the change along a particular direction at a given point, i.e. along the tangent line at that point)? Thus, locally to this point we can obtain a good linear approximation of ff by using this change along the tangent line to ff.

In this paragraph I was trying to justify in my mind (at least intuitively) why we still refer to [itex]df[/itex] as the differential of [itex]f[/itex] (which I've, incorrectly, always associated with an infinitesimal quantity) despite it being a finite quantity, and how it can still capture the notion of a change in [itex]f[/itex]?!

Referring to Spivak's comments on the subject of infinitesimal changes in functions, is the point that we can no longer consider infinitesimals, however we can utilize the notion of rates of change in given directions (tangent vectors) to capture the notion of an infinitesimal change in a function at a point. We do this by noting that such a change should depend on which direction we consider and rate of change in that direction, i.e. it should depend on the tangent vectors at that point. Thus, [itex]df[/itex] maps tangent vectors at a point to real numbers which describe how [itex]f[/itex] is changing in a given direction at that point. Would this be a correct intuition at all?
 
  • #47
Yet another take : When f is differentiable at x df(x) is the change of the function f along the linear approximation to f (tangent line, plane, etc.), which is as good as you want ( in a rigorous ## \delta - \epsilon## sense when f is differentiable). So, e.g., for ##f(x)=x^2 ## , 2xdx gives you a(n) local approximation to the actual change of x^2 at ##x= x_0## (change along the tangent line) between two values of x close to each other. For any choice of ## \epsilon## you can find an interval ## (x -\delta, x+\delta) ## where ## | (x^2-x_0^2)-(2x(x-x_0))| < \epsilon ## for ##|x-x_0| < \delta ; \delta >0 ## .
 
  • #48
So is the point that [itex]dx[/itex] is a finite change along the tangent line to a function [itex]f[/itex] (at a given point [itex]x_{0}[/itex]) and this [itex]f(x_{0})+f'(x_{0})dx[/itex] is the equation of the tangent line to the function [itex]f[/itex] at [itex]x=x_{0}[/itex]. We can then use an [itex]\epsilon -\delta[/itex] limit approach to make this approximation approach the exact change in [itex]f[/itex] at [itex]x=x_{0}[/itex], such that [itex]\lim_{x\rightarrow x_{0}}\left[(f(x)-f(x_{0}))-f'(x_{0})(x-x_{0})\right]=0[/itex] (where [itex]dx=x-x_{0}[/itex]). In other words if we construct a linearised function at a point that approximates [itex]f[/itex] locally, then [itex]df[/itex] is a change in this linear function. As such a linearised function is obtained through computing the derivative of [itex]f[/itex] at a given point, we thus call it the differential of [itex]f[/itex]?!

Also, is any of what I put in my previous post correct?

(Sorry to bug everyone with this; I feel like I'm going round in circles at the moment :-( )
 
Last edited:
  • #49
Yes,df is finite I finite since you are dealing with continuous functions ( differentiable implies continuous) and it is in a sense the best linear approximation to the change of f locally, and the linearization is done through the derivative/gradient. Same applies to higher dimensions, e.g., you can approximate the change of the function locally in the same sense by studying the change of f at (x,y) along the tangent plane at (x,y). .
 
Last edited:

Similar threads

Replies
10
Views
2K
Replies
36
Views
2K
Replies
21
Views
3K
Replies
10
Views
2K
Replies
3
Views
2K
Replies
11
Views
3K
Back
Top