Notational clash in index notation?

In summary, "Classical Theory of Particles and Fields" by Boris Kosyakov explains the use of differential coordinates dx^i as a convenient basis of 1-forms, which transform like covectors. The author also discusses the interpretation of dx^i as linear functionals and the use of dx^i in the directional derivative. The line elements dx^i are referred to as 1-forms and the author's point of view is not considered unusual. There is a notational collision between upper/lower-index quantities as vectors/covectors and the use of lower/upper-index quantities as basis vectors for the space of vectors/covectors, which can lead
  • #1
bcrowell
Staff Emeritus
Science Advisor
Insights Author
Gold Member
6,724
431
Classical Theory of Particles and Fields, by Boris Kosyakov, has the following in appendix A:

Elie Cartan proposed to use differential coordinates [itex]dx^i[/itex] as a convenient basis of 1-forms. The differentials [itex]dx^i[/itex] transform like covectors [...] Furthermore, when used in the directional derivative [itex]dx^i \partial F/\partial x^i[/itex], [itex]dx^i[/itex] may be viewed as a linear functional which takes real values on vectors [itex]\partial F/\partial x^i[/itex]. The line elements [itex]dx^i[/itex] are called [...] 1-forms.

(I don't own a copy of the book. This came to my attention through this physics.SE question, and it turned out that this part of the book is accessible through Amazon's peephole.)

This completely mystifies me, since the gradient of a scalar is what I would think of as the prototypical example of a covector, not a vector. The partial derivative operator [itex]\partial_i=\partial/\partial x^i[/itex] has a lower index; it's a covector.

My best guess would be that this is occurring because of the notational collision between the interpretation of upper/lower-index quantities as vectors/covectors and the use of lower/upper-index quantities as basis vectors for the space of vectors/covectors. That is, we often use [itex]\partial_i[/itex] as a basis for the space of vectors, and we then have expressions like [itex]a^i\partial_i[/itex], which require context in order to see that they are vectors expressed using the Einstein summation convention, rather than scalars. Similarly, [itex]dx^i[/itex] can be taken as the covector to [itex]\partial_i[/itex]. However, none of this resolves the craziness (AFAICT) of referring to a gradient as a vector rather than a covector.

In this total derivative, I would give [itex]dx^i[/itex] the context-dependent interpretation of being an infinitesimally small [itex]\Delta x^i[/itex], which is the prototypical example of a vector (not covector). For a total derivative involving a finite change, we would have [itex]\Delta F\approx \Delta x^i \partial F/\partial x^i[/itex]. I don't see what Kosyakov could do here, since [itex]\Delta x^i[/itex] is clearly a vector, not a covector.

Is Kosyakov's point of view unusual? Is there something I'm missing?
 
Last edited:
Physics news on Phys.org
  • #2
bcrowell said:
The partial derivative operator [itex]\partial_i=\partial/\partial x^i[/itex] has a lower index; it's a covector.
It's a tangent vector, not a cotangent vector. So its n-tuple of components (which in this coordinate system is ##(\delta^1_i,\dots,\delta^n_i)##) transforms contravariantly. However, the n-tuple ##(\partial_1,\dots,\partial_n)## transforms covariantly. (The definition of "covariant" is essentially that we use that term when something transforms just like this n-tuple).
 
Last edited:
  • #3
Also, the mathematical "gradient" is a covector - but that's because mathematicians define "gradient" differently from physicky vector calculus. For example, one can find df as a "gradient" in http://sophia.dtp.fmph.uniba.sk/~fecko/referaty/regensburg.pdf (p6) and http://www.damtp.cam.ac.uk/research/gr/members/gibbons/dgnotes3.pdf (just before Eq 3.42). That's why one often reads that the gradient is a covector. The physicist's vector calculus "gradient" is a vector, because it is defined in a space with a metric, which can be used to get the Hodge dual and make the physics vector calculus "gradient".
 
  • #4
bcrowell said:
Classical Theory of Particles and Fields, by Boris Kosyakov, has the following in appendix A:
My best guess would be that this is occurring because of the notational collision between the interpretation of upper/lower-index quantities as vectors/covectors and the use of lower/upper-index quantities as basis vectors for the space of vectors/covectors. That is, we often use [itex]\partial_i[/itex] as a basis for the space of vectors, and we then have expressions like [itex]a^i\partial_i[/itex], which require context in order to see that they are vectors expressed using the Einstein summation convention, rather than scalars. Similarly, [itex]dx^i[/itex] can be taken as the covector to [itex]\partial_i[/itex]. However, none of this resolves the craziness (AFAICT) of referring to a gradient as a vector rather than a covector.

In this total derivative, I would give [itex]dx^i[/itex] the context-dependent interpretation of being an infinitesimally small [itex]\Delta x^i[/itex], which is the prototypical example of a vector (not covector). For a total derivative involving a finite change, we would have [itex]\Delta F\approx \Delta x^i \partial F/\partial x^i[/itex]. I don't see what Kosyakov could do here, since [itex]\Delta x^i[/itex] is clearly a vector, not a covector.

Is Kosyakov's point of view unusual? Is there something I'm missing?

I don't think Kosyakov's point of view is unusual. ##dx^i## has a scalar value, or a scalar range. If ##dx^i## is interpreted as an operator, having a domain and a range , so that ##dx^i## operates on something in the domain and returns a value in the range, then it is a one form, as in this case the range is a scalar value, and the domain is a vector. A map from a vector to a scalar is a covector by definition (well, in my recollection, I should probably look that up but I'm not going to).

MTW also makes a similar point, they also point out that the notation is a bit ambiguous between a pure scalar, and an operator that takes in a vector and outputs a scalar. MTW prefers to use a bold face d for the later case.

The definition of a vector is pretty universally taken to be the derivative operators, which makes ##\partial_i## a vector, and ##dx^i## with the appropriate definition a map from a vector to a scalar or a covector.

As far as the upper-lower index notation goes, I've always found it very confusing in some particular cases such as this one, and I'm glad to see I'm not the only one.
 
  • #5
atyy said:
Also, the mathematical "gradient" is a covector - but that's because mathematicians define "gradient" differently from physicky vector calculus. For example, one can find df as a "gradient"
The "physicky vector calculus" gradient ##(\partial_1f,\dots,\partial_nf)## transforms covariantly (because ##(\partial_1,\dots,\partial_n)## does, and f doesn't transform at all). Note that it's not a cotangent vector. It's an n-tuple of tangent vectors associated with each coordinate system, that transforms covariantly. That makes it a "covector" according to the old-fashioned definitions.

##\mathrm df## is a 1-form (a cotangent vector field) that can be thought of as the gradient of f because its n-tuple of components is the "physicky vector calculus" gradient:
$$(\mathrm df)_\mu=(\mathrm df)(\partial_\mu)=\partial_\mu f.$$
 
Last edited:
  • #6
Fredrik said:
The "physicky vector calculus" gradient ##(\partial_1f,\dots,\partial_nf)## transforms covariantly (because ##(\partial_1,\dots,\partial_n)## does, and f doesn't transform at all). Note that it's not a cotangent vector. It's an n-tuple of tangent vectors associated with each coordinate system, that transforms covariantly. That makes it a "covector" according to the old-fashioned definitions.

##\mathrm df## is a 1-form (a cotangent vector field) that can be thought of as the gradient of f because its n-tuple of components is the "physicky vector calculus" gradient:
$$(\mathrm df)_\mu=(\mathrm df)(\partial_\mu)=\partial_\mu f.$$

That isn't the physicist's gradient, because the formula is only correct in Cartesian coordinates. If one switches to cylindrical coordinates, the mathematician's gradient has components given by the partial derivative, but the physicist's gradient picks up a 1/r as in http://mathworld.wolfram.com/CylindricalCoordinates.html (eq 32).

Edit: I spoke too hastily about the Hodge dual in post #3. The Hodge dual is used to get to the vector calculus "div" and "curl". The mathematician's gradient and physicist's gradient are related by sharp and flat isomorphisms eg. http://lamington.wordpress.com/2014/05/26/div-grad-curl-and-all-this/. The notation in Calegari's post is different from the Fecko and Gibbons links in post #3. The angle brackets ##\langle,\rangle## are used by Calegari to an mean inner product which takes two vectors and outputs a number, whereas Fecko and Gibbons use them to mean a form acts on a vector to give a number, or that a form is acted on by a vector to give a number.
 
Last edited:
  • #7
A bit more that I didn't have time to write:

A vector ##a^i## can be expanded out as ##a^1 (e_1) + a^2 (e_2) + ... a^n (e_n)##, where ##a^i## are the coefficients, and ##e_i## are the basis vectors. So the basis vectors really always have a lower index, which is consistent with the notion that partial derivative operators are basis vectors and the notation for how said operators are written, i.e. ##\partial_i##. It is the coefficients of the basis vectors that have the upper index. I also checked another text, when Wald gives a set of basis vectors, he gives it as a subscript, i.e. ##{v_i}##.

The parenthisis above aren't really needed but I find them helpful, sort of like "scare quotes".

I don't see any way to improve the notation, but I still find it confusing.
 
  • #8
I think there are really two types of indices: ##dx^i## can be written as ##\underset{\sim}{dx^i}## or
##(dx^i){}_\mu##, where ##i## is a summation index and ##\mu## is probably best thought of as an abstract index.

The basis could have been ##A_\mu##, ##B_\mu##, ##C_\mu##, ##D_\mu##...
or ##(dx^0)_\mu##, ##(dx^1)_\mu##, ##(dx^2)_\mu##, ##(dx^3)_\mu## so that summation notation can be used.
 

FAQ: Notational clash in index notation?

What is notational clash in index notation?

Notational clash in index notation refers to the issue of using the same index notation for different quantities in a mathematical expression. This can lead to confusion and errors when trying to interpret and manipulate the expression.

Why is notational clash a problem in index notation?

Notational clash can cause confusion and errors in mathematical expressions, making it difficult to accurately represent and manipulate equations. It can also make it challenging for others to understand and reproduce your work.

How can notational clash be avoided in index notation?

To avoid notational clash in index notation, it is important to use clear and consistent notation throughout the expression. This includes using different indices for different quantities, using subscripts and superscripts appropriately, and clearly defining all variables and symbols used.

What are some common examples of notational clash in index notation?

Some common examples of notational clash in index notation include using the same index for multiple summations or products, using the same index for both a summation and a limit, and using the same index for different components of a vector or matrix.

How can notational clash be resolved in index notation?

If notational clash is identified in index notation, it can be resolved by revising the notation to make it clearer and more consistent. This may involve reassigning indices, using different notation for different quantities, or clarifying the meaning of symbols and variables.

Similar threads

Replies
6
Views
2K
Replies
12
Views
2K
Replies
6
Views
2K
Replies
9
Views
4K
Back
Top