Momentum operator in curvilinear coordinates

In summary, the conversation discusses a paper about the momentum operator in curvilinear coordinates. The author argues that the commonly used form of the momentum operator is only applicable to Cartesian coordinates. They then try to find expressions for the momentum operator in curvilinear coordinates, using the uncertainty principle as a starting point. However, their approach is flawed and their initial claim is incorrect. The conversation also touches on the units of the commutator in this context and the idea of uncertainty in angular variables. Ultimately, the conversation concludes that the paper is not worth reading and that the transformation of the momentum operator to other coordinate systems has been extensively studied and proven by mathematical physicists.
  • #71
Post#2:

I see a simple geometry problem: In R^3 (realistic QM models always assume the freedom of infinite motion and also infinite time evolution), one's free to use any coordinates he likes to describe the infinite motion. There's an impressive list displayed in one of the classics: the 2 volume book of Morse and Feshbach.

Then what does the freedom to reparametrize R^3 mean for QM? Essentially, a chain of isomorphisms of Hilbert spaces. If R^3 is described in Cartesian coordinates, one has a simple identification of coordinates and (canonical) momenta via Newtonian/Hamiltonian dynamics plus Dirac's "canonical quantization". One obtains the Born-Jordan CCRs and their representation on L^2(R^3). But if R^3 is described in spherical coordinates (which are almost mandatory for a neat resolution of the quantum dynamics of the hydrogen atom, for example), then the classical Hamiltonian momenta p_r, p_theta and p_phi don't get a Dirac "canonical quantization" (as you may have discovered in Post#1). We say that the 3 components of spherical momenta are not quantum observables, because there's no (essentially) self-adjoint operator to describe them, whether the particle is in a potential field or free. The Hilbert space isomorphic to L^2(R^3) would then be L^2((0,∞), r^2 dr) ⊗ L^2(S^2), and the only sensible observable in this space is necessarily the Hamiltonian, which, thanks to the nice work by T. Kato pioneered by F. Rellich, is always shown to be self-adjoint. Then you move to cylindrical coordinates, then to parabolic ones, then to ellipsoidal ones, etc., to check whether the triplet of quantum (canonical) momenta are observables or not. Fortunately, the Hamiltonian is self-adjoint every time (its spectral equation will always be reduced to a Sturm-Liouville ODE). This is actually cool, because the (time-independent) Hamiltonian through its spectral equation always provides us with a complete system of (generalized) states of feasible values of energy, but not necessarily of momentum. But, despite the considerations I outlined, a true description of quantization should be done from a pure differential-geometric treatment of the classical dynamics (-> Arnold, -> Marsden) which is to be quantized: I refer to what's known as geometric quantization, which originated within the context of the Groenewold-van Hove no-go theorem, then with the work of Moyal, and reached maturity through the works of Souriau and Kostant.
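
A minimal sketch of why the radial momentum fails to be an observable, with ##u## and ##\kappa## as notation introduced here rather than taken from the post: the symmetrized radial momentum is ##p_r = -i\hbar \frac{1}{r}\partial_r r## on ##L^2((0,\infty), r^2 dr)##. Substituting ##u(r) = r\psi(r)## maps this space unitarily onto ##L^2((0,\infty), dr)## and turns ##p_r## into ##-i\hbar\, d/dr## on the half-line, which is symmetric on functions with ##u(0)=0##. For a constant ##\kappa > 0##, the deficiency equations then read
$$-i\hbar\, u_\pm' = \pm i\hbar\kappa\, u_\pm \quad\Rightarrow\quad u_\pm(r) = e^{\mp\kappa r}.$$
Only ##u_+ = e^{-\kappa r}## is square-integrable on ##(0,\infty)##, so the two deficiency indices are unequal (1 and 0) and, by von Neumann's theorem, no self-adjoint extension exists.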

On the other hand, and here I'm trying to build a connection with what vH71 said several times, the representation theory of the Galilei group or the (restricted) Poincaré group which he mentions always considers the Galilean space-time or the Minkowski space-time as being in a Cartesian spatial representation/parametrization. That is, our classical and quantum fields are always functions of x,y,z,t, not of r, phi, z for example. I can only suspect/guess that, if one goes to GR with its simplest space-time (the Schwarzschild one), one should be forced to use a spherical (or other type of) "spatial" parametrization of the "spatial" 3D submanifold, so that the Weyl-Wigner-Bargmann-type representation theory (which one would like to carry over from QM and QFT into a QFT on a curved space-time) would necessarily consider (projective) representations of the symmetry groups of this GR space-time in which a Cartesian parametrization of the "spatial" 3D submanifold would no longer be possible. Would this be doable? I don't know... A somewhat related problem, I guess, is trying to put a "classical" Dirac spinor in a curved space-time, which is possible (up to technicalities as described by Wald in his famous chapter 13) only because a physical curved space-time is locally flat; thus the known R^3 and its particular aforementioned Cartesian parametrization "creeps" into the "curvy" picture precisely to accommodate the Dirac spinors.
 
  • #72
Dex, I'm very interested to explore this stuff, but maybe it should go in a new thread (referencing this one)?

Anyway, the crucial difference between the Souriau approach and the usual approach is that (iiuc) Souriau concentrates on quantization applied to the symmetries of the solution space of a particular (classical) equation of motion. The background space (i.e., phase space) is just a mathematical artifact. What matters is the solution subspace thereof, and hence the symmetries of that subspace. The background coordinate system is, in the end, unphysical.
 
  • #73
I had to look up the Groenewold-van Hove theorem ;-). That's very interesting and easy to prove. It only shows once more, on a more formal level, that what's called "canonical quantization" is at best a heuristic tool. As I said before, the observable algebra should be determined on physics grounds, and the most elegant and direct way is via symmetry considerations as, e.g., worked out in Ballentine's and Weinberg's (the latter of course also for relativistic QFT) textbooks. I don't know about "geometric quantization". Do you have a good review article on this approach? Of course, the symmetry approach is also geometric in some sense. This idea goes back to Riemann and, most prominently, Felix Klein in the 19th century, who investigated the symmetry properties of geometries and the possibility of reconstructing geometries from, and classifying them according to, their symmetry groups. This culminated in a "hype" about what was then called "the theory of invariants", and from a theoretical-physics point of view this was one of the most fruitful developments in mathematics since the discovery of calculus (Noether's theorem is one highlight; the use of group theory in QT by Weyl and van der Waerden from the mathematical side, and by Wigner, Bargmann et al. from the physical side, are others).

Concerning quantum theory in curved general-relativistic spacetime, be warned! It's very complicated and not fully understood as far as I know, and I'm not an expert on it. This holds not only for the quantization of the gravitational interaction, which has been very puzzling for decades now, but even for the simpler problem of doing relativistic QFT in a general-relativistic background spacetime (i.e., spacetime stays "unquantized" as in non-relativistic and special-relativistic QT). There is quite some literature about this topic. The simplest spacetimes are not of the Schwarzschild type (I don't know whether there exists something about QFT with a Schwarzschild background spacetime) but the most symmetric Robertson-Walker-Friedmann-Lemaitre spacetimes. A good source on this is the famous review article by B. DeWitt:

DeWitt, Bryce S.: Quantum field theory in curved spacetime, Phys. Rept. 19, 295–357, 1975
http://dx.doi.org/10.1016/0370-1573(75)90051-4

I'm sure there are newer papers on the subject but, as I said, I'm not an expert in these issues.
 
  • #74
There is something (actually, quite a few things) that I don't understand about the business of operators being self-adjoint. In curved space (consider just two dimensions, for simplicity), what I would think would be the position-space representation of the inner product of two wave functions [itex]\psi[/itex] and [itex]\phi[/itex] is:

[itex]\langle \psi | \phi \rangle = \int \psi^* \phi \sqrt{g} dx dy[/itex]

where [itex]g[/itex] is the determinant of the metric tensor. In this case, we have, with [itex]p_x = -i \hbar \partial_x[/itex],

[itex]\langle p_x \psi | \phi \rangle = \langle \psi | {P}_x \phi \rangle + ST[/itex]

where [itex]{P}_x[/itex] is the operator defined by [itex]{P}_x f = -i \hbar \frac{1}{\sqrt{g}} \partial_x (\sqrt{g} f) = (p_x - i \hbar \frac{1}{\sqrt{g}} \partial_x \sqrt{g}) f[/itex], and where [itex]ST[/itex] is the "surface term": [itex]ST = i \hbar \int \partial_x (\sqrt{g}\, \psi^* \phi) \, dx \, dy[/itex]

So [itex]p_x[/itex] is only symmetric if [itex]ST = 0[/itex] and [itex]g[/itex] is constant.

(Note: An identity that can be used is [itex]\frac{1}{\sqrt g} \partial_x \sqrt{g} = \Gamma^i_{i x}[/itex], where the [itex]\Gamma[/itex] are the connection coefficients (implicit summation over the dummy index [itex]i[/itex]). So the operator [itex]P_x[/itex] can actually be written in the form [itex]P_x f = (p_x - i \hbar \Gamma^i_{i x}) f[/itex], which seems like a sort of covariant derivative, except that since [itex]f[/itex] is a scalar, there's no difference between partial derivatives and covariant derivatives.)
So if an operator being symmetric is a necessary (but maybe not sufficient) condition for being an observable, then the usual momentum operator isn't an observable in curved space. What does that mean?
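
As a concrete sketch of the formulas above, assume the unit 2-sphere with coordinates ##(\theta,\varphi)## (an example chosen here, not part of the post), for which ##\sqrt{g} = \sin\theta##. Then
$$P_\theta f = -i\hbar \frac{1}{\sin\theta}\partial_\theta(\sin\theta\, f) = -i\hbar\left(\partial_\theta + \cot\theta\right) f, \qquad \Gamma^i_{i\theta} = \cot\theta,$$
so ##p_\theta = -i\hbar\partial_\theta## is not symmetric. The average of the two is formally symmetric whenever the surface term vanishes:
$$\tfrac{1}{2}\left(p_\theta + P_\theta\right) = -i\hbar\left(\partial_\theta + \tfrac{1}{2}\cot\theta\right).$$
In the ##\varphi## direction ##\sqrt{g}## does not depend on ##\varphi##, so ##P_\varphi = p_\varphi = -i\hbar\partial_\varphi## and no correction term appears.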
 
  • #75
stevendaryl said:
[itex]\langle \psi | \phi \rangle = \int \psi^* \phi \sqrt{g} dx dy[/itex]

where [itex]g[/itex] is the determinant of the metric tensor. In this case, we have, with [itex]p_x = -i \hbar \partial_x[/itex],
In the earlier spherical-polar conundrum, the resolution involved recognizing that the radial momentum should be written (classically) as something like ##\hat e_r \cdot \nabla##, and then symmetrized.
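
A short sketch of that symmetrization in ##R^3##, assuming ##\vec p = -i\hbar\nabla## and using ##\nabla \cdot \hat e_r = 2/r## (the explicit steps are filled in here, not quoted from the earlier posts):
$$\tfrac{1}{2}\left(\hat e_r \cdot \vec p + \vec p \cdot \hat e_r\right)\psi = -i\hbar\left[\partial_r \psi + \tfrac{1}{2}\left(\nabla \cdot \hat e_r\right)\psi\right] = -i\hbar\left(\partial_r + \frac{1}{r}\right)\psi = -i\hbar\,\frac{1}{r}\,\partial_r\left(r\psi\right).$$
The extra ##1/r## term is exactly what makes the operator formally symmetric with respect to the measure ##r^2 dr##.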

In your case, maybe one should start from ##\hat e_x(z) \cdot \nabla \, \Big|_z##, where ##z## denotes a point on the manifold ##M##, and the ##\hat e_x(z)## is a unit vector in the tangent space at ##z##, i.e., ##TM_z##. So when one moves from point to point on ##M##, one must also adjust ##\hat e_x## accordingly, and one must work in the 2nd tangent space ##TTM##, iiuc. (In high-falutin' language, one must choose a ``horizontal/vertical'' decomposition of ##TTM## which is independent of local coordinates -- which is essentially what's happening here, afaict.)

But I could be wrong. :oldwink:
 
  • #76
The point is to formulate your observables in a manifestly covariant way. In non-relativistic quantum theory in the position representation ("wave mechanics") this boils down to expressing everything in terms of the classical differential operators grad, curl, and div or in terms of the nabla calculus. This includes constructs like ##-\mathrm{i} \vec{r} \times \vec{\nabla}## for orbital angular momentum. Then there is no problem with writing down the Hamiltonian in terms of the operator algebra. Sometimes operator-ordering problems occur. I think one then has to more or less guess the right form of the Hamiltonian, relying on more or less handwaving concepts like Weyl ordering. In QFT, normal ordering often helps to avoid some complications with renormalization parts of Feynman diagrams, but at the same time it can be problematic concerning, e.g., gauge symmetries. The symmetries are the most helpful properties in finding first the operator algebra and then the correct Hamiltonian for a given problem. Last but not least, one should always be aware that the final judgement about theories and models will always be experiment and observation. I think there is no generally valid scheme to guess the Hamiltonian from the classical analogue of a given problem, and sometimes there's no classical analogue at hand (e.g., Bose-Einstein condensation, superconductivity and superfluidity, etc.).
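
The simplest instance of such an ordering ambiguity, as an illustrative sketch not taken from the post, is the classical product ##xp##. With ##[\hat x, \hat p] = i\hbar##, the naive operator ##\hat x \hat p## is not Hermitian, while the Weyl-ordered (symmetrized) version is:
$$(\hat x \hat p)^\dagger = \hat p \hat x = \hat x \hat p - i\hbar, \qquad \tfrac{1}{2}\left(\hat x \hat p + \hat p \hat x\right) = \hat x \hat p - \frac{i\hbar}{2}.$$
The two candidates differ by a term of order ##\hbar##, which is why the classical expression alone cannot fix the quantum operator.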
 
  • #77
stevendaryl said:
There is something (actually, quite a few things) that I don't understand about the business of operators being self-adjoint. In curved space (consider just two dimensions, for simplicity), what I would think would be the position-space representation of the inner product of two wave functions [itex]\psi[/itex] and [itex]\phi[/itex] is:

[itex]\langle \psi | \phi \rangle = \int \psi^* \phi \sqrt{g} dx dy[/itex]

where [itex]g[/itex] is the determinant of the metric tensor. In this case, we have, with [itex]p_x = -i \hbar \partial_x[/itex],

[itex]\langle p_x \psi | \phi \rangle = \langle \psi | {P}_x \phi \rangle + ST[/itex]

where [itex]{P}_x[/itex] is the operator defined by [itex]{P}_x f = -i \hbar \frac{1}{\sqrt{g}} \partial_x (\sqrt{g} f) = (p_x - i \hbar \frac{1}{\sqrt{g}} \partial_x \sqrt{g}) f[/itex], and where [itex]ST[/itex] is the "surface term": [itex]ST = i \hbar \int \partial_x (\sqrt{g}\, \psi^* \phi) \, dx \, dy[/itex]

So [itex]p_x[/itex] is only symmetric if [itex]ST = 0[/itex] and [itex]g[/itex] is constant.

(Note: An identity that can be used is [itex]\frac{1}{\sqrt g} \partial_x \sqrt{g} = \Gamma^i_{i x}[/itex], where the [itex]\Gamma[/itex] are the connection coefficients (implicit summation over the dummy index [itex]i[/itex]). So the operator [itex]P_x[/itex] can actually be written in the form [itex]P_x f = (p_x - i \hbar \Gamma^i_{i x}) f[/itex], which seems like a sort of covariant derivative, except that since [itex]f[/itex] is a scalar, there's no difference between partial derivatives and covariant derivatives.)
So if an operator being symmetric is a necessary (but maybe not sufficient) condition for being an observable, then the usual momentum operator isn't an observable in curved space. What does that mean?

vanhees71 said:
The point is to formulate your observables in a manifestly covariant way. In non-relativistic quantum theory in the position representation ("wave mechanics") this boils down to express everything in terms of the classical differential operators grad, curl, and div or in terms of the nabla calculus.

This is how I think of this (let me know if it is not correct). Whether expressing operators in curvilinear coordinates or in curved space, one must keep them linear in QM by fiat. So for operators acting on scalar functions, like the Hamiltonian or kinetic energy, one can use for example the linear Laplace-Beltrami operator in curved spaces. But for vectorial operators like momentum, QM's ban on nonlinear operators leaves open only the option vanhees refers to, expressing "everything in terms of the classical differential operators grad, curl, and div", with no curved covariant derivative as in stevendaryl's example.
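
For reference, a sketch of the Laplace-Beltrami operator mentioned above, in standard notation (the inverse metric ##g^{ij}## and the unit-sphere example are supplied here, not in the post):
$$\Delta_g f = \frac{1}{\sqrt{g}}\,\partial_i\left(\sqrt{g}\, g^{ij}\, \partial_j f\right), \qquad \Delta_{S^2} f = \frac{1}{\sin\theta}\,\partial_\theta\left(\sin\theta\, \partial_\theta f\right) + \frac{1}{\sin^2\theta}\,\partial_\varphi^2 f.$$
It is linear in ##f##, and integrating by parts against the measure ##\sqrt{g}\, d^n x## gives, up to surface terms,
$$\langle \psi | \Delta_g \phi \rangle = -\int g^{ij}\, \partial_i \psi^*\, \partial_j \phi\, \sqrt{g}\, d^n x = \langle \Delta_g \psi | \phi \rangle,$$
so it is formally symmetric without any extra correction term.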
 
  • #78
This I don't understand. The operator ##\vec{\nabla}## is linear by definition. In, e.g., spherical coordinates, it's
$$\vec{\nabla} \psi=\vec{e}_r \partial_r \psi+\frac{\vec{e}_{\vartheta}}{r} \partial_{\vartheta} \psi + \frac{\vec{e}_{\varphi}}{r \sin \vartheta} \partial_{\varphi} \psi.$$
 
  • #79
vanhees71 said:
This I don't understand. The operator ##\vec{\nabla}## is linear by definition. In, e.g., spherical coordinates, it's
$$\vec{\nabla} \psi=\vec{e}_r \partial_r \psi+\frac{\vec{e}_{\vartheta}}{r} \partial_{\vartheta} \psi + \frac{\vec{e}_{\varphi}}{r \sin \vartheta} \partial_{\varphi} \psi.$$
Sure, that was my point too. I was trying to address stevendaryl's question: "then the usual momentum operator isn't an observable in curved space. What does that mean?"
So you just wrote the gradient in spherical coordinates, of course with a non-coordinate (nonholonomic) basis, since the normalized basis vectors of spherical coordinates are orthonormal, as you also pointed out in a previous post, and the basis dependence on position is canceled out; but you cannot do that for true spatial curvature, as stevendaryl noticed. So I'm basically repeating what all of you have already said in this thread: one cannot single out the radial component and declare it an operator on its own. For one thing, the non-coordinate basis already makes that unviable, before getting into most of the considerations raised in the thread about self-adjointness, hermiticity, etc. (which are all correct, of course). If one tries to use a coordinate basis, one must introduce nonlinearity in the operator, which is not possible in QM by definition.
Then I made a distinction between the vector-valued gradient operator used for ## \hat p ## and the scalar-valued Laplacian operator used for ## \hat H ##. In the latter case the component-operator issue doesn't come up, of course, and we don't have any problem applying it to any manifold: its generalization (the Laplace-Beltrami operator) maintains linearity when applied to curved spaces, as for instance in spherical-harmonics problems constrained to ## S^2 ##. So it seems it all comes down to the vectorial nature of the momentum operator, and that the authors of the paper linked in the OP ignored the basis dependence intrinsic to QM dynamics introduced by the Planck constant h.
 
  • #80
I guess, here is some confusion. Of course, the momentum operator in Euclidean space is always the same, no matter in which curvilinear coordinates you express the operator ##\vec{\nabla}##.
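
As a small illustration (a sketch added here, using ##\vec{e}_z \cdot \vec{e}_r = \cos\vartheta##, ##\vec{e}_z \cdot \vec{e}_{\vartheta} = -\sin\vartheta## and ##\vec{e}_z \cdot \vec{e}_{\varphi} = 0##), the Cartesian component ##\hat{p}_z## written in spherical coordinates is
$$\hat{p}_z = -\mathrm{i}\hbar\, \vec{e}_z \cdot \vec{\nabla} = -\mathrm{i}\hbar \left( \cos\vartheta\, \partial_r - \frac{\sin\vartheta}{r}\, \partial_{\vartheta} \right).$$
This is the same operator as ##-\mathrm{i}\hbar\, \partial_z##; only its coordinate expression has changed.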

Another thing is when you have a curved, i.e., non-Euclidean, space. Then the symmetries of this space tell you whether there is one or more than one momentum operator. There's a momentum operator if the space admits one or several translation(s), and the generator of each translation is a momentum operator.
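
A hedged sketch of how this looks in formulas, with ##\xi^i## denoting a vector field generating a symmetry (notation introduced here): the formal adjoint of ##K_\xi = -\mathrm{i}\hbar\, \xi^i \partial_i## with respect to the measure ##\sqrt{g}\, \mathrm{d}^n x## is
$$K_\xi^\dagger = -\mathrm{i}\hbar \left( \xi^i \partial_i + \nabla_i \xi^i \right), \qquad \nabla_i \xi^i = \frac{1}{\sqrt{g}}\, \partial_i \left( \sqrt{g}\, \xi^i \right).$$
For a Killing vector field, ##\nabla_i \xi_j + \nabla_j \xi_i = 0##, so taking the trace gives ##\nabla_i \xi^i = 0## and ##K_\xi## is formally symmetric. On the unit sphere ##S^2##, for example, ##\xi = \partial_{\varphi}## is a Killing field and yields the familiar ##-\mathrm{i}\hbar\, \partial_{\varphi}##.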
 
  • #81
vanhees71 said:
I guess, here is some confusion.
Can you quote the part you find confused and specify how it is wrong? I'm not saying anything disagreeing with the rest of your post that I'm aware of.
 
  • #82
In the thread it looked as if different things have been mixed up. On the one hand there is the gradient operator in Euclidean space, expressed in curvilinear coordinates. This is a covariant operator and as such it doesn't make a difference whether I describe it in terms of Cartesian or any curvilinear coordinates. It's almost trivial that the momentum operator doesn't depend on the coordinates you use to express it.

On the other hand there are non-Euclidean spaces, where you have to figure out whether there's a momentum operator at all or not. If there is one, it must be possible to express it in terms of any (local) coordinate system, i.e., it's generally (diffeomorphic) covariant.
 
  • #83
vanhees71 said:
In the thread it looked as if different things have been mixed up. On the one hand there is the gradient operator in Euclidean space, expressed in curvilinear coordinates. This is a covariant operator and as such it doesn't make a difference whether I describe it in terms of Cartesian or any curvilinear coordinates. It's almost trivial that the momentum operator doesn't depend on the coordinates you use to express it.
Yes, when using a nonholonomic basis, where e.g. ## \hat p_x## is a bona fide operator, but the key issue is that it does depend on the coordinates when using a coordinate basis.
vanhees71 said:
On the other hand there are non-Euclidean spaces, where you have to figure out whether there's a momentum operator at all or not. If there is one, it must be possible to express it in terms of any (local) coordinate system, i.e., it's generally (diffeomorphic) covariant.
And it seems there is no such momentum operator in those spaces, while there is a Hamiltonian operator. Note that this doesn't arise in curved spacetime, where by virtue of the equivalence principle one can always use orthonormal frames locally.
 