Why are Sturm-Louville Functions Orthogonal Relative to a Weight Function?

member 428835 · Aug 13, 2014

hi pf!

ok, so my math text for PDE's states the following theorem: $$f(x) = \sum_n a_n \phi_n (x)$$ for "nice enough" functions. however, the next theorem states that ##\phi_n (x)## and ##\phi_m (x)## are orthogonal relative to a weight function, ##\sigma(x)##. in other words, $$\int_\Omega \phi_n(x) \phi_m(x) \sigma(x) dx = 0 : m \neq n$$
can someone explain why this would be zero?

thanks!

MathematicalPhysicist · Aug 14, 2014

If [tex]\phi_m[/tex] is an orthogonal basis with respect to a dot product of the form:

[tex]<f,g> = \int fg w[/tex] for some weight function w then obviously by definition the above product should vanish since that's the definitino of orthogonal basis.

micromass · Aug 14, 2014

joshmccraney said:

hi pf!

ok, so my math text for PDE's states the following theorem: $$f(x) = \sum_n a_n \phi_n (x)$$ for "nice enough" functions. however, the next theorem states that ##\phi_n (x)## and ##\phi_m (x)## are orthogonal relative to a weight function, ##\sigma(x)##. in other words, $$\int_\Omega \phi_n(x) \phi_m(x) \sigma(x) dx = 0 : m \neq n$$
can someone explain why this would be zero?

thanks!

You should see this as a far-reaching generalization of Fourier series. Indeed, we have that

$$f(x) = \sum_n a_n \sin(nx) + b_n \cos(nx)$$

for "nice enough" functions, and the functions ##\sin(nx)## and ##\cos(nx)## are orthogonal in the sense that

$$\int_{-\pi}^\pi \sin(nx)\sin(mx)dx = \int_{-\pi}^\pi \sin(nx)\cos(mx)dx = 0.$$

ShayanJ · Aug 14, 2014

Take a look at this.

member 428835 · Aug 14, 2014

Shyan said:

Take a look at this.

This looks very similar to my text. I notice page 197-198 they use this orthogonality relation, but it's taken from the preceding theorem on page 189, part 3. But why is this the case. For sure it's true with sine and cosines. But then there's the ##\sigma##.

thanks for your help

member 428835 · Aug 14, 2014

micromass said:

You should see this as a far-reaching generalization of Fourier series. Indeed, we have that

$$f(x) = \sum_n a_n \sin(nx) + b_n \cos(nx)$$

for "nice enough" functions, and the functions ##\sin(nx)## and ##\cos(nx)## are orthogonal in the sense that

$$\int_{-\pi}^\pi \sin(nx)\sin(mx)dx = \int_{-\pi}^\pi \sin(nx)\cos(mx)dx = 0.$$

Yes, this is how I began to accept the theorem, but what about that ##\sigma(x)##? That's what threw me off.

pasmith · Aug 14, 2014

joshmccraney said:

Yes, this is how I began to accept the theorem, but what about that ##\sigma(x)##? That's what threw me off.

The general theory of linear operators on an inner product space then says that if an operator is self-adjoint with respect to the inner product then its eigenvalues (if any) are real, and that eigenvectors corresponding to different eigenvalues are orthogonal with respect to the inner product. By definition, [itex]f[/itex] and [itex]g[/itex] are orthogonal with respect to an inner product [itex]\langle \cdot, \cdot \rangle[/itex] if and only if [itex]\langle f, g \rangle = 0[/itex], and the operator [itex]A[/itex] is self-adjoint if and only if [itex]\langle A(f), g \rangle = \langle f, A(g) \rangle[/itex] for every [itex]f[/itex] and [itex]g[/itex] in the space.

Now with suitable boundary conditions it can be shown that the second order linear differential operator [itex]\mathcal{L}_1(y) = (p(x)y')' - q(x)y[/itex] is self-adjoint with respect to the inner product [tex]
\langle f, g \rangle_1 = \int_a^b f(x)g(x)\,dx.
[/tex]

Consider now the most general linear second-order differential operator [tex]\mathcal{L}(y) = y'' + B(x)y' + C(x)y.[/tex] This can be put into self-adoint form by taking [tex]
y'' + B(x)y' + C(x)y = \frac{1}{w(x)}\left( \frac{d}{dx}(p(x)y') - q(x)y\right)
[/tex] where [itex]w[/itex], [itex]p[/itex] and [itex]q[/itex] can be found in terms of [itex]B[/itex] and [itex]C[/itex] as [tex]
w(x) = \exp\left( \int B(x)\,dx\right), \\
p(x) = \exp\left( \int B(x)\,dx \right), \\
q(x) = - C(x) \exp\left( \int B(x)\,dx \right).
[/tex]
It follows that [itex]\mathcal{L} = (1/w(x))\mathcal{L}_1[/itex] is self-adjoint with respect to the inner product [tex]
\langle f, g \rangle_2 = \int_a^b w(x)f(x)g(x)\,dx.
[/tex] (I suppose one should at some point prove that if [itex]w(x) > 0[/itex] on [itex](a,b)[/itex] then indeed this is an inner product). Thus if you include a weight function then you can apply Sturm-Liouville theory to any second-order linear differential operator, and the most frequent example in practice (aside from Fourier series) are the Bessel functions which arise in separable solutions of Laplace's equation in cylindrical polar coordinates and are the solutions of [tex]
y'' + \frac{y'}{x} - \frac{\alpha^2}{x^2}y = \lambda y,
[/tex] where [itex]\alpha[/itex] is a parameter and for which we find [itex]w(x) = x[/itex].

AlephZero · Aug 14, 2014

We don't know how your textbook motivated this, but in practice you might start with the function ##\sigma## and then derive some orthogonal functions that are consistent with it.

For example if you want to work in cylindrical coordinates, translating some physics into a double integral over an area is likely to include ##\iint \dots\, r\,dr\,d\theta## as the element of area, so you would probably want to take ##\sigma(r) = r##. In that situation, the relevant orthogonal functions are Bessel functions, not sines and cosines.

Most of the other "special functions" in mathematical physics (e.g. Legendre, Laguerre, and Chebyshev polynomials, etc) have similar orthogonality relations with different functions ##\sigma##. You can consider Sturm-Liouville theory as a generalization of all these "special cases".

ShayanJ · Aug 14, 2014

joshmccraney said:

This looks very similar to my text. I notice page 197-198 they use this orthogonality relation, but it's taken from the preceding theorem on page 189, part 3. But why is this the case. For sure it's true with sine and cosines. But then there's the ##\sigma##.

thanks for your help

Page 195, proof of theorem 6.9!
The weight function appears because sometimes you should multiply a function by the operator so that the operator becomes self-adjoint.
Just read the paper from beginning and you'll get your answer.

Why are Sturm-Louville Functions Orthogonal Relative to a Weight Function?

FAQ: Why are Sturm-Louville Functions Orthogonal Relative to a Weight Function?

What is the Sturm-Louville equation and why is it important?

What are Sturm-Louville functions and how are they related to the Sturm-Louville equation?

Why are Sturm-Louville functions orthogonal relative to a weight function?

How does the orthogonality of Sturm-Louville functions help in solving the Sturm-Louville equation?

What are some real-world applications of Sturm-Louville functions and the Sturm-Louville equation?

Similar threads

Hot Threads

Recent Insights