- #1
Mandelbroth
- 611
- 24
Suppose we have a compact Lie group ##G##, and a differentiable function ##f:G_0\to\mathbb{R}## from the identity component of ##G## to the real numbers. I'm looking to maximize the value of this function.
Being something of a neophyte at optimization, especially of this kind, I decided to stick with something I thought I knew well: the method of steepest ascent. Long story short, I'm having trouble with generalizing the concept to Lie groups.
My idea was fairly simple. We start with some point ##p## on the identity component of ##G##, and then we take the logarithm of this point (that is, we take the inverse of the exponential map) because we want to add something to it. As a note, I justified this by saying that, if the point had more than one value for the logarithm, I could just pick one (every point on ##G_0## has at least one logarithm). Then, I would add some multiple of ##\nabla f_p## to ##\log(p)##, and finish by exponentiating to get ##p'=\exp(c\nabla f_p + \log(p))=\exp(c\nabla f_p)p##. Repeat.
The problem is, I can't figure out what I would use for the analogue of the gradient. Maybe I'm just not seeing something? I don't know. Any nudge in the right direction would be greatly appreciated. Thank you.
Being something of a neophyte at optimization, especially of this kind, I decided to stick with something I thought I knew well: the method of steepest ascent. Long story short, I'm having trouble with generalizing the concept to Lie groups.
My idea was fairly simple. We start with some point ##p## on the identity component of ##G##, and then we take the logarithm of this point (that is, we take the inverse of the exponential map) because we want to add something to it. As a note, I justified this by saying that, if the point had more than one value for the logarithm, I could just pick one (every point on ##G_0## has at least one logarithm). Then, I would add some multiple of ##\nabla f_p## to ##\log(p)##, and finish by exponentiating to get ##p'=\exp(c\nabla f_p + \log(p))=\exp(c\nabla f_p)p##. Repeat.
The problem is, I can't figure out what I would use for the analogue of the gradient. Maybe I'm just not seeing something? I don't know. Any nudge in the right direction would be greatly appreciated. Thank you.
Last edited: