Finding Stationary Points of a Matrix Function: Derivatives and Eigenvectors

In summary, the conversation discusses finding stationary points of a function involving a symmetric matrix A. The derivative is taken and rewritten in terms of x transpose and A, with the quotient rule used for differentiation. The final simplified form of the derivative is shown.
  • #1
libragirl79
31
0
Hi,

I am trying to find stationary points of the function f(x)=(xtAx)/(xtx) (the division of x transpose times A times x divided by x transpose x) where A is a px1 symmetric matrix. I need to take the derivative of this to show that when i set it to zero i get the eigenvectors of A. I know how to take the derivative of xtx in respect to x, which is xtAt + xtA and since A=At, then this would be 2Axt, but I don't know what to do when it's a division, is there an equivalent of a quotient rule for matrix derivatives?

Thanks very much!
 
Physics news on Phys.org
  • #2
libragirl79 said:
Hi,

I am trying to find stationary points of the function f(x)=(xtAx)/(xtx) (the division of x transpose times A times x divided by x transpose x) where A is a px1 symmetric matrix. I need to take the derivative of this to show that when i set it to zero i get the eigenvectors of A. I know how to take the derivative of xtx in respect to x, which is xtAt + xtA and since A=At, then this would be 2Axt, but I don't know what to do when it's a division, is there an equivalent of a quotient rule for matrix derivatives?

Thanks very much!

Welcome to PF, libragirl79! :smile:

Suppose you diffentiate ##f(\mathbf x) = f((^x_y)) = {(x\ y)A(^x_y) \over (x\ y)(^x_y)}##.

First you would use the quotient rule to differentiate with respect to x to find:
$${\partial f \over \partial x} = {((1\ 0)A(^x_y) + (x\ y)A(^1_0))\cdot (x\ y)(^x_y) - (x\ y)A(^x_y) \cdot 2x \over ((x\ y)(^x_y))^2}$$

Now let's rewrite, using ##A=A^T##, with ##\mathbf x##:
$${\partial f \over \partial x} = {2\mathbf x^T A (^1_0) \cdot \mathbf x^T\mathbf x - \mathbf x^T A \mathbf x \cdot 2\mathbf x^T(^1_0) \over (\mathbf x^T\mathbf x)^2}$$
$${\partial f \over \partial x} = {2\mathbf x^T A \cdot \mathbf x^T\mathbf x - \mathbf x^T A \mathbf x \cdot 2\mathbf x^T \over (\mathbf x^T\mathbf x)^2}(^1_0)$$

In other words, if ##A=A^T##:
$$Df(\mathbf x) = {2\mathbf x^T A \cdot \mathbf x^T\mathbf x - \mathbf x^T A \mathbf x \cdot 2\mathbf x^T \over (\mathbf x^T\mathbf x)^2}$$
 
Last edited:
  • #3
Thanks for your reply!

Since I forgot a lot of my matrix algebra, when you say A (x y) it looks like x choose y, but i am assuming that's not what you meant...
 
  • #4
libragirl79 said:
Thanks for your reply!

Since I forgot a lot of my matrix algebra, when you say A (x y) it looks like x choose y, but i am assuming that's not what you meant...

Since you are talking about x transpose, that means that x is a vector.
A vector is commonly written as:
$$\mathbf x = \begin{pmatrix}x_1 \\ x_2 \\ \vdots \\ x_n\end{pmatrix}$$

I have simplified it to:
$$\mathbf x = \begin{pmatrix}x \\ y\end{pmatrix}$$

So indeed, this does not mean x choose y. ;)


Furthermore:
$$\mathbf x^T = \begin{pmatrix}x_1 & x_2 & \dots & x_n\end{pmatrix}$$
and
$$\mathbf x^T \mathbf x = \begin{pmatrix}x_1 & x_2 & \dots & x_n\end{pmatrix}\begin{pmatrix}x_1 \\ x_2 \\ \vdots \\ x_n\end{pmatrix} = x_1^2 + x_2^2 + \dots + x_n^2$$
 
  • #5
Ok, I see! Thanks! :)
 

FAQ: Finding Stationary Points of a Matrix Function: Derivatives and Eigenvectors

What are derivatives of matrices?

Derivatives of matrices refer to the mathematical concept of finding the rate of change of a matrix with respect to one or more variables. It involves taking the partial derivatives of each element of the matrix and arranging them into a new matrix, known as the Jacobian matrix.

How are derivatives of matrices calculated?

To calculate derivatives of matrices, we use the chain rule and product rule, just like in regular calculus. We take the partial derivatives of each element of the matrix and arrange them into a new matrix, which gives us the Jacobian matrix. This process can be extended to higher-order derivatives as well.

What are the applications of derivatives of matrices?

Derivatives of matrices have many applications in various fields such as physics, engineering, and economics. They are used to find optimal solutions in optimization problems, model physical systems, and analyze the behavior of systems over time.

Can derivatives of matrices be calculated for any type of matrix?

Yes, derivatives of matrices can be calculated for any type of matrix, including square matrices, rectangular matrices, and even complex matrices. However, the process of calculation may vary depending on the type of matrix and the number of variables involved.

Are there any tools or software that can help with calculating derivatives of matrices?

Yes, there are various mathematical software and programming languages that have built-in functions for calculating derivatives of matrices, such as MATLAB, Python, and Mathematica. These tools can save time and effort in performing complex calculations and can handle large matrices efficiently.

Similar threads

Replies
3
Views
2K
Replies
6
Views
2K
Replies
5
Views
13K
Replies
8
Views
1K
Replies
8
Views
6K
Replies
1
Views
2K
Back
Top