Expand (x-μi)ᵀΣ⁻¹(x-μi) Homework

  • Thread starter devonho
  • Start date
  • Tags
    Expanding
In summary, the question is asking whether the expression (\bf{x-\mu_{i}})^{t}\Sigma^{-1}(\bf{x-\mu_{i}}) can be written as \bf{x}^t\Sigma^{-1}\bf{x}+ \bf{\mu_{i}}^t\Sigma^{-1}\bf{\mu_{i}}- 2\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}. The answer is yes if the matrix \Sigma is symmetric.
  • #1
devonho
8
0

Homework Statement



I need help with expanding:

[itex](\bf{x-\mu_{i}})^{t}\Sigma^{-1}(\bf{x-\mu_{i}})[/itex]

[itex]\bf{x,\mu_{i}}[/itex] are column vectors.
[itex]\Sigma[/itex] is a square matrix.

Thank you.

Homework Equations



Can:

[itex]
\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}
[/itex]

be written as?

[itex]
\bf{\mu_{i}}^t\Sigma^{-1}\bf{x}
[/itex]

I've tried to compute this numerically and the answer is no.


The Attempt at a Solution



[itex]
(\bf{x-\mu_{i}})^{t}\Sigma^{-1}(\bf{x-\mu_{i}})
=
\bf{x}^t\Sigma^{-1}\bf{x}+
\bf{\mu_{i}}^t\Sigma^{-1}\bf{\mu_{i}}-
\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}-
\bf{\mu_{i}}^t\Sigma^{-1}\bf{x}
[/itex]

 
Physics news on Phys.org
  • #2


devonho said:

Homework Statement



I need help with expanding:

[itex](\bf{x-\mu_{i}})^{t}\Sigma^{-1}(\bf{x-\mu_{i}})[/itex]

[itex]\bf{x,\mu_{i}}[/itex] are column vectors.
[itex]\Sigma[/itex] is a square matrix.

Thank you.

Homework Equations



Can:

[itex]
\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}
[/itex]

be written as?

[itex]
\bf{\mu_{i}}^t\Sigma^{-1}\bf{x}
[/itex]

I've tried to compute this numerically and the answer is no.


The Attempt at a Solution



[itex]
(\bf{x-\mu_{i}})^{t}\Sigma^{-1}(\bf{x-\mu_{i}})
=
\bf{x}^t\Sigma^{-1}\bf{x}+
\bf{\mu_{i}}^t\Sigma^{-1}\bf{\mu_{i}}-
\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}-
\bf{\mu_{i}}^t\Sigma^{-1}\bf{x}
[/itex]

I don't understand your problem. The result in (3) is correct. If *you* derived that result, then you have obtained the desired result. On the other hand, if you mean that somebody else has given you the result in (3) and you don't know how they got it, that is a different question. So, what, exactly are you asking?

RGV
 
  • #3


BTW, [itex]\Sigma[/itex] is a horrible name for a matrix for the reason that it is used primarily to mean summation.
 
  • #4


Mark44 said:
BTW, [itex]\Sigma[/itex] is a horrible name for a matrix for the reason that it is used primarily to mean summation.

I agree with you. Nevertheless, it is often used in Statistics and Econometrics, etc., to denote the variance-covariance matrix of a multidimensional random variable.

RGV
 
  • #5


devonho said:

Homework Statement




Can:

[itex]
\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}
[/itex]

be written as?

[itex]
\bf{\mu_{i}}^t\Sigma^{-1}\bf{x}
[/itex]

I've tried to compute this numerically and the answer is no.

The answer is yes IF the matrix [tex] \Sigma [/tex] is symmetric. Since
[tex]
\bf{\mu_i}^t \Sigma^{-1} \bf{x}
[/tex]

is a scalar, it equals its transpose, so
[tex]
\bf{\mu_i}^t \Sigma^{-1} \bf{x} = \left(\bf{\mu_i}^t \Sigma^{-1} \bf{x}\right)^t = \bf{x}^t \Sigma^{-1} \bf{\mu_i}
[/tex]

Did the example you used have sigma symmetric? (If it is a variance-covariance matrix, it has to be symmetric).
 
  • #6


The way (3) is written, it is valid even if the matrix is not symmetric: the terms x^T A m and m^T A x are written separately.

RGV
 
  • #7


Hi all, thanks for the replies. The goal was to get:

[itex](\bf{x-\mu_{i}})^{t}\Sigma^{-1}(\bf{x-\mu_{i}})
= \bf{x}^t\Sigma^{-1}\bf{x}+ \bf{\mu_{i}}^t\Sigma^{-1}\bf{\mu_{i}}- 2\bf{x}^t\Sigma^{-1}\bf{\mu_{i}}[/itex]

Hence,
[itex]
\bf{\mu_i}^t \Sigma^{-1} \bf{x} = \bf{x}^t \Sigma^{-1} \bf{\mu_i}
[/itex]

Was what I needed. Thanks.
 
  • #8


Thanks for the help. I wrote the long proof.

If [itex]
m_{12}=m_{21}, m_{13}=m_{31},m_{32}=m_{23},
[/itex]

[itex]
\bf{x} = \left[
\begin{array}{ccc}
x_1 \\ x_2 \\ x_3
\end{array}
\right]
[/itex]

[itex]
\bf{y} = \left[
\begin{array}{ccc}
y_1 \\ y_2 \\ y_3
\end{array}
\right]
[/itex]

[itex]
\bf{M} =
\left[
\begin{array}{ccc}
m_{11} & m_{12} & m_{13} \\
m_{21} & m_{22} & m_{23} \\
m_{31} & m_{32} & m_{33}
\end{array}
\right]
[/itex]


then

[itex]
\bf{y^tMx}=
[/itex]
[itex]
\begin{array}{ccc}
x_1y_1m_{11}+
(x_1y_2+x_2y_1)m_{12}+
(x_1y_3 + x_3y_1)m_{13}+
x_2y_2m_{22}+
(x_2y_3 + x_3y_2)m_{23}+
x_3y_3m_{33}
\end{array}
[/itex]

[itex]
\bf{x^tMy}=
[/itex]
[itex]
\begin{array}{ccc}
\begin{array}{ccc}
(x_1m_{11}+x_2m_{21}+x_3m_{31})y_1 + (x_1m_{12}+x_2m_{22}+x_3m_{32})y_2 + (x_1m_{13}+x_2m_{23}+x_3m_{33})y_3
\end{array}
\\=
\begin{array}{ccc}
x_1y_1m_{11}+
(x_2y_1+
x_1y_2)m_{12}+
(x_3y_1+
x_1y_3)m_{13}+
x_2y_2m_{22}+
(x_3y_2+
x_2y_3)m_{23}+
x_3y_3m_{33}
\end{array}
\end{array}
[/itex]
[itex]
=\bf{y^tMx}
[/itex]
 
Last edited:

FAQ: Expand (x-μi)ᵀΣ⁻¹(x-μi) Homework

What is (x-μi)ᵀΣ⁻¹(x-μi)?

(x-μi)ᵀΣ⁻¹(x-μi) is a mathematical expression used in statistics and probability to calculate the distance between a point and a mean value, while taking into account the variability of the data. It is often used in multivariate analysis and is a measure of the Mahalanobis distance.

How is (x-μi)ᵀΣ⁻¹(x-μi) used in statistics?

(x-μi)ᵀΣ⁻¹(x-μi) is used in statistics to determine the distance of a data point from the mean value, while also considering the covariance between different variables. This is helpful in identifying outliers and understanding the relationship between variables in a dataset.

What do the symbols in (x-μi)ᵀΣ⁻¹(x-μi) represent?

The symbol x represents a data point, while μi represents the mean value of that data point. Σ⁻¹ is the inverse of the covariance matrix Σ, which measures the relationship between different variables in a dataset.

How do you calculate (x-μi)ᵀΣ⁻¹(x-μi)?

To calculate (x-μi)ᵀΣ⁻¹(x-μi), you first need to subtract the mean value μi from the data point x. Then, you multiply the result by the inverse of the covariance matrix Σ⁻¹, and finally, multiply the whole expression by (x-μi)ᵀ. This will give you the distance between the data point and the mean value, taking into account the variability of the data.

What is the significance of (x-μi)ᵀΣ⁻¹(x-μi) in data analysis?

(x-μi)ᵀΣ⁻¹(x-μi) is an important tool in data analysis, as it allows for the identification of outliers and provides a measure of the relationship between variables in a dataset. It is also used in various statistical tests and models, such as linear regression and discriminant analysis, to make more accurate predictions and inferences about the data.

Similar threads

Replies
13
Views
2K
Replies
4
Views
9K
Replies
7
Views
2K
Replies
7
Views
4K
Replies
2
Views
1K
Back
Top