Two dimensional normal distribution

In summary: So the result is: $$f(u,v) = \frac1{2\pi\sqrt{1-r^2}}\exp\biggl(\frac{-(u^2+v^2)}{2(1-r^2)}\biggr).$$ This is what you would get if the two distributions were actually normal.
  • #1
Suvadip
74
0
If \(\displaystyle ( X,Y )\) has the normal distributions in two dimensions with zero means and unit variances and the correlation coefficient \(\displaystyle r\), then how to prove that the expectation of the greater of \(\displaystyle X\) and \(\displaystyle Y\) is \(\displaystyle \sqrt{(1-r)\pi}\)?
 
Physics news on Phys.org
  • #2
suvadip said:
If \(\displaystyle ( X,Y )\) has the normal distributions in two dimensions with zero means and unit variances and the correlation coefficient \(\displaystyle r\), then how to prove that the expectation of the greater of \(\displaystyle X\) and \(\displaystyle Y\) is \(\displaystyle \sqrt{(1-r)\pi}\)?

Without loss of generality suppose $X=x>y$ then the conditional mean for $x$ is:

$\displaystyle \overline{X}_y=\int_{x=y}^{\infty} x p(x|y)\, dx$

So:

$\displaystyle \overline{X}=\int_{y=-\infty}^{\infty} \overline{X}_y p(y)\, dy=\int_{y=-\infty}^{\infty}\int_{x=y}^{\infty} x p(x|y)p(y)\, dxdy =\int_{y=-\infty}^{\infty}\int_{x=y}^{\infty} x p(x,y)\, dx dy$

.
 
  • #3
Geometrically, y= x is a line through the origin. x> y is the half-plane to the right. Given that x> y we want to integrate over that half plane. We can, like zzephod, say that taking y from [tex]-\infty[/tex] to [tex]\infty[/tex] and, for each y, x from y to [tex]\infty[/tex]:
[tex]\overline{X}= \frac{1}{2\pi}\int_{-\infty}^\infty\int_y^\infty xe^{-(x^2+ y^2}dxdy[/tex].

Or you can take x from [tex]-\infty[/tex] to [tex]\infty[/tex] and y from [tex]-\infty[/tex] to x: [tex]\overline{Y}= \frac{1}{2\pi}\int_{-\infty}^\infty\int_{-\infty}^x ye^{-x^2+ y^2}dy dx[/tex].
 
  • #4
suvadip said:
If \(\displaystyle ( X,Y )\) has the normal distributions in two dimensions with zero means and unit variances and the correlation coefficient \(\displaystyle r\), then how to prove that the expectation of the greater of \(\displaystyle X\) and \(\displaystyle Y\) is \(\displaystyle \sqrt{(1-r)\pi}\)?
I am not a statistician, so I may be misunderstanding something here. But it seems to me that the previous two comments overlook the fact that $X$ and $Y$ are supposed to be correlated, with correlation coefficient $r$. According to Multivariate normal distribution - Wikipedia, the free encyclopedia, the density function in that case is given by $f(x,y) = \frac1{2\pi\sqrt{1-r^2}}\exp\bigl(-\frac12[x\:\:y]\Sigma^{-1}\bigl[{x\atop y}\bigr]\bigr)$, where $\Sigma$ is the correlation matrix $\Sigma = \begin{bmatrix}1&r \\r&1 \end{bmatrix}.$ Then $\Sigma^{-1} = \frac1{1-r^2}\begin{bmatrix}1&-r \\-r&1 \end{bmatrix},$ giving $$f(x,y) = \frac1{2\pi\sqrt{1-r^2}}\exp\biggl(\frac{-(x^2 - 2rxy + y^2)}{2(1-r^2)}\biggr).$$ To integrate that over the region $x>y$ I would make the change of variables $u = x+y$, $v = x-y$, using the fact that $$x^2 - 2rxy + y^2 = \tfrac12(1-r)u^2 + \tfrac12(1+r)v^2.$$
 
  • #5


The expectation of the greater of X and Y can be calculated by integrating the joint probability density function over the appropriate region. In this case, we are interested in the region where X > Y. Using the properties of the normal distribution, we can rewrite this expectation as:

E[max(X,Y)] = ∫∫ max(x,y) * f(x,y) dx dy

Where f(x,y) is the joint probability density function of (X,Y). Since X and Y have zero means and unit variances, we can write the joint probability density function as:

f(x,y) = (1/2π√(1-r^2)) * e^(-(x^2+y^2-2rx^y)/2(1-r^2))

Substituting this into the expectation equation and using the fact that max(x,y) = x when x > y, we get:

E[max(X,Y)] = ∫∫ x * (1/2π√(1-r^2)) * e^(-(x^2+y^2-2rx^y)/2(1-r^2)) dx dy

Integrating this expression over the region where X > Y, we get:

E[max(X,Y)] = ∫∫∫ x * (1/2π√(1-r^2)) * e^(-(x^2+y^2-2rx^y)/2(1-r^2)) dx dy dz

= ∫∫ (1/2π√(1-r^2)) * e^(-(x^2+y^2-2rx^y)/2(1-r^2)) dx dy

= (1/2π√(1-r^2)) * ∫∫ x * e^(-(x^2+y^2-2rx^y)/2(1-r^2)) dx dy

Using the substitution u = x^2+y^2-2rx^y, we can rewrite this integral as:

E[max(X,Y)] = (1/2π√(1-r^2)) * ∫∫ √(u+2ry) * e^(-u/2(1-r^2)) du dy

= (1/2π√(1-r^2)) * ∫∫ √(u+2ry) * e^(-u/2
 

FAQ: Two dimensional normal distribution

What is a two dimensional normal distribution?

A two dimensional normal distribution, also known as a bivariate normal distribution, is a probability distribution that describes the relationship between two continuous variables. It is characterized by a bell-shaped curve and is commonly used in statistics and data analysis.

How is a two dimensional normal distribution different from a one dimensional normal distribution?

A one dimensional normal distribution only describes the relationship between one continuous variable, while a two dimensional normal distribution describes the relationship between two continuous variables. This means that a two dimensional normal distribution takes into account the correlation between the two variables.

What is the equation for a two dimensional normal distribution?

The equation for a two dimensional normal distribution is:
f(x,y) = (1/2πσ1σ2√(1-ρ^2)) * exp(-(1/2(1-ρ^2))*((x-μ1)^2/σ1^2 - (2ρ(x-μ1)(y-μ2))/σ1σ2 + (y-μ2)^2/σ2^2))
Where:
μ1 and μ2 are the means of the two variables
σ1 and σ2 are the standard deviations of the two variables
ρ is the correlation coefficient between the two variables

What is the importance of the correlation coefficient in a two dimensional normal distribution?

The correlation coefficient, ρ, determines the strength and direction of the correlation between the two variables. It ranges from -1 to 1, where -1 indicates a perfect negative correlation, 0 indicates no correlation, and 1 indicates a perfect positive correlation. This information is important in understanding the relationship between the variables and making predictions based on the data.

How is a two dimensional normal distribution used in real-world applications?

A two dimensional normal distribution is commonly used in fields such as finance, economics, and social sciences to analyze the relationship between two variables, such as stock prices and interest rates. It is also used in data analysis and machine learning to model and make predictions based on data that follows a normal distribution.

Back
Top