A hideous Linear Regression/confidence set question

  • Thread starter Phillips101
  • Start date
  • Tags
    Linear Set
In summary, the conversation discusses finding the distribution of (beta.hat-beta)' * X'*X * (beta.hat-beta) and using it to find a (1-a)-level confidence set for beta. It is mentioned that the expression has a chi-squared distribution, but the variance is unknown and needs to be estimated. The MLE sigma2.hat is suggested as an estimate and it is noted that it has a chi-squared n-1 distribution. It is also mentioned that the expression is distributed as an F distribution, which is needed for finding the confidence set. The question of whether swapping beta.hat for beta.hat-beta makes a difference is raised.
  • #1
Phillips101
33
0
Take the linear model Y=X*beta+e, where e~Nn(0, sigma^2 * I), and it has MLE beta.hat

First, find the distribution of (beta.hat-beta)' * X'*X * (beta.hat-beta), where t' is t transpose. I think I've done this. I think it's a sigma^2 chi-squared (n-p) distribution.

Next, Hence find a (1-a)-level confidence set for beta based on a root with an F distribution. I can't do this to save my life. I'm aware that an F distribution is the ratio of two chi-squareds, but where the hell I'm going to get another chi squared from I have no idea. Also, we're dealing in -vectors- and I don't know how,what,why any confidence set is going to be or even look like, and I've no idea how to even try to get one.

-Any- help would be appreciated. Thanks
 
Physics news on Phys.org
  • #2
Notice that

[tex]
\frac{\hat{\beta}' X'X \hat{\beta}}{\sigma^2}
[/tex]

has a [tex] \Chi^2 [/tex] distribution. however, the variance is unknown, so you need to estimate it (with another expression from the regression). What would you use for the estimate, and what is its distribution?
 
  • #3
Use the MLE sigma2.hat=(1/n)*||Y-Xbeta.hat||^2 ? This is distributed as a chi-squared n-1 variable if I remember correctly...
 
  • #4
If that's correct, then the thing you posted is distributed as an F distribution, which is what I need? And would swapping beta.hat for beta.hat-beta make any difference to this?
 
  • #5


I understand that this may seem like a daunting and confusing task, but I assure you that it is a crucial step in statistical analysis. Let's break down the problem and address each part separately.

First, let's start with the distribution of (beta.hat-beta)' * X'*X * (beta.hat-beta). This is known as the Hotelling's T-squared distribution, which is a multivariate extension of the chi-squared distribution. It follows a non-central chi-squared distribution with n-p degrees of freedom and a non-centrality parameter of (beta.hat-beta)' * X'*X * (beta.hat-beta)/sigma^2. This distribution can be used to calculate the confidence interval for beta.

Next, we need to find a (1-a)-level confidence set for beta. This is where the F distribution comes into play. The F distribution is used to test the equality of two variances, which in this case, is the ratio of (beta.hat-beta)' * X'*X * (beta.hat-beta)/sigma^2 divided by the residual sum of squares. This F statistic can be used to construct a confidence interval for beta using the Hotelling's T-squared distribution.

Since we are dealing with vectors, the confidence set for beta will be a region in the n-dimensional space. This region will be defined by a lower and upper bound for each component of beta. This can be visualized as a hyperellipsoid in the n-dimensional space.

Finally, to obtain the confidence set, we need to calculate the critical values for the F distribution and use them to construct the confidence interval for each component of beta. This will give us a region in the n-dimensional space where we can be (1-a)% confident that the true value of beta lies within.

In conclusion, constructing a confidence set for beta in a linear regression model may seem challenging, but it is a necessary step in statistical analysis. By understanding the underlying distributions and using appropriate statistical tests, we can obtain a region in the n-dimensional space where we can be confident that the true value of beta lies within. I hope this explanation has helped to clarify the process and I am happy to provide further assistance if needed.
 

FAQ: A hideous Linear Regression/confidence set question

What is Linear Regression?

Linear Regression is a statistical method used to analyze the relationship between two or more variables. It is commonly used to predict a dependent variable based on one or more independent variables.

How is Linear Regression different from other regression methods?

Linear Regression assumes a linear relationship between the dependent and independent variables, whereas other regression methods may allow for non-linear relationships. Additionally, Linear Regression uses the least squares method to find the best fit line, while other methods may use different techniques.

What is a confidence set in Linear Regression?

A confidence set in Linear Regression is a range of values within which the true value of the regression coefficient is likely to lie. It is based on the standard error of the estimated coefficient and a chosen level of confidence, typically 95%.

How is a confidence set calculated in Linear Regression?

A confidence set is calculated by taking the estimated coefficient and adding and subtracting a margin of error, which is determined by the standard error of the coefficient and the chosen confidence level. This creates a range of values within which the true coefficient is likely to fall.

Why is it important to have a confidence set in Linear Regression?

A confidence set is important because it helps to assess the reliability and significance of the estimated regression coefficients. It allows us to determine the range of values within which the true coefficient is likely to lie, and to make more accurate predictions and interpretations based on the regression model.

Similar threads

Replies
7
Views
2K
Replies
30
Views
3K
Replies
1
Views
1K
Replies
1
Views
780
Replies
1
Views
885
Replies
7
Views
2K
Replies
8
Views
2K
Replies
3
Views
994
Replies
1
Views
2K
Back
Top