Hypothesis Testing Stats: Making a Test w/ Unknown SD & Tables

In summary, the conversation discusses the process of conducting a hypothesis test under the assumptions of an unknown population standard deviation and using tables only. The conversation also touches on the alternative hypothesis and its effect on the acceptance region, as well as the limitations of hypothesis testing as a decision-making tool. The conversation concludes with a statement that the general procedure for solving such problems is typically covered in a statistics 101 course.
  • #1
Klungo
136
1
Statistics doesn't come to me as naturally as math.

I'm curious as to how to make a hypothesis test under the assumptions that the population standard deviation is unknown and using tables only.

Here is my understanding.

Given
Suppose:
[itex]H_0: \mu = \mu_0[/itex].
Suppose also that:
[itex]\bar{x}, s[/itex] is the mean and standard deviation of a sample of size [itex]n[/itex].
Suppose a significance level of [itex]\alpha[/itex].

Question: If [itex]\bar{x} < \mu_0[/itex], do we use [itex]H_A: \mu < \mu_0[/itex]?
When do we use [itex]<,>,\neq[/itex]?

Test Statistic
Since the population standard deviation is unknown, we "standardize" to a random variable [itex]T[/itex] with a [itex]t[/itex] distribution with [itex]n-1[/itex] degrees of freedom.

[itex]T_{test} = \displaystyle\frac{\bar{x} - \mu_0}{s/\sqrt{n}}[/itex].

Determining P-value
Assuming [itex]H_A: \mu < \mu_0[/itex], we have a left-tail test
So, [itex]p-value = P(T < T_{test}) = P(-T > -T_{test})[/itex] by symmetry. (My tables only show the right tail.)

So, now we look at the table.
Question I'm not sure if I know how to read and apply the values of the table. Is my work below correct?

Suppose [itex]df = 10, T_{test} = 2 [/itex].
Here is a t-table: http://3.bp.blogspot.com/_5u1UHojRiJk/TEdJJc6of2I/AAAAAAAAAIE/Ai0MW5VgIhg/s1600/t-table.jpg.

We have [itex]T_{0.05}=1.812 < T_{test}= 2 < T_{0.025} = 2.228 [/itex]
Thus, [itex]0.025 < p-value < 0.05[/itex].

Decision
Finally, if [itex]p-value < \alpha[/itex], then we reject [itex]H_0[/itex]. Otherwise, we do not reject [itex]H_0[/itex].

The not equal case
If [itex]H_A : \mu \neq \mu_0[/itex], then we have
[itex]p-value = 2 P(|T| > T_{test}) = 2 P(-T < -T_{test}) + 2 P(T > T_{test})[/itex].

Question What now? And is the equality above correct?Thanks.
 
Physics news on Phys.org
  • #2
Klungo said:
Question: If [itex]\bar{x} < \mu_0[/itex], do we use [itex]H_A: \mu < \mu_0[/itex]?
When do we use [itex]<,>,\neq[/itex]?

The alternative hypothesis is determined by the context of the particular problem. For example if [itex] \mu_0 [/itex] were the mean weight of a type of candy bar and you are a consumer then your concern might be that you are not "short changed" by getting a batch of candy bars whose mean weight is less than [itex] \mu_0 [/itex]. if you are a candy bar producer then your concern might be that the factory is not making candy bars that too heavy or too light, so your alternative hypothesis would be that the mean is not equal to [itex] \mu_0 [/itex].

The effect of alternative hypothesis on the mechanics of the problem is that it determines what kind of "acceptance region" you use in the hypothesis test. The typical decision about acceptance regions is whether to do a "one tailed" or "two tailed" test.

You must realize that hypothesis testing is simply a procedure for making a decision. It is not a proof that the decision is correct. It doesn't even tell you the probability that the decision is correct. There is no mathematical proof that it is an optimal procedure. So if you are seeking a mathematical understanding of why hypothesis testing should be done a certain way, you won't find any mathematical justifications based on those goals. If you want a procedure that aims for those goals, you have to study Bayesian statistics.
 
  • #3
You must realize that hypothesis testing is simply a procedure for making a decision. It is not a proof that the decision is correct. It doesn't even tell you the probability that the decision is correct. There is no mathematical proof that it is an optimal procedure. So if you are seeking a mathematical understanding of why hypothesis testing should be done a certain way, you won't find any mathematical justifications based on those goals.

I just want to know how it's done as it is in a stats 101 course. I know that it is not a proof, but rather an "educated" guess that could be wrong.

[Edit:] That is, the general procedure for solving problems given those assumptions.
 
  • #4
I declare this thread over. I now understand the procedure.
 
  • #5


Hello,

Thank you for sharing your understanding and questions about hypothesis testing with unknown standard deviation and using tables only. You have a good understanding of the process so far, but let me clarify and answer your questions.

First, to answer your question about when to use <, >, or \neq in the alternative hypothesis H_A, it depends on the research question and the direction of the expected difference between the sample mean and the population mean. For example, if the research question is whether a new treatment has a lower mean effect than the current treatment, then the alternative hypothesis would be H_A: \mu < \mu_0. If the research question is whether a new product has a different mean satisfaction score than the current product, then the alternative hypothesis would be H_A: \mu \neq \mu_0. It is important to carefully consider the research question and the expected direction of the difference in order to choose the appropriate alternative hypothesis.

Next, your understanding of the test statistic and how to determine the p-value is correct. However, for the not equal case, your understanding of the p-value calculation is not entirely correct. The correct formula for the p-value in the not equal case is P(|T| > T_{test}) = P(T > T_{test}) + P(T < -T_{test}). This accounts for both tails of the distribution. Therefore, when using tables, you would look up the p-value for T > T_{test} and add it to the p-value for T < -T_{test}. So, in your example, if T_{test} = 2 and df = 10, the p-value would be P(T > 2) + P(T < -2) = 0.026 + 0.026 = 0.052.

It looks like you are using a t-table with values for the right tail only. In order to use this table for left-tail tests, you can use the symmetry property of the t-distribution. This means that if you are looking for the p-value for T < T_{test}, you can use the p-value for T > T_{test} by subtracting it from 1. So, in your example, the p-value for T > 2 is 0.026, so the p-value for T < 2 would be 1 - 0.026 = 0.974.

I hope this helps to clarify your understanding
 

FAQ: Hypothesis Testing Stats: Making a Test w/ Unknown SD & Tables

What is hypothesis testing in statistics?

Hypothesis testing in statistics is a method of using data to make decisions about a larger population. It involves comparing a sample to a known or assumed population and determining whether the results are statistically significant.

What is the purpose of hypothesis testing?

The purpose of hypothesis testing is to determine whether there is enough evidence to reject or accept a null hypothesis. The null hypothesis is a statement that there is no significant difference between the sample and population being studied.

What is an unknown standard deviation in hypothesis testing?

An unknown standard deviation in hypothesis testing means that the population standard deviation is not known and must be estimated from the sample data. This can affect the type of statistical test that is used and the interpretation of the results.

How do you make a hypothesis test with unknown standard deviation?

To make a hypothesis test with an unknown standard deviation, you will need to use a t-test instead of a z-test. The t-test takes into account the estimated standard deviation from the sample and adjusts the results accordingly. You will also need to use a t-table to find the critical value for your test.

What are the common types of hypothesis tests used in statistics?

The most common types of hypothesis tests used in statistics are one-sample t-test, two-sample t-test, paired t-test, ANOVA, and Chi-square test. The type of test used depends on the research question, the type of data being analyzed, and the assumptions of the data.

Similar threads

Replies
3
Views
962
Replies
20
Views
3K
Replies
1
Views
1K
Replies
10
Views
2K
Replies
10
Views
3K
Replies
5
Views
3K
Replies
4
Views
1K
Replies
16
Views
3K
Back
Top