The t-test and the central limit theorem

In summary: And "t0.05,15 = 1.7" is wrong - that is a single-tailed value, while you are looking for the two-tailed value.
  • #1
TytoAlba95
132
19
Homework: No Effort - Member warned that some effort must be shown
Homework Statement
It is hypothesized that the mean dry weight of a female in a Drosophila population is 4.5 mg. In a sample of 16 female with ȳ =4.8 mg and s = 0.8 mg, what dry weight values would lead to rejection of the null hypothesis at p = 0.05 level?
(take t0.05=2.1)
Relevant Equations
1. Values lower than 4.0 and values higher than 5.6
2. Values lower than 3.20 and values higher than 6.40
3. Values lower than 4.38 and values higher than 5.22
4. Values lower than 3.22 and values higher than 6.48
Ans is 3.

I know basic t-test but I have no clue to solve this question.
Thanks.
 
Last edited by a moderator:
Physics news on Phys.org
  • #2
It would appear that by "what dry weight values would lead...?" they mean "what hypothesised values of the mean dry weight would lead...?" Which seems odd when they've given you a hypothesised value already. It sounds as if they're talking about measured values, but that would make no sense. Mean, standard deviation and hypothesised mean are all you need for a t-test; if they were talking about rejecting outliers that would be a different test.
 
  • #3
I'm sorry, I couldn't understand what you said.
The book I'm following has provided a solution to this question which is too complex for me. It talks about central limit theorem.
243739
 
  • #4
What in particular don't you understand? You say you "know basic t-test". Do you mean you know a formula that you apply blindly, or do you understand where it comes from?
The question as you state it (have you copied it correctly?) is badly expressed and unclear. The answer also contains mistakes, e.g. in the last line "does not contain" should be "contains". The paragraph immediately after the diagram should read
"... there is a 95% probability that the interval μ ± 2.1(σ/√n) contains Y. Likewise, there is a 95% probability that the interval specified by Y ± 2.1(σ/√n) contains μ."
If the value of the hypothesised mean was greater than 5.22 or less than 4.38, you would reject the hypothesis on the basis of the data. This is what answer 3 must mean. But they tell you that the hypothesised mean is 4.5. It makes no sense.
 
  • #5
Sorry, I understand it was too vague to have said 'I know basic t-test'. I only know how to apply formulae, I should have mentioned that.
I have copied the sum correctly (there's no mistake...).

My Attempt :
Ho= There's no difference between the sample mean (ȳ =4.8 mg) and the population mean μ=4.5.
Ha= There's difference between the sample mean and population mean. The sample doesn't belong to the population.

tcal= (ȳ -μ)/SE
here SE=s/√n-1.
tcal= {4.8-4.5}/(0.8/3.8) = 0.3/.21 = 0.38

My tcal is greater than the given t0.05 = 2.1 (though t0.05, df=15 =1.7), so the Ho is rejected. The sample does not belong to the population.

what dry weight values would lead to rejection of the null hypothesis at p = 0.05 level?

From the above quote it appears to me that the Ho should not have been rejected, and the question is asking the hypothetical μ for which it will be rejected.

Then again after checking the solution I got more confused with confidence interval and central limit theorem. (Could you suggest some easy-reads on these terms)

I hope I could make my current understanding of this sum more clear.
 
  • #6
I'm afraid your post illustrates that "only knowing how to apply formulae" without understanding them means that you will sometimes apply them wrongly.
Your H0 is nonsensical - of course there is a difference between 4.8 and 4.5. Do you mean "the difference is not statistically significant at the 0.05 level"? Similarly with Ha: "There is a statistically significant difference..."
What do you mean by SE = s/√n - 1. As written it is ambiguous. Do you mean s/√(n - 1), or s/(√n - 1), or (s/√n) - 1? I suspect the first, but that is wrong - it should be s/√n.
0.3/.21 is not 0.38, it is 1.43 - which is still not greater than 2.1, so why do you reject the hypothesis? And "t0.05,15 = 1.7" is wrong - that is a single-tailed value, while you are looking for the two-tailed value.
SanjuktaGhosh said:
From the above quote it appears to me that the Ho should not have been rejected, and the question is asking the hypothetical μ for which it will be rejected.
I agree with you. That was what I was saying, I'm sorry if it wasn't clear.
There's no real substitute for understanding where the t distribution comes from and how the t test should be used. Unfortunately, I can't recommend a simple source for you, as the book I learned from is probably unobtainable now, and the Wikipedia article looks fearsomely mathematical. Perhaps someone else could help with suggestions?
 
  • #7
mjc123 said:
I'm afraid your post illustrates that "only knowing how to apply formulae" without understanding them means that you will sometimes apply them wrongly.
Your H0 is nonsensical - of course there is a difference between 4.8 and 4.5. Do you mean "the difference is not statistically significant at the 0.05 level"? Similarly with Ha: "There is a statistically significant difference..."
What do you mean by SE = s/√n - 1. As written it is ambiguous. Do you mean s/√(n - 1), or s/(√n - 1), or (s/√n) - 1? I suspect the first, but that is wrong - it should be s/√n.

Oh! I didn't know it should be s/√n, I was taught SE=s/√(n-1).

0.3/.21 is not 0.38, it is 1.43 - which is still not greater than 2.1, so why do you reject the hypothesis? And "t0.05,15 = 1.7" is wrong - that is a single-tailed value, while you are looking for the two-tailed value.

Sorry, those were silly mistakes.

I agree with you. That was what I was saying, I'm sorry if it wasn't clear.
There's no real substitute for understanding where the t distribution comes from and how the t test should be used. Unfortunately, I can't recommend a simple source for you, as the book I learned from is probably unobtainable now, and the Wikipedia article looks fearsomely mathematical. Perhaps someone else could help with suggestions?

Yes, Wikipedia is too mathematical.
Can I create a post and ask other biologists for book recommendation?
 
  • #8
I suggest posting a request in the biology forum, in case they aren't looking here.
 
  • Like
Likes TytoAlba95
  • #9
The book I used was "Statistics for Mathematicians" by D.J.Finney; it appears that second-hand copies are available on Amazon. The same author also appears to have written a somewhat shorter "Statistics for Biologists", which costs about £100 new, but used copies are available much cheaper. I don't know how much easier it is.
 
  • Like
Likes TytoAlba95
  • #10
mjc123 said:
The book I used was "Statistics for Mathematicians" by D.J.Finney; it appears that second-hand copies are available on Amazon. The same author also appears to have written a somewhat shorter "Statistics for Biologists", which costs about £100 new, but used copies are available much cheaper. I don't know how much easier it is.
Thank you for helping me so much and bearing with me.
I'll post in Biology/Medical forum.
 

FAQ: The t-test and the central limit theorem

1. What is a t-test and when is it used?

A t-test is a statistical test used to determine if there is a significant difference between the means of two groups. It is commonly used when the sample size is small or when the population standard deviation is unknown.

2. What is the central limit theorem and why is it important?

The central limit theorem states that as the sample size increases, the distribution of sample means will approach a normal distribution regardless of the underlying distribution of the population. This is important because it allows us to make inferences about a population based on a sample, and it is the basis for many statistical tests.

3. How does the central limit theorem relate to the t-test?

The t-test relies on the central limit theorem to assume that the sample means are normally distributed. This allows us to use the t-distribution to calculate p-values and confidence intervals.

4. What are the assumptions of the t-test?

The main assumptions of the t-test are that the data is normally distributed, the variances of the two groups are equal, and the observations are independent. Violating these assumptions can lead to inaccurate results.

5. Can the t-test be used for non-parametric data?

No, the t-test is a parametric test and assumes that the data is normally distributed. If the data is non-parametric, a non-parametric test such as the Wilcoxon rank-sum test should be used instead.

Similar threads

Replies
25
Views
1K
Replies
1
Views
651
Replies
10
Views
1K
Replies
11
Views
3K
Replies
6
Views
2K
Back
Top