MHB Standard deviation for proportions

AI Thread Summary
To calculate the standard deviation and standard error for a binomial distribution with multiple samples, the standard deviation is estimated using the formula σ = √(p(1 - p)/n), where p is the estimated proportion. When combining samples with different sizes, a weighted average is used, with weights calculated as w_i = 1/σ_i². The weighted mean is then computed as p̄ = (Σp_i w_i)/(Σw_i), and the standard error of the weighted mean is σ_p̄ = √(1/Σw_i). For reporting results as percentages, the standard errors and standard deviations can be expressed in percentage form by multiplying the proportion values by 100. Understanding these calculations is essential for accurately reporting statistical results.
Zues
Messages
4
Reaction score
0
Hi,,,can u please explain me how to calculate standard deviation and standard error for a binomial distribution when you have several samples?

For exapmple:
I don't know the population size. I take a sample of 10 and check for a particular characteristic. Let's say number of successes for this sample is x. So the proportion of successes is x/n. Then I repeat this process 3 times. That means I take 3 samples. Then I'll calculate the mean of the x/n for these 3 samples. So how do I calculate standard deviation or standard error for this mean value?

Eg: Sample 1 => x/n = x/10 =3/10 =30%

When this is done to all three samples,

Sample 1 => 30% +- a
Sample 2 => 32% +-b
Sample 3 => 32% +- c
Mean = 31.33% +-d

How do I calculate a,b,c and d? And what if I have different sample sizes for the three occasions? (having 10, 15, 8 instead of 10,10,10).

Thank you very much for your help
 
Mathematics news on Phys.org
Zues said:
Hi,,,can u please explain me how to calculate standard deviation and standard error for a binomial distribution when you have several samples?

For exapmple:
I don't know the population size. I take a sample of 10 and check for a particular characteristic. Let's say number of successes for this sample is x. So the proportion of successes is x/n. Then I repeat this process 3 times. That means I take 3 samples. Then I'll calculate the mean of the x/n for these 3 samples. So how do I calculate standard deviation or standard error for this mean value?

Eg: Sample 1 => x/n = x/10 =3/10 =30%

When this is done to all three samples,

Sample 1 => 30% +- a
Sample 2 => 32% +-b
Sample 3 => 32% +- c
Mean = 31.33% +-d

How do I calculate a,b,c and d? And what if I have different sample sizes for the three occasions? (having 10, 15, 8 instead of 10,10,10).

Thank you very much for your help

Hi Zues! Welcome to MHB! :)

The standard deviation of the proportion of a binomial distribution is estimated with:
$$\hat \sigma = \sqrt{\frac{\hat p (1 - \hat p)}{n}}$$
where $\hat p$ is the estimated proportion.

When combining N measurements with different standard deviations, you'll need a weighted average.
The weights are:
$$w_i = \frac{1}{\hat\sigma_i^2}$$
The weighted mean is then:
$$\bar{p} = \frac{ \displaystyle\sum_{i=1}^N \hat p_i w_i}{\displaystyle\sum_{i=1}^N w_i}$$
And the standard error $\sigma_{\bar{p}}$ of the weighted mean is:
$$\sigma_{\bar{p}} = \sqrt{\frac{ 1 }{\sum_{i=1}^N w_i}}$$
 
I like Serena said:
Hi Zues! Welcome to MHB! :)

The standard deviation of the proportion of a binomial distribution is estimated with:
$$\hat \sigma = \sqrt{\frac{\hat p (1 - \hat p)}{n}}$$
where $\hat p$ is the estimated proportion.

When combining N measurements with different standard deviations, you'll need a weighted average.
The weights are:
$$w_i = \frac{1}{\hat\sigma_i^2}$$
The weighted mean is then:
$$\bar{p} = \frac{ \displaystyle\sum_{i=1}^N \hat p_i w_i}{\displaystyle\sum_{i=1}^N w_i}$$
And the standard error $\sigma_{\bar{p}}$ of the weighted mean is:
$$\sigma_{\bar{p}} = \sqrt{\frac{ 1 }{\sum_{i=1}^N w_i}}$$

Thank you very very much Serena. I spent a whole day trying to find this. Thank you very much (Smile)
 
Hi,, I have another question regarding this. I would like to report my results as percentages. Then how should I report the standard errors and standard deviations? I'm asking this because we use the proportion (instead of the percentage value) to calculate the SD an SE

Thank you very much and I'm so sorry for bothering. Thank you
 
Zues said:
Hi,, I have another question regarding this. I would like to report my results as percentages. Then how should I report the standard errors and standard deviations? I'm asking this because we use the proportion (instead of the percentage value) to calculate the SD an SE

Thank you very much and I'm so sorry for bothering. Thank you

A proportion and a percentage represent the same thing.
The only difference is a factor of a 100.

So, suppose you have a weighted mean of $\bar p = 0.31$ and an estimated standard error of $\hat \sigma_{\bar p}=0.12$, then you might also say that $\bar p = 31\%$ and $\hat \sigma_{\bar p}=12\%$.
Or for short:
$$\bar p = 31 \pm 12 \%$$
 
Thank you very much. This means a lot. You are so kind, Thank you(Smile)
 
Seemingly by some mathematical coincidence, a hexagon of sides 2,2,7,7, 11, and 11 can be inscribed in a circle of radius 7. The other day I saw a math problem on line, which they said came from a Polish Olympiad, where you compute the length x of the 3rd side which is the same as the radius, so that the sides of length 2,x, and 11 are inscribed on the arc of a semi-circle. The law of cosines applied twice gives the answer for x of exactly 7, but the arithmetic is so complex that the...
Thread 'Imaginary Pythagoras'
I posted this in the Lame Math thread, but it's got me thinking. Is there any validity to this? Or is it really just a mathematical trick? Naively, I see that i2 + plus 12 does equal zero2. But does this have a meaning? I know one can treat the imaginary number line as just another axis like the reals, but does that mean this does represent a triangle in the complex plane with a hypotenuse of length zero? Ibix offered a rendering of the diagram using what I assume is matrix* notation...
Fermat's Last Theorem has long been one of the most famous mathematical problems, and is now one of the most famous theorems. It simply states that the equation $$ a^n+b^n=c^n $$ has no solutions with positive integers if ##n>2.## It was named after Pierre de Fermat (1607-1665). The problem itself stems from the book Arithmetica by Diophantus of Alexandria. It gained popularity because Fermat noted in his copy "Cubum autem in duos cubos, aut quadratoquadratum in duos quadratoquadratos, et...

Similar threads

Replies
2
Views
1K
Replies
1
Views
2K
Replies
4
Views
1K
Replies
1
Views
2K
Replies
42
Views
4K
Replies
4
Views
2K
Back
Top