B Testing Hypothesis (vaccine)

Agent Smith · Nov 15, 2024

Placebo/control group has 800 people. They aren't given the vaccine. 60 of them develop the disease
Treatment group has 1000 people. They're given the vaccine. 15 develop the disease

It seems there's a formula viz ##\frac{p_1 - p_2}{\sqrt{p (1 - p)}\left(\frac{1}{n_1} + \frac {1}{n_2}\right)}## to do this.

However I tried something different. Would like to know if it's valid.

For ##p_1## = proportion of diseased in unvaccinated and ##p_2## = proportion of diseased in vaccinated, my hypotheses are:
##H_0: p_2 =p_1##
##H_a: p_2 < p_1##

Is this ok?

Dale · Nov 15, 2024

It is equivalent. Your ##H_0## can be tested by seeing if the above formula is significantly different from 0.

Agent Smith · Nov 16, 2024

@Dale when I do a Chi-square test, my ##\chi^2 \approx 40## for a degree of freedom = 1, that gives a P-value << 0.05,

BUT

when I use the method in the OP, the P-value = 0.0113. It is < 0.05, but not << 0.5.

How come?

Dale · Nov 16, 2024

I don’t know. I think they should be the same, but I have to admit that I have not calculated this test by hand in almost 30 years

Agent Smith · Nov 16, 2024

@Dale can you break the formula down for me? We seem to be "combining" the frequencies with p.
##p = \frac{15 + 60}{1000 + 800}## and then using it compute a standard deviation ##\sqrt {p (1 - p)}##. What does ##\left(\frac{1}{n_1} + \frac{1}{n_2}\right)## do?

A person has assigned values for ##p_1## and ##p_2## as follows:

Can I do ##p_1 = 60/800## and ##p_2 = 15/1000##?

Agent Smith · Nov 19, 2024

In the original question, it says "variances unknown". What if variances are known?

All the relevant variances/standard deviations seem computable from givens.

Gavran · Nov 21, 2024

Agent Smith said:

@Dale when I do a Chi-square test, my ##\chi^2 \approx 40## for a degree of freedom = 1, that gives a P-value << 0.05,

This is okay.

Agent Smith said:

BUT

when I use the method in the OP, the P-value = 0.0113. It is < 0.05, but not << 0.5.

How come?

I have got ## 6,3 ## for ## z ## value which gives ## P ## much less than ## 0,05 ##.

Agent Smith said:

View attachment 353603

@Dale can you break the formula down for me? We seem to be "combining" the frequencies with p.
##p = \frac{15 + 60}{1000 + 800}## and then using it compute a standard deviation ##\sqrt {p (1 - p)}##. What does ##\left(\frac{1}{n_1} + \frac{1}{n_2}\right)## do?

## \begin{align}
z&=\frac{p_1-p_2}{\sqrt{p(1-p)(\frac{1}{n_1}+\frac{1}{n_2})}}\nonumber\\
&=\frac{p_1-p_2}{\sqrt{\frac{p(1-p)}{n_1}+\frac{p(1-p)}{n_2}}}\nonumber
\end{align} ##

where $$ \sigma_1^2=\frac{p(1-p)}{n_1} $$ is the first population variance (800 people, 60 of them develop the disease ) and $$ \sigma_2^2=\frac{p(1-p)}{n_2} $$ is the second population variance (1000 people, 15 of them develop the disease).

So ## p_1-p_2 ## has a normal distribution ## \mathcal{N}(0,\sigma_1^2+\sigma_2^2) ##.

Agent Smith said:

A person has assigned values for ##p_1## and ##p_2## as follows:
View attachment 353604

Can I do ##p_1 = 60/800## and ##p_2 = 15/1000##?

Yes you can and now you will have: ## H_0:p_1-p_2=0 ## and ## H_A:p_1-p_2<0 ##. In this case ## z ## value will be ## -6,3 ## and ## P ## will be the same as for the case in the original post.

Agent Smith · Nov 21, 2024

@Gavran gracias. Very clear. I hope I can remember.

B Testing Hypothesis (vaccine)

Similar threads

Hot Threads

B A Little Probability Puzzle

I Need help solving this Existence Algorithm for truth

A Does this computation satisfy LTL formulas?

I Stochastic calculus: Ito's lemma and differentials

I Help me understand skewness in QQ-plots please

Recent Insights

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers

Insights Fermat's Last Theorem