Uncertainty of binomial distribution?

In summary: I solved it by taking the uncertainty of each binomial distribution and multiplying it by the coefficient of variation of the respective binomial distribution. That gives me the uncertainty of the n_total.
  • #1
penguindecay
26
0
Hi, I'd like to know how to find the uncertainty of a function that has two binomial distribution s, something like

Signal = N(yes) - N(no)

Set p = 0.6 for yes. My problem is that I do not know how to find the uncertainty for N(yes) and N(no), and do not know how to find the uncertainty for the signal. Any help would be great! Thanks
 
Physics news on Phys.org
  • #2
One measure of uncertainty is the standard deviation, which is the square root of the variance. See http://en.wikipedia.org/wiki/Binomial_distribution" , since the error of a difference is larger than the error of either of the components. Does this help?
 
Last edited by a moderator:
  • #3
Everything made sense except the propagation of the error. I did not understand the equation given, it had a term COV_{ab} which I have never seen before, and reading the page on covariance does not help as I understand very little of it. Could you give me an equation or explain the term COV please?
 
  • #4
That is a complication, but first, let's check if your two binomial distributions are independent. If they are, then the covariance is zero. (The covariance is a measure of whether the two distributions are correlated.)
 
  • #5
Mapes said:
That is a complication, but first, let's check if your two binomial distributions are independent. If they are, then the covariance is zero. (The covariance is a measure of whether the two distributions are correlated.)

The first is of events that are same-sign (ie ++ or --), while the other is the number of events that are opposite sign (+-), called N(ss) and N(os) respectively. I've taken the uncertainty of these two as:

SQRT( N(os).p_os(1 - p_os) ) and SQRT( N(ss).p_ss(1 - p_ss) )

I would like to know now the uncertainty now of a function that is N(os) - N(ss)
 
  • #6
OK, that doesn't look independent. I haven't run into this type of problem before, so unfortunately you'll need somebody else to stop by and help.
 
  • #7
Ok, Thank you for helping though!
 
  • #8
Okay, well you really only have one binomial distribution here. N(yes) - N(no) = N(yes) - (N(total) - N(yes)) = 2N(yes) - N(total)

This is a linear transformation of a single binomial distribution where n = N(total). Specifically if X is distributed as B(n,p) then your variable is 2X - n.

When you say "uncertainty" you also refer to the "uncertainty of the signal" which makes me think you are talking about the Shannon entropy of the variable. The Shannon entropy of 2X-n is the same as the Shannon entropy of X (assuming that n is a constant and not a random variable), and the entropy of X can be looked up since it is binomial. According to http://en.wikipedia.org/wiki/Binomial_distribution, the Shannon entropy of X is [tex]\frac{1}{2} \ln \left( 2 \pi n e p (1-p) \right) + O \left( \frac{1}{n} \right) [/tex]
 
  • #9
Thank you, I had not thought of considering it as a function of only the n(yes) or n(no), that helps a lot. The uncertainty is to do with a measurement of the signal, ie 112 events +/- 10.6 events, originally this was posted in the physics section. But I think I can solve it with the treatment you have told me about, I am grateful to you,

Kim
 
  • #10
penguindecay said:
Thank you, I had not thought of considering it as a function of only the n(yes) or n(no), that helps a lot. The uncertainty is to do with a measurement of the signal, ie 112 events +/- 10.6 events, originally this was posted in the physics section. But I think I can solve it with the treatment you have told me about, I am grateful to you,

Kim

Actually I got the same problem on what the uncertainty of the n_total is as it too is a sum of binomial distributions
 

FAQ: Uncertainty of binomial distribution?

What is the binomial distribution?

The binomial distribution is a discrete probability distribution that describes the likelihood of obtaining a certain number of successes in a fixed number of independent trials with a binary outcome (e.g. success or failure).

What is the formula for the binomial distribution?

The formula for the binomial distribution is P(x) = (nCx)(p^x)(q^(n-x)), where n is the number of trials, p is the probability of success, q = 1-p, and x is the number of successes.

How is the uncertainty of the binomial distribution calculated?

The uncertainty of the binomial distribution is calculated using the standard deviation formula, which is √(npq), where n is the number of trials and p and q are the probabilities of success and failure, respectively.

What is the relationship between the number of trials and the uncertainty of the binomial distribution?

As the number of trials increases, the uncertainty of the binomial distribution decreases. This is because with more trials, the distribution becomes more symmetrical and the standard deviation becomes smaller.

How can the binomial distribution be used in real-life situations?

The binomial distribution can be used to model and predict the outcomes of binary events, such as the success or failure of a product launch, the likelihood of winning a game, or the probability of a person having a certain genetic trait. It is also commonly used in quality control and market research.

Back
Top