What is the correct interpretation of the probability in this scenario?

In summary, the conversation discusses the probability of an event where an elementary school in New York state had five sets of twins in its incoming kindergarten class. This event was initially reported as a "statistical impossibility" but the conversation raises questions about the interpretation of the probability and whether it is actually rare enough to be newsworthy. It also considers the possibility of this event occurring in any school, county, or state, and whether it can still be considered a "statistical impossibility." The conversation concludes with a discussion about the combinatorial coefficient and the potential for double counting.
  • #1
mattmns
1,128
6
I am having some trouble with the interpretation of the following question (specifically, part (a)).

Here is the question from the book.
----------------------------------------
Often, news stories that are reported as startling "one-in-a-million'' coincidences are actually, upon closer examination, not rare events and can even be expected to occur. A few years ago an elementary school in New York state reported that its incoming kindergarten class contained five sets of twins. This, of course, was reported throughout the state, with a quote from the principal that this was a "statistical impossibility''. Was it? Or was it an instance of what Diaconis and Mosteller call the "law of truly large number''? Let's do some calculations.

(a) The probability of a twin birth is approximately 1/90, and we can assume that an elementary school will have approximately 60 children entering kindergarten (three classes of 20 each). Explain how our "statistically impossible'' event can be thought of as the probability 5 or more successes from a binomial(60, 1/90). Is this even rare enough to be newsworthy?

(b) Even if the probability in part (a) is rare enough to be newsworthy, consider that this could have happened in any school in the county, and in any county in the state, and it still would have been reported exactly the same. (The "Law of truly large numbers'' is starting to come into play.) New York state has 62 counties, and it is reasonable to assume that each county has five elementary schools. Does the event still qualify as a "statistical impossibility'', or is it becoming something that could be expected to occur?

(c) If the probability in (b) still seems small, consider further that this event could have happened in any of the 50 states, during any of the last 10 years, and still would have received the same news coverage.
----------------------------------------
In part (a) I am not agreeing with the book that this probability can be thought of as 5 or more successes from a binomial(60, 1/90) distribution.

What I don't get is what is a success. We have 60 students, is a success the event that one of these students has a twin? If so, then there are actually two success because the twin, who I would assume is also in kindergarten at the same school, would also be a success. So I would think that we should be looking at the probability of 10 or more successes, not 5. If we had 5 successes then that would mean 5 of the 60 students have twins, which sounds right in some sense, but if the twins of these students are not in the 60 then where are they, and if they are in the 60 shouldn't there be 10, not 5?

Any thoughts on this? Am I missing something here, or did the book make a mistake?

Thanks!
 
Physics news on Phys.org
  • #2
I think the book is correct "in the first order"; you are double counting.

Suppose there are n students in a classroom. We ask each student: "do you have a twin sibling?" If the the student answers "no," he or she leaves the classroom. If the student answers "yes," we ask the student to identify the sibling. Then both of them leave. If all students were to answer "no," the combinatorial coefficient would be 0Cn. If only one student answered "yes," then the combinatorial coefficient isn't 2Cn, it is 1C(n-1). If exactly k students answered yes, then the combinatorial coefficient isn't (2k)Cn, it is kC(n - k).
 

FAQ: What is the correct interpretation of the probability in this scenario?

1. What is probability interpretation?

Probability interpretation is the process of assigning meaning to the numerical values that represent the likelihood of a particular event or outcome occurring. It helps us understand the chances of an event happening and make informed decisions based on that understanding.

2. What are the different types of probability interpretations?

There are three main types of probability interpretations: frequentist, subjective, and classical. Frequentist interpretation is based on the idea that probability is the long-term frequency of an event occurring. Subjective interpretation sees probability as a measure of an individual's belief in an event happening. Classical interpretation is based on the principle of equally likely outcomes.

3. How is probability interpreted in real-life scenarios?

In real-life scenarios, probability is often interpreted using the frequentist approach. For example, if you flip a coin multiple times and get heads 50% of the time, the frequentist interpretation would say that the probability of getting heads is 0.5. However, in situations where personal beliefs or preferences come into play, the subjective interpretation may be more appropriate.

4. How do you calculate probability using different interpretations?

The calculation of probability depends on the interpretation being used. In the frequentist approach, probability is calculated by dividing the number of times an event occurs by the total number of trials. In the subjective approach, probability is based on an individual's personal belief and cannot be calculated. In the classical approach, probability is calculated by dividing the number of favorable outcomes by the total number of possible outcomes.

5. What are the limitations of probability interpretation?

One limitation of probability interpretation is that it is based on assumptions and cannot provide a definitive answer. It is also dependent on the quality and accuracy of the data used in the calculation. Additionally, different interpretations may lead to different conclusions, making it important to carefully consider the interpretation being used in a given situation.

Similar threads

Back
Top