- #1
Matt2
- 2
- 0
Hi, I'm trying to figure out how to compute probability related to a problem I am tackling for work, and I think I have a handle on how to do it with smaller numbers, but no idea how to approach it for larger numbers. (And I need to explain the answers to a judge in simple terms). So here is the problem:
Imagine a company that maintains data about 200,000,000 Americans. Each month, this company takes a completely random sample of 5% of these reports to analyze. However, we are concerned only with 60,000 specific people out of this group of 200,000,000.
So we can visualize this as 60,000 "red marbles" and 199,940,000 "white marbles". Assuming that these combined 200,000,000 marbles are placed into a very large container and that 10,000,000 are selected randomly. I am trying to calculate:
1) The chance that none of the 10,000,000 marbles will be red;
2) The chance that 40 or more of the 10,000,000 marbles will be red;
3) The chance that all of the possible 60,000 red marbles will be included in the 10,000,000 selected.
Of these, the answer to question #2 is the most important, followed by #1, then #3.
Does anyone have an idea on where to start? I thought maybe it would make it easier to simply remove 4 zeroes from each number so that we are working with 20,000 / 6 / 1,000 but it seems that this skews the results. Would appreciate a pointer in the right direction.
Thanks!
Matt
Imagine a company that maintains data about 200,000,000 Americans. Each month, this company takes a completely random sample of 5% of these reports to analyze. However, we are concerned only with 60,000 specific people out of this group of 200,000,000.
So we can visualize this as 60,000 "red marbles" and 199,940,000 "white marbles". Assuming that these combined 200,000,000 marbles are placed into a very large container and that 10,000,000 are selected randomly. I am trying to calculate:
1) The chance that none of the 10,000,000 marbles will be red;
2) The chance that 40 or more of the 10,000,000 marbles will be red;
3) The chance that all of the possible 60,000 red marbles will be included in the 10,000,000 selected.
Of these, the answer to question #2 is the most important, followed by #1, then #3.
Does anyone have an idea on where to start? I thought maybe it would make it easier to simply remove 4 zeroes from each number so that we are working with 20,000 / 6 / 1,000 but it seems that this skews the results. Would appreciate a pointer in the right direction.
Thanks!
Matt