Variation of the coupon problem

  • Thread starter Smitz
  • Start date
  • Tags
    Variation
In summary, the conversation discusses a problem involving a sample size of 900 balls, where a certain number of balls are randomly selected and tested each week. The goal is to determine the number of balls that need to be tested each week to achieve a 90% confidence level that 90% of the sample has been tested after 3 years. The solution involves using a two-tailed test and the cumulative normal distribution function to determine the critical sample size.
  • #1
Smitz
1
0
I have a problem which is basically a variation of the coupon problem... I can't get my head around it as I haven't done much stats :rolleyes:.

I have a sample size of 900, say a pot of 900 balls, each with a unique id. Each week I am going to select a set number of balls randomly from the pot to test them. After testing, the balls go back into the pot. How many balls do I need to test every week so that after 3 years I can be, say, 90% confident that 90% of the sample have been tested in that period.

Any help would be greatly appreciated (especially a solution!:biggrin:)

Andrew
 
Physics news on Phys.org
  • #2
This is related to the critical sample size problem:
http://en.wikipedia.org/wiki/Confidence_interval

E.g. to apply a "two-tailed" test to a given value x of a normally distributed random variable with mean m and variance σ2, one can define z = (x - m) / (σ/√n) then solve Φ(z) = 0.975 for n, where Φ is the cumulative normal distribution function.
 
  • #3


The variation of the coupon problem that you have described is known as the "balls in a jar" problem. In this case, the balls represent the coupons and the jar represents the population from which the coupons are being drawn. The goal is to determine how many balls need to be drawn (or coupons need to be tested) in order to have a certain level of confidence that a certain percentage of the population has been covered.

To solve this problem, we can use the concept of confidence intervals. A confidence interval is a range of values within which we can be confident that the true population parameter lies. In this case, our population parameter is the proportion of balls (or coupons) that have been tested.

To determine the sample size needed, we need to consider the following factors:
1. Confidence level: This is the level of certainty that we want to have in our results. In this case, you have specified a confidence level of 90%.
2. Margin of error: This is the maximum amount by which the true population proportion can differ from our sample proportion. In other words, it is the degree of precision we want in our estimate. For this problem, we can set the margin of error to be 5% (since you want to be 90% confident that 90% of the sample has been tested, we can allow for a 5% margin of error in either direction).
3. Population size: This is the total number of balls in the pot, which in your case is 900.

Using these factors, we can use a formula to calculate the sample size needed:
n = (Z^2 * p * q) / (e^2)
where:
n = sample size
Z = the z-score corresponding to the desired confidence level (for a 90% confidence level, Z = 1.645)
p = estimated proportion of the population that we want to capture (in this case, 90%)
q = 1-p (or the proportion of the population that we do not want to capture, which is 10% in this case)
e = margin of error (in decimal form, so 5% = 0.05)

Plugging in the values, we get:
n = (1.645^2 * 0.9 * 0.1) / (0.05^2) = 119.16

So, you would need to test 119 balls every week in
 

FAQ: Variation of the coupon problem

What is the "coupon problem"?

The "coupon problem" is a mathematical probability problem that involves collecting a set of items, each with a unique coupon or stamp, until all items have been collected. The goal is to determine how many items need to be collected on average in order to have a complete set.

How does variation impact the coupon problem?

Variation in the coupon problem refers to the different outcomes that can occur when collecting the items. This variation can impact the number of items needed to collect in order to have a complete set. It can also affect the likelihood of completing the set within a certain number of collections.

What factors contribute to variation in the coupon problem?

There are several factors that can contribute to variation in the coupon problem, including the number of unique items, the probability of getting a specific item, and the total number of collections. Additionally, the method of collecting (random or non-random) can also affect the variation.

How can variation be calculated in the coupon problem?

Variation in the coupon problem can be calculated using mathematical formulas and simulations. These calculations take into account the number of unique items, the probability of getting a specific item, and the number of collections. By analyzing the variation, we can better understand the likelihood of completing a set within a certain number of collections.

How is the coupon problem relevant in real-world situations?

The coupon problem has real-world applications in various fields such as statistics, economics, and marketing. It can be used to analyze consumer behavior and predict the likelihood of a product being purchased or a service being used. It can also be used to optimize inventory management and minimize waste by determining the number of items needed to have a complete set.

Similar threads

Replies
57
Views
5K
Replies
8
Views
2K
Replies
7
Views
2K
Replies
4
Views
2K
Replies
18
Views
4K
Back
Top