- #1
yahastu
- 79
- 7
I've come up with a problem that's important for an algorithm I'm developing, and I don't know how to solve it. Wondering if anyone here can help?
I have an initial set S1 of size N.
S2 is created by randomly sampling M samples from S1 with replacement (meaning the same item could be selected multiple times).
Let C = |S2| be the size of S2 (ie, the number of unique elements that get sampled).
For what value of X can we say that C > X with 95% probability?
It would be easy to solve this problem through numerical simulation, but I need a fast analytical solution (or approximate).
Any ideas?
I have an initial set S1 of size N.
S2 is created by randomly sampling M samples from S1 with replacement (meaning the same item could be selected multiple times).
Let C = |S2| be the size of S2 (ie, the number of unique elements that get sampled).
For what value of X can we say that C > X with 95% probability?
It would be easy to solve this problem through numerical simulation, but I need a fast analytical solution (or approximate).
Any ideas?