Statistics Tip of the Week: Sampling with Replacement is required when using the Binomial Distribution
One of the requirements for using the Binomial Distribution is that each trial must be independent. One consequence of this is that the Sampling must be With Replacement.
To illustrate this, let's say we are doing a study in a small lake to determine the Proportion of lake trout. Each trial consists of catching and identifying 1 fish. If it's a lake trout, we count 1. The population of the fish is finite. We don't know this, but let's say it's 100 total fish 70 lake trout and 30 other fish.
Each time we catch a fish, we throw it back before catching another fish. This is called Sampling With Replacement. Then, the Proportion of lake trout is remains at 70%. And the Probability for any one trial is 70% for lake trout.
If, on the other hand, we keep each fish we catch, then we are Sampling Without Replacement. Let's say that the first 5 fish which we catch (and keep) are lake trout. Then, there are now 95 fish in the lake, of which 65 are lake trout. The percentage of lake trout is now 65/95 =68.4%. This is a change from the original 70%.
So, we don't have the same Probability each time of catching a lake trout. Sampling Without Replacement has caused the trials to not be independent. So, we can't use the Binomial Distribution. We must use the Hypergeometric Distribution instead.
11/5/2016 02:36:56 am
This is a very simple concept.
Andrew A Jawlik
11/5/2016 07:28:06 am
Thank you for the comment. I'm not sure I understand the request. The concept is simple, as you said, but getting into mathematical examples of the use of the Hypergeometric Distribution would only make things more complicated.
11/6/2016 07:20:15 am
but when population size.N. Is very large as in samping both the cases will become the same thus use of binomial can not be denied
Andrew A Jawlik
11/6/2016 09:11:50 am
Yes. Thank you. The Binomial Distribution can be used as an approximation for the Hypergeometric when the Population Size (N) is large relative to the Sample Size (n) -- for example when N > 10n.
Leave a Reply.
Andrew A. (Andy) Jawlik is the author of the book, Statistics from A to Z -- Confusing Concepts Clarified, published by Wiley.