Configure the scenario
Think of this section as describing a story like: "Out of \(N\) customers, \(r\) have some outcome I care about, and I draw a sample of size \(n\). What is the chance I see this outcome \(k\) times?". The labels and numbers you choose below are the ones used throughout the visuals, equations, and narrative explanations.
This short phrase names the outcome you care about (for example, "opens email", "is a VIP customer", or "contains a defect").
Total number of items in the population.
Number of draws in each sample.
How many population members have the outcome you named above.
Used when computing \(P(K = k)\). For "at least one", the app uses the complement rule.
Switch between finite-population draws and independent draws with constant probability.
Choose whether to focus on a specific count or the chance of seeing at least one special item.
What event are we measuring?
When you select Exact: \(P(K = k)\), the event is “the sample contains exactly \(k\) items with this outcome”. For example, if you set \(k = 2\), the app is computing the chance that exactly two of your \(n\) sampled units have the outcome (such as being a VIP or containing a defect).
When you select At least one: \(P(K \ge 1)\), the event is “the sample contains one or more such items”. This is the natural choice for questions like “What is the chance we see any VIP customers in this sample of size \(n\)?”.
Used for Monte Carlo simulation to approximate the distribution.
How does simulation help?
Each simulated sample is a “what if” run of your design: we draw \(n\) items using the sampling mode you chose and count how many have the outcome of interest. Repeating this many times builds up the orange bars in the distribution chart.
As you increase the number of simulated samples, the simulated probabilities \(P(K = k)\) should get closer to the exact hypergeometric / binomial probabilities shown in blue, illustrating the link between theoretical probability models and long-run frequencies.
Additional info about these inputs
Population size \(N\): think of this as the full list of customers, items, or units you could sample from. In many marketing problems \(N\) is large but finite (panel members, current customers, emails on a list).
Sample size \(n\): how many draws you make. For without replacement, this cannot exceed \(N\); for with replacement you can think of re-contacting or re-targeting with possible repeats.
Special items \(r\): a fixed set you care about (VIP customers, high-value prospects, defective units). The model assumes these are known in advance and fixed inside the population.
Target count \(k\): used for \(P(K = k)\). For example, \(k = 0\) is the probability you miss all specials; \(k = 2\) is the chance you get exactly two specials in the sample.
Sampling mode: choose without replacement for one-off samples from a list, and with replacement for repeated, independent trials (like ad impressions) where the same unit could be sampled multiple times.