Distributions Normal Binominal
Distributions Normal Binominal
com/7-statistical-distributions-that-every-data-scientist-should-know-with-
intuitive-explanations-bf967db81f0b
Statistical Distributions are an important tool in data science. A distribution helps us to understand a variable by
giving us an idea of the values that the variable is most likely to obtain. Besides, when knowing the distribution of
a variable, we can do all sorts of probability calculations, to compute probabilities of certain situations occurring.
1. Normal or Gaussian distribution
A variable with a normal distribution has an average, which is also the most common value. Values closer to the
average are more likely to occur, and the further a value is away from the average, the less likely it is to occur. The
normal distribution is also characterized by symmetric variation around the average, described by the standard
deviation. This means that higher values are as common as lower values.
Examples of the normal distribution can be found in many variables that are natural, continuous variables. For
example, the weight or height of animals would follow a normal distribution, as most animals are of the average
weight, some are a little over or underweight but not so many are extremely skinny or extremely fat.
Human IQ is also a very famous example of the normal distribution, where the average is 100 and the standard
deviation is 15. Most people are average intelligent, some are a bit smarter or a bit less smart, and few are very
intelligent or very unintelligent.
2. Binomial Distribution
The Binomial distribution is like a bigger brother of the Bernoulli distribution. It models the number of successes
in a situation of repeated Bernoulli experiments. So rather than focusing on the probability of success, we focus on
a success count.
The two parameters for the Binomial distribution are the number of experiments and the probability of success. A
basic example of flipping a coin ten times would have the number of experiments equal to 10 and the probability
of success equal to 0.5. This gives the following probability for each number of successes out of 10:
Another example of the Binomial distribution would be the probability of
getting in a traffic jam in a given week, knowing that the probability of
getting in a traffic jam on 1 given day is 0.2. This is a repetition of 1
Bernoulli yes/no variable on 5 works days, so the parameters are: number
of experiments is 5 and the probability of success is 0.2. The outcome
graph below shows that it is most likely to have 1 traffic jam, then 0 and
then 2, 3, 4, and 5 respectively.