0% found this document useful (0 votes)
63 views1 page

Distributions Normal Binominal

This document discusses statistical distributions that are important for data scientists to understand. It introduces the normal distribution, which describes variables that tend to cluster around an average value, and the binomial distribution, which models the number of successes from repeated Bernoulli experiments with a fixed probability of success. Examples of variables that follow a normal distribution include heights, weights, and IQ scores. The binomial distribution is used to model situations like the number of traffic jams in a week based on the daily probability of a jam.

Uploaded by

Ana Chikovani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
63 views1 page

Distributions Normal Binominal

This document discusses statistical distributions that are important for data scientists to understand. It introduces the normal distribution, which describes variables that tend to cluster around an average value, and the binomial distribution, which models the number of successes from repeated Bernoulli experiments with a fixed probability of success. Examples of variables that follow a normal distribution include heights, weights, and IQ scores. The binomial distribution is used to model situations like the number of traffic jams in a week based on the daily probability of a jam.

Uploaded by

Ana Chikovani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Source: https://towardsdatascience.

com/7-statistical-distributions-that-every-data-scientist-should-know-with-
intuitive-explanations-bf967db81f0b
Statistical Distributions are an important tool in data science. A distribution helps us to understand a variable by
giving us an idea of the values that the variable is most likely to obtain. Besides, when knowing the distribution of
a variable, we can do all sorts of probability calculations, to compute probabilities of certain situations occurring.
1. Normal or Gaussian distribution
A variable with a normal distribution has an average, which is also the most common value. Values closer to the
average are more likely to occur, and the further a value is away from the average, the less likely it is to occur. The
normal distribution is also characterized by symmetric variation around the average, described by the standard
deviation. This means that higher values are as common as lower values.
Examples of the normal distribution can be found in many variables that are natural, continuous variables. For
example, the weight or height of animals would follow a normal distribution, as most animals are of the average
weight, some are a little over or underweight but not so many are extremely skinny or extremely fat.
Human IQ is also a very famous example of the normal distribution, where the average is 100 and the standard
deviation is 15. Most people are average intelligent, some are a bit smarter or a bit less smart, and few are very
intelligent or very unintelligent.
2. Binomial Distribution
The Binomial distribution is like a bigger brother of the Bernoulli distribution. It models the number of successes
in a situation of repeated Bernoulli experiments. So rather than focusing on the probability of success, we focus on
a success count.
The two parameters for the Binomial distribution are the number of experiments and the probability of success. A
basic example of flipping a coin ten times would have the number of experiments equal to 10 and the probability
of success equal to 0.5. This gives the following probability for each number of successes out of 10:
Another example of the Binomial distribution would be the probability of
getting in a traffic jam in a given week, knowing that the probability of
getting in a traffic jam on 1 given day is 0.2. This is a repetition of 1
Bernoulli yes/no variable on 5 works days, so the parameters are: number
of experiments is 5 and the probability of success is 0.2. The outcome
graph below shows that it is most likely to have 1 traffic jam, then 0 and
then 2, 3, 4, and 5 respectively.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy