Poisson Distribution - An Overview - ScienceDirect Topics
Poisson Distribution - An Overview - ScienceDirect Topics
Poisson Distribution
Related terms:
Poisson Distribution
The Poisson distribution is a discrete distribution that measures the probability of a
given number of events happening in a specified time period. In finance, the
Poisson distribution could be used to model the arrival of new buy or sell orders
entered into the market or the expected arrival of orders at specified trading venues
or dark pools. In these cases, the Poisson distribution is used to provide
expectations surrounding confidence bounds around the expected order arrival
rates. Poisson distributions are very useful for smart order routers and algorithmic
trading.
Notation
Parameter
Distribution
Cdf
Mean
Variance
Skewness
Kurtosis
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 1/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
Poisson Distribution
The Poisson distribution describes the probability to find exactly x events in a given
length of time if the events occur independently at a constant rate. In addition, the
Poisson distribution can be obtained as an approximation of a binomial
distribution when the number of trials n of the latter distribution is large, success
probability p is small, and np is a finite number.
The PMF of the Poisson distribution is given by
, where λ is a positive number. Both the mean
and variance of the Poisson distribution are equal to λ.
The maximum likelihood estimate of λ from a sample from the Poisson
distribution is the sample mean.
If λ is large, the probability that a Poisson random variable X takes the value x can
be obtained by approximating X by a normal variable Y with mean and variance λ
and computing the probability that Y lies between x −0.5 and x +0.5.
It is possible to generate a Poisson random variable by generating a random
number following the U(0, 1) distribution and then using the CDF-inversion
method.
The Poisson distribution has been applied in education, for example, by Novick and
Jackson (1974).
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 2/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
where x=0, 1, … represents the discrete random variable, such as ADC counts
recorded by a detection system, and μ>0 is the mean. Figure 9.3.3 depicts this
distribution for different values of μ. It is apparent that the width of the distribution
increases with μ, which indicates that the uncertainty in measurement increases
with an increase in the value of x.
Figure 9.3.3. Poisson probability density for different values of μ. The width of the
distribution, which is a reflection of the uncertainty in measurements, increases
with increase in μ.
Let us now apply the maximum likelihood method to determine the best estimate
of the mean of a set of n measurements, assuming that the underlying process is
Poisson in nature. The best way to do this is to use the maximum likelihood
method we outlined earlier and applied in the previous section when discussing
the binomial distribution. Since Poisson distribution is a discrete probability
distribution, its likelihood function for a set of n measurements can be written as
(9.3.32)
This shows that the simple mean is the most probable value of a Poisson
distributed variable. To determine the error in μ, we first take the second derivative
of the log-likelihood function and then substitute it in Eq. (9.3.22):
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 3/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
(9.3.35)
This is one of the most useful results of the Poisson distribution. It implies that if
we make one measurement, the statistical error we should expect would simply be
the square root of the measured quantity. For example, if we count the number of
γ-ray photons coming from a radioactive source using a GM tube and get a
number N, the statistical error we should expect will simply be Fortunately,
most of the processes we encounter in the field of radiation detection and
measurement, such as the activity of a radioisotope, photoelectric effect, and
electron multiplication in a PMT tube, can all be very well described by Poisson
statistics.
Probability Distributions
Ronald N. Forthofer, ... Mike Hernandez, in Biostatistics (Second Edition), 2007
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 4/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
Table 5.3. Calculation of poisson probabilities, Pr{X = x} = e−μ μx/x!, for μ = 1 and 2.
μ=1 μ=2
1.0000 0.9999
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 5/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
The shape of the Poisson probability mass function with μ equal to 2 (the top plot
in Figure 5.4) is similar to the binomial mass function for a sample of size 10 and π
equal to 0.2 just shown. The cdf (the bottom plot in Figure 5.4) has the same
general shape as that shown in the preceding binomial example, but the shape is
easier to see here, since there are more values for the X variable shown on the
horizontal axis.
Random Variables
Andrew F. Siegel, in Practical Business Statistics (Seventh Edition), 2016
Fig. 7.5.1. The Poisson distribution with 0.5 occurrences expected is a skewed
distribution. There is a high probability, 0.607, that no occurrences will happen at
all.
Fig. 7.5.2. The Poisson distribution with two occurrences expected. The distribution
is still somewhat skewed.
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 7/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
Fig. 7.5.3. The Poisson distribution with 20 occurrences expected. The distribution,
although still discrete, is now fairly close to normal.
There are three important facts about a Poisson distribution. These facts, taken
together, tell you how to find probabilities for a Poisson distribution when you
know only its mean.
Example
How Many Warranty Returns?
Because your firm’s quality is so high, you expect only 1.3 of your products to
be returned, on average, each day for warranty repairs. What are the chances
that no products will be returned tomorrow? That one will be returned? How
about two? How about three?
Since the mean (1.3) is so small, exact calculations are needed. Here are the
details:
From these basic probabilities, you could add up the appropriate probabilities
for 0, 1, and 2 to also find the probability that two items or fewer will be
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 8/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
returned. The probability is, then, 0.27253 + 0.35429 + 0.23029 = 0.857, or
85.7%.
To use Excel to compute these probabilities, you could use the function
“=POISSON (value, mean, FALSE)” to find the probability that a Poisson
random variable is exactly equal to some value, and you could use
“=POISSON (value, mean, TRUE)” to find the probability that a Poisson
random variable is less than or equal to the value. Here are the results:
Example
How Many Phone Calls?
Your firm handles 460 calls per day, on average. Assuming a Poisson
distribution, find the probability that you will be overloaded tomorrow, with
500 or more calls received.
This may be computed directly in Excel using the formula
to find the answer 0.0341, because the POISSON function finds the
probability of being less than or equal to a given number, then we use the
complement rule to find the probability of being greater; note that being “500
or more” is the complement of the event “499 or less.”
The mean, μ = 460, is given, and it follows that the standard deviation is
. The normal approximation is reasonable
because the mean (460) is so large. Since the normal distribution is
continuous, any value over 499.5 will round to 500 or more. For intuition, the
standardized number of calls is
Our situation involves being more than 1.84 standard deviations above the
mean, so we expect to see a few percent. Using the normal distribution, the
answer is a probability of 1 − 0.967 = 0.033, fairly close to the more exact
0.341 found earlier. In conclusion, you may expect to be overloaded
tomorrow with probability only about 3% (not very likely but within
possibility).
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 9/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
It is an elementary exercise to show that the mean and variance of Y are both equal
to λ; E[Y] = Var[Y] = λ. In fact, this property characterizes the Poisson distribution. A
related property is that the cumulant generating function of a Poisson random
variable is given by , where MY (t) is the
moment generating function of Y. This can be proved by simple calculations, but
for this presentation, it is instructive to consider the Poisson distribution as a
member of the natural exponential family of distributions.
Let f (x; θ) denote the density function of the natural exponential family with
parameter θ, i.e.,
(2)
where the notation is as in (3). Regardless of the chosen model, a fact that remains
true is that both (3) and (4) belong to the class of generalized linear models as
introduced by Nelder and Wedderburn (1972) and elaborated further by McCullagh
and Nelder (1989). Recall, that a generalized linear model consists of three
components; the random component that belongs to the exponential family of
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 10/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
distributions (2) with E[X] = μ, the systematic component η, and the link function
g(⋅). The link function is a monotone twice differentiable function that is chosen by
the user (or can be estimated). This function associates the random and systematic
component via g(μ) = η. For the Poisson distribution, it is clear that both (3) and (4)
introduce a generalized linear model with and g(λ) = λ (for
(3)) and g(λ) = log λ (for (4)). Estimation and inference are based on the maximum
likelihood theory – this topic has been described in several texts; see McCullagh
and Nelder (1989) and Agresti (2002), for example. In the next section, we explore
these ideas in the context of count time series.
Poisson distribution
The Poisson distribution for a variable λ is:
[23]
for k = 0, 1, 2, 3, etc. The mean of this distribution is λ and the standard deviation
is √λ. When the number n of trials is very large and the probability p small, e.g. n >
25 and p < 0.1, binomial probabilities are often approximated by the Poisson
distribution. Thus, since the mean of the binomial distribution is np (equation [21])
and the standard deviation (equation [22]) approximates to √np when p is small, we
can consider λ to represent np. Thus λ can be considered to represent the average
number of successes per unit time or unit length or some other parameter.
Key point
Approximating the binomial distribution to the Poisson distribution
If p is the possibility of an event occurring and q the possibility that it does
not occur, then q + p = 1 and so if we consider n samples, we have (q + p)n = 1
and the binomial expression gives:
If p is small then q = 1 − p × 1 and with n large the first few terms have n − 1
approximating to n, n − 2 to n, etc. The binomial expression can thus be
approximated to:
If we let np = λ then:
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 11/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
But this is the series for e1 (see Table 7.1) and so 1n = e1. We can thus write
the binomial expression as:
Example
2% of the output per month of a mass produced product have faults. What is
the probability that of a sample of 400 taken that 5 will have faults?
Assuming the Poisson distribution, we have λ = np = 400 × 0.02 = 8 and so
equation [23] gives for k = 5:
Example
The output from a CNC machine is inspected by taking samples of 60 items.
If the probability of a defective item is 0.0015, determine the probability of
the sample having (a) two defective items, (b) more than two defective items.
(a) We have n = 60 and p = 0.0015. Thus, assuming a Poisson distribution,
we have λ = np = 60 × 0.0015 = 0.09 and so equation [23] gives the
probability of two defective items as (0.092 × e−0.09)/2! = 3.7 × 10−3 or
0.37%.
(b) The probability of there being more than two defective items is 1 − {P(0) +
P(1) + P(2)] = 1 − e−λ{1 + λ + λ2/2!) = 1 − e−0.09{1 + 0.09 + 4.5 × 10−3} = 1.36
× 10−4 or 0.01%.
Example
There is a 1.5% probability that a machine will produce a faulty component.
What is the probability that there will be at least 2 faulty items in a batch of
100?
Assuming the Poisson distribution can be used, we have λ = np = 100 × 0.015
= 1.5 and so the probability of at least 2 faulty items will be:
Finding Probabilities
R.H. Riffenburgh, in Statistics in Medicine (Third Edition), 2012
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 12/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
occur on any one trial. The number of cases of bubonic plague would follow
Poisson: a large number of patients can be found with chills, fever, tender enlarged
lymph nodes, and restless confusion, but the chance of the syndrome being plague
is extremely small for any randomly chosen patient. This distribution is named for
Siméon Denis Poisson, who published the theory in 1837. The classic use of
Poisson was in predicting the number of deaths of Prussian army officers from
horse kicks from 1875 to 1894; there was a large number of kicks, but the chance
of death from a randomly chosen kick was small.
Recommended publications:
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 13/14
1/10/2021 Poisson Distribution - an overview | ScienceDirect Topics
https://www.sciencedirect.com/topics/mathematics/poisson-distribution 14/14