Business Analytics Week 5
Business Analytics Week 5
1
Probability
INTRODUCTION: PROBABILITY
Probability is a measure of the relative likelihood of an event happening in the future. One method
of estimation of the likelihood of an event in the future is to use the relative frequency of events in
the past.
When someone tells you the probability of something happening, they are telling you how likely
that something is. When people buy lottery tickets, the probability of winning is usually stated, and
sometimes, it can be something like 1/10,000,000 (or even worse). This tells you that it is not very
likely that you will win.
In notational form:
P(x) = n(x)/n
where n(x) = number of times x happened and n = total number of outcomes (sample size). n(x) is
sometimes simplified as f, where f is used to represent the number of times (frequency) an
outcome:
p=f/n
where p is understood to be a probability of an event (outcome), f is the frequency of that outcome,
and n is the sample size.
PROBABILITY DISTRIBUTIONS
A Random Variable is a set of uncertain outcomes, resulting from an event of a random process.
The set of probabilities (likelihoods) of all outcomes of the random variable is called a probability
Course Module
distribution. As an example, consider the demand for a specific model of car next month. It is
uncertain how many cars will be sold; this quantity is considered to be a random variable and the
probabilities associated with each possible quantity of demand constitute the probability
distribution.
A probability distribution is a statistical function that describes all the possible values and
likelihoods that a random variable can take within a given range. This range will be bounded
between the minimum and maximum possible values, but precisely where the possible value is
likely to be plotted on the probability distribution depends on a number of factors. These factors
include the distribution's mean (average), standard deviation, skewness, and kurtosis.
Random variables, and their associated probability distributions, are categorized as either
Discrete or Continuous. For discrete random variables, the probability function that details the
probability distribution is called the probability mass function (pmf). For continuous random
variables the probability function is called the probability density function (pdf). Thus, for a
discrete random variable, the pmf is a single bar of the probability distribution (histogram). For a
continuous variable, the pdf is the height of the continuous probability distribution curve.
Recall that the probability that a random variable, X, has an outcome of x is defined, or calculated,
according to a probability function, f(x).
Similarly, if f(x) is a continuous function then the compliment, 1 − F(x), is the integral of f(x) from x
to ∞:
Fundamentals Of Business Analytics
3
Probability
The probability of an event not occurring is called the Complement and is 1 − P(A). Two outcomes
are called disjoint or mutually exclusive if they both cannot happen. For instance, for a single die
roll the outcomes 1 and 2 are disjoint since they both cannot occur. On the other hand, the
outcomes 1 and rolling an odd number are not disjoint since both occur if the outcome of the roll
is a 1.
Course Module
Calculating the probability of disjoint outcomes is easy. When rolling a die, the outcomes 1 and 2
are disjoint, and the probability that one of these outcomes will occur can be computed by adding
their separate probabilities as
P(1 or 2) = P(1) + (P(2) = 1/6 + 1/6 = 1/3
What about the probability of rolling a 1, 2, 3, 4, 5, or 6? Here again, all of the outcomes are disjoint
so we add the probabilities:
P(1or 2 or 3 or 4 or 5 or 6)
= P(1) + P(2) + P(3) + P(4) + P(5) + P(6)
= 1/6 + 1/6 + 1/6 + 1/6 + 1/6 + 1/6 = 1
This property is known as The Addition Rule, and guarantees the accuracy of this approach when
the outcomes are disjoint. Thus, in general:
OR = Additive
Two processes are independent if knowing the outcome of one provides no information about the
likelihood of the outcome of the other. For instance, flipping a coin and rolling a die are two
independent processes; knowing the coin was heads does not help determine the outcome of a die
roll. On the other hand, stock prices usually move up or down together, so they are not
independent. Thus, in general:
AND = Multiply
The Compliment of outcome x is all of the other outcomes that are not the outcome x. For
example, the complement to P(Heads) is P(Tails) and the complement to rolling a 1 on a single die
is 5/6. Typically, if the probability of the event x, is P(x) = p, then the compliment of the event:
P(x’) = 1-P(x)
or, more compactly
q = 1-p
where q is the compliment of p.
Conditional Probability is a probability that depends upon the occurrence of another event.
Mathematically, the conditional probability of an event, A, is the probability that the event will
occur given the knowledge that event B has already occurred. This probability is written P(A|B)
and is stated as
Fundamentals Of Business Analytics
5
Probability
Conditional probability is the opposite of independence. In conditional probability the answer is “it
depends” … for independence, “it does not depend.”
additional evidence. In finance, Bayes' theorem can be used to rate the risk of lending money to
potential borrowers.
Bayes' theorem is also called Bayes' Rule or Bayes' Law and is the foundation of the field of
Bayesian statistics.
Applications of the theorem are widespread and not limited to the financial realm. As an example,
Bayes' theorem can be used to determine the accuracy of medical test results by taking into
consideration how likely any given person is to have a disease and the general accuracy of the test.
Bayes' theorem relies on incorporating prior probability distributions in order to generate
posterior probabilities. Prior probability, in Bayesian statistical inference, is the probability of an
event before new data is collected. This is the best rational assessment of the probability of an
outcome based on the current knowledge before an experiment is performed. Posterior
probability is the revised probability of an event occurring after taking into consideration new
information. Posterior probability is calculated by updating the prior probability by using Bayes'
theorem. In statistical terms, the posterior probability is the probability of event A occurring given
that event B has occurred.
Bayes' theorem thus gives the probability of an event based on new information that is, or may be
related, to that event. The formula can also be used to see how the probability of an event
occurring is affected by hypothetical new information, supposing the new information will turn
out to be true. For instance, say a single card is drawn from a complete deck of 52 cards. The
probability that the card is a king is four divided by 52, which equals 1/13 or approximately
7.69%. Remember that there are four kings in the deck. Now, suppose it is revealed that the
selected card is a face card. The probability the selected card is a king, given it is a face card, is four
divided by 12, or approximately 33.3%, as there are 12 face cards in a deck.
Where:
P(A)= The probability of A occurring
P(B)= The probability of B occurring
P(A∣B)=The probability of A given B
P(B∣A)= The probability of B given A
P(A⋂B))= The probability of both A and B occurring
Course Module
EXAMPLES OF BAYES' THEOREM
Below are two examples of Bayes' theorem in which the first example shows how the formula can
be derived in a stock investing example using Amazon.com Inc. (AMZN). The second example
applies Bayes' theorem to pharmaceutical drug testing.
The conditional probability of A given that B has happened can be expressed as:
• If A is: "AMZN price falls" then P(AMZN) is the probability that AMZN falls; and B is: "DJIA is already
down," and P(DJIA) is the probability that the DJIA fell; then the conditional probability expression
reads as "the probability that AMZN drops given a DJIA decline is equal to the probability that
AMZN price declines and DJIA declines over the probability of a decrease in the DJIA index.
Where P(AMZN) and P(DJIA) are the probabilities of Amazon and the Dow Jones falling, without
regard to each other.
The formula explains the relationship between the probability of the hypothesis before seeing the
evidence that P(AMZN), and the probability of the hypothesis after getting the evidence
P(AMZN|DJIA), given a hypothesis for Amazon given evidence in the Dow.
Bayes' theorem shows that even if a person tested positive in this scenario, it is
actually much more likely the person is not a user of the drug.
Fundamentals Of Business Analytics
9
Probability
Pinder, J. (2017). Introduction to Business Analytics Using Simulation, 125 London Wall, London EC2Y 5AS,
United Kingdom
Schniederjans, M. (2017), Business Analytics Principles, Concepts, and Applications, Pearson Education, Inc,
Upper Saddle River, New Jersey 07458
Online Reference:
https://www.investopedia.com/
https://www.managementstudyguide.com/
https://courses.lumenlearning.com/
https://machinelearningmastery.com/what-is-probability/
Course Module