1 Module Notes
Probabilities
We use probabilities to talk about uncertain events. Probability is defined as the chance, likelihood, or possibility that a
particular event will occur. We often use the letter “p” to represent a probability. When writing specific probability statements,
however, we will use “Pr(Event)” instead.
Probabilities can be represented as proportions (0 to 1) or as percentages (0% to 100%). A probability of 0 or 0% means that
the event never happens—it is impossible. A probability of 1 or 100% means that the event always happens—it is certain.
We can learn about probabilities in different ways. Some events have theoretical probabilities. When flipping a coin, for
example, we know that the probability that the coin will land heads (or tails) up is .5. When rolling dice with six faces, we know
that the probability that a die will land with any particular face up—either 1, 2, 3, 4, 5, or 6—is 1/6 ≈ .1667. Because a 52-card
deck includes 4 aces, we know the probability that a randomly selected card is an ace is 4/52 ≈ .0769. We learn the probability
of other events empirically, using the actual frequency that the event occurs. For example, a manufacturer of children’s
products might want to know the probability that an American family has children aged 5 or younger. Using the most recent US
census, you can divide the number of families with children aged 5 or younger by the total number of families. A financial
analyst might want to know the probability that Apple’s stock price increases after a new product introduction. Using Apple’s
stock price history, the analyst can divide the number of times that the stock has increased after a new product introduction by
the total number of new product introductions. It is important to note that using only a sample of US families or new product
introductions is not enough to determine the true probability, though we will talk about how to use samples of data in Module 3.
Probabilities are numbers assigned to events and have the following rules.
- 0 ≤ Pr(A) ≤ 1: If Pr(A) = 0, there is no possible way that the event A will occur in a trial. If Pr(A) = 1, the event A
will certainly occur.
- Pr(A) = 1 − Pr(Aᶜ): The probability of an event occurring is 1 minus the probability of it not occurring. Here, Aᶜ is referred
to as the complement of the event A. For example, when tossing a coin, tails is the complement of heads.
- Independence: Two events are independent of each other if the occurrence of one has no influence on the probability
of the other. For example, when tossing two coins, the outcome of one coin will not affect the other.
o Pr(A ∧ B) = Pr(A) × Pr(B): For two independent events A and B, the probability that both A and B occur is the
product of their probabilities. When tossing two coins, the probability of getting heads both times is
Pr(Head) × Pr(Head) = 1/2 × 1/2 = 1/4.
- Two events are mutually exclusive (or disjoint) if two events have no outcomes in common. For example, when
tossing a coin, we cannot get “heads” and “tails” at the same time.
o Pr(A ∨ B) = Pr(A) + Pr(B): For two mutually exclusive events A and B, the probability that either occurs is the sum of
their probabilities. For example, when tossing a coin, Pr(Head ∨ Tail) = Pr(Head) + Pr(Tail) = 1.
o Pr ( A∨B )=Pr ( A ) + Pr ( B )−Pr (A∧B): If two events are not mutually exclusive, we must subtract the probability of
both occurring.
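The rules above can be checked with a few lines of Python. This is a minimal sketch, not part of the original notes; the coin and card probabilities are the theoretical values discussed earlier, and the ace-or-spade example is an illustration chosen here to show the non-mutually-exclusive case.

```python
# Complement rule: Pr(Tail) = 1 - Pr(Head)
p_head = 0.5
p_tail = 1 - p_head
assert p_tail == 0.5

# Independence: Pr(Head and Head) = Pr(Head) * Pr(Head)
p_two_heads = p_head * p_head
assert p_two_heads == 0.25

# Mutually exclusive events: Pr(Head or Tail) = Pr(Head) + Pr(Tail)
p_head_or_tail = p_head + p_tail
assert p_head_or_tail == 1.0

# Not mutually exclusive (illustrative example): drawing an ace (4/52)
# or a spade (13/52). The ace of spades (1/52) would be counted twice,
# so subtract it once.
p_ace, p_spade, p_ace_of_spades = 4/52, 13/52, 1/52
p_ace_or_spade = p_ace + p_spade - p_ace_of_spades
assert abs(p_ace_or_spade - 16/52) < 1e-12
```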
Random Variables
If an uncertain event has numeric outcomes (for example, the rate of return on an asset, the selling price of a commercial or
residential property, the daily/weekly demand for a product or service, the daily production of a plant, a customer’s satisfaction
score, etc.) we call it a random variable. The vast majority of this course will focus on random variables.
To describe a random variable completely, we need to know two things: (1) every possible numeric outcome and (2) the
probability of each outcome. If we have a complete description of both, then we know the random variable’s distribution.
There are two general types of random variables: discrete and continuous.
A discrete random variable is one for which all possible outcomes can be listed. A continuous random variable is one for which
the outcomes are so numerous that they cannot be listed. An example of a continuous random variable is the time it takes to
process a customer order. If measured with infinitesimal precision, one could not list all of the possible outcomes. In practice,
however, continuous distributions are often used to approximate discrete random variables if the number of possible outcomes
is very large.
Example: Craps, a Discrete Distribution. Define a random variable whose value is the sum of the dots observed when
rolling a pair of dice. Construct the probability distribution for this random variable.
Outcomes       2     3     4     5     6     7     8     9     10    11    12
Probabilities  1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36
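The distribution can be constructed by brute force: enumerate all 36 equally likely rolls of a pair of dice and count how often each sum occurs. A minimal sketch using only the Python standard library:

```python
from collections import Counter
from fractions import Fraction

# Enumerate all 36 equally likely (die 1, die 2) rolls and count each sum.
counts = Counter(d1 + d2 for d1 in range(1, 7) for d2 in range(1, 7))

# Convert counts to exact probabilities out of 36.
dist = {s: Fraction(n, 36) for s, n in sorted(counts.items())}

print(dist[7])             # 1/6  (i.e., 6/36, the most likely sum)
print(sum(dist.values()))  # 1   (probabilities sum to one)
```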
We frequently summarize information for a discrete random variable by means of a probability histogram. The probability
histogram is a visual display of the outcomes (plotted along the x-axis), and the probabilities of those outcomes, which are
represented by vertical bars.
[Probability histogram of the dice sums: outcomes 2 through 12 along the x-axis (labeled “Sum”), with a vertical bar showing the probability of each outcome.]
The expected value or mean of random variable X is denoted E(X) or μ; for a discrete random variable, the formula is
μ = E(X) = Σᵢ xᵢ pᵢ
The expected value is the “theoretical” average that is computed by weighting each outcome by its probability and then
summing over all possible outcomes. For the sum of the dice:
xᵢ     pᵢ      xᵢ pᵢ
2      1/36     2/36
3      2/36     6/36
4      3/36    12/36
5      4/36    20/36
6      5/36    30/36
7      6/36    42/36
8      5/36    40/36
9      4/36    36/36
10     3/36    30/36
11     2/36    22/36
12     1/36    12/36

Sum of the third column: 252/36 = 7 = μ = E(X)
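The weighted sum above can be reproduced exactly in Python, a sketch using exact fractions so no rounding is involved:

```python
from fractions import Fraction

# Outcomes 2..12 and their probabilities for the sum of two dice.
outcomes = range(2, 13)
probs = [Fraction(n, 36) for n in (1, 2, 3, 4, 5, 6, 5, 4, 3, 2, 1)]

# Expected value: weight each outcome by its probability and sum.
mu = sum(x * p for x, p in zip(outcomes, probs))
print(mu)  # 7
```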
Find the value 7 on the histogram above. We say that E ( X ) or μ is a measure of central tendency for the random variable X.
Example: Electric Motors. You sell large electric motors to a single customer. Based on your historical data, you know that
demand for your motors from your main customer can be 0, 1, 2, or 4 (4 come on a pallet). The distribution is
Demand (xᵢ)       0     1     2     4
Probability (pᵢ)  .40   .40   .10   .10
Another key measure is the expected value of (X − E(X))²; this is called the variance of X. The variance, written either as
Var(X) or σ², is defined by the formula
σ² = Var(X) = Σᵢ (xᵢ − E(X))² pᵢ
Remember that pᵢ is the probability that X takes the value xᵢ. The formula looks complicated, but a few simple examples will
clarify its calculation and help us understand what it tells us. Observe that you must compute the expected value of X before
you can compute the variance. Var(X) or σ² measures the dispersion of a random variable.
To compute the variance of demand for the motor example discussed, follow these steps:
Step 1. Compute the expected value: E(X) = 0(.40) + 1(.40) + 2(.10) + 4(.10) = 1.0.
Step 2. List all outcomes for (xᵢ − E(X))², their associated probabilities, and the products.
Step 3. Sum the products to obtain the variance: Var(X) = 1(.40) + 0(.40) + 1(.10) + 9(.10) = 1.40.
The variance of a random variable is not the only measure of dispersion for a random variable. An easier measure to interpret
is the standard deviation, which is the square root of the variance. For the random variable in the preceding example, the
standard deviation, denoted by the Greek letter σ, is √1.40 ≈ 1.18.
The standard deviation helps us determine which outcomes are more or less likely. A general rule of thumb for practical
statistical applications is that about 95% of observed outcomes will fall within two standard deviations (± 2σ) of the mean.
Over 99.5% of all observed outcomes will be values that are within three standard deviations ( ± 3 σ ) of the mean.
Note: The term six sigma refers to the probability of a defect or error in a production process. In a six-sigma ( ± 6 σ ) process,
99.99966% of the products are expected to be free of defects or errors.
Two rules describe how the mean and variance change when a random variable is scaled or shifted:
1. If X is a random variable with mean E(X) and variance Var(X), then for any constant c, cX is a (new) random variable with mean cE(X)
and variance c²Var(X).
2. If X is a random variable with mean E(X) and variance Var(X), then for any constant d, d + X is a (new) random variable with mean d
+ E(X) and variance Var(X).
Example: Translating and Scaling Demand for Motors. Using the random variable X from the previous motor problem and
the formulas above, calculate the mean and variance of:
(a) 3X
(b) 2+7X
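A sketch of the calculation, applying the two rules to E(X) = 1.0 and Var(X) = 1.40 from the motor example (the printed values are one way to work parts (a) and (b)):

```python
# Mean and variance of demand X from the motor example.
mu, var = 1.0, 1.4

# (a) 3X: mean is 3*E(X); variance is 3^2 * Var(X).
mean_a = 3 * mu
var_a = 3 ** 2 * var

# (b) 2 + 7X: adding 2 shifts the mean but not the variance;
# multiplying by 7 scales the variance by 7^2.
mean_b = 2 + 7 * mu
var_b = 7 ** 2 * var

print(mean_a, round(var_a, 4))  # 3.0 12.6
print(mean_b, round(var_b, 4))  # 9.0 68.6
```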
The simplest discrete probability distribution is the Bernoulli distribution. It assigns the value 1 to an event occurring and the
value 0 to the event not occurring. Such an event is often called a Bernoulli trial.
The Bernoulli distribution, including its mean and variance, will be important in discussing our second discrete distribution: the
Binomial.
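As a preview, the Bernoulli mean and variance follow directly from the definitions of E(X) and Var(X) above. A sketch with an illustrative success probability p = .3 (the value is an assumption chosen here, not from the notes):

```python
# Bernoulli trial: outcome 1 with probability p, outcome 0 with probability 1 - p.
p = 0.3  # illustrative success probability

# E(X) = 1*p + 0*(1 - p) = p
mean = 1 * p + 0 * (1 - p)

# Var(X) = (1 - p)^2 * p + (0 - p)^2 * (1 - p), which simplifies to p*(1 - p)
var = (1 - mean) ** 2 * p + (0 - mean) ** 2 * (1 - p)

print(mean)            # 0.3
print(round(var, 10))  # 0.21
```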