Prob and Stats Notes PDF
Prob and Stats Notes PDF
median = 67.5
mode = 68
An extreme value in a data set is called an outlier. Sometimes outliers are excluded before
the data are analyzed.
Some examples of displays of data:
Stem and
Leaf Plot
Histogram
(for continuous data)
Column Graph
(for discrete data)
Height of Trees
stem
2
3
4
leaf
3466
015
24479
Number of C ats
f 40
r
e
30
q
u
e 20
n
c
10
y
f 8
r
e
q 6
u
e 4
n
c
2
y
10
15
height (meters)
20
number of cats
min = 1
Q1 = 3
median = 5
Q1 = 6.5
max = 9
10
Exam Score
Frequency
0 x < 10
10 x < 20
20 x < 30
30 x < 40
40 x < 50
2
3
6
8
11
Cumulative
Frequency
2
5
11
19
30
Cumulative data can also be represented using a cumulative frequency polygon graph. The
graph can be used to estimate the median and to find other properties of the data.
Example:
Exam Scores
30
25
f
r
e
q
u
e
n
c
y
20
15
10
10
20
30
40
score
The median is about 35 points.
If passing is a score of 30 points or more, about 19 students passed.
About 6 students earned an A (90% or more).
About 5 students earned a B (between 80% and 90%).
The 80th percentile is a score of about 44 points.
50
Since range and interquartile range use only two data points, they are not very informative.
So there are two other more important measures of dispersion that use all the data values:
variance and standard deviation.
The deviation is the difference between the data value (x) and the mean ( ). Every value in
the data set has a deviation.
Because some deviations are positive and some are negative, we square them and then find
the average of the squared deviations. This is called the variance and is denoted 2 .
k
f (x )
i
2 =
i =1
f (x )
i
= 2 =
i =1
Note that in some contexts x is used for the mean and s is used for the standard deviation.
You can use your graphing calculator to find standard deviation, but be careful if youre
doing an IB problem! For IB problems, you should always use the 1-var stats function and
chose the value labeled . Do not use the value labeled s or the stdDev function!
Standard deviation is very important for data that are normally distributed, which we will
study later.
Probability is the likelihood that something will happen. Probability can be measured
numerically.
A random experiment is an experiment in which there is no way to determine the outcome
beforehand. For example, a dice game.
A trial is an action in a random experiment. For example, rolling the dice.
An outcome is a possible result of a trial. For example, rolling 2-4.
An event is a set of possible outcomes. For example, the total of the two dice is six (the
outcomes in this event are 1-5, 2-4, 3-3, 4-2, 5-1).
The sample space is the set of all possible outcomes of a random experiment. In the dice
game experiment, the sample space contains 36 outcomes, as shown in the grid on page 357.
For events that are equally likely, the probability is the number of outcomes in the event
divided by the total number of outcomes in the sample space.
Example: P(the total of the two dice is six) =
5
= 0.138 = 13.8 %
36
An event that is certain to occur has a probability of one. An event that cannot occur has a
probability of zero. Probability is always a number between zero and one, inclusive.
Examples: P(the total of the two dice is at least one) =
36
=1
36
0
=0
36
It is also sometimes helpful to illustrate the sample space, and there are a several ways to do
this: a list, a grid, a tree diagram, and a Venn diagram. We will learn more about how to use
these methods later in the chapter.
Events are complementary if their probabilities add up to one. This means that one of the
events is certain to happen.
Example: The probability of rain on Tuesday is 0.2. What is the
probability that it does not rain on Tuesday?
These events are complementary, so 0.8.
Tree diagrams are a useful tool for solving probability problems. When drawing a tree
diagram, follow these guidelines:
Label the events at the end of the branches, and label the probabilities on the branches.
The probabilities on each branch must always total to one and the final probabilities
must always total to one.
0.2
rain
0.85
no
0.17
biking
biking
0.8
no
rain
0.56
0.7
0.3
no
0.24
biking
Events that do not affect each other are called independent. For example, drawing two cards
from a deck with replacement.
Events that do affect each other are called conditional. For example, drawing two cards from
a deck without replacement.
The symbol A B means the intersection of A with B. It is equivalent to A and B.
The multiplication law for probability says P ( A B ) = P ( A) P ( B / A )
where P ( B / A ) means the probability of B happening given A has happened.
P ( A B ) if:
a) the events are mutually exclusive
b) the events are independent
a) P ( A B ) = P ( A ) + P ( B ) = 0.3 + 0.5 = 0.8
b) P ( A B ) = P ( A) + P ( B ) P ( A B )
P ( C D ) = 0.8 . Find P ( C D ) .
P (C D ) = P (C ) + P ( D ) P (C D )
0.8 = 0.2 + 0.7 x
0.8 = 0.9 x
x = 0.1
0.2
0.1
0.4
0.3
P ( E ) = 0.3
P ( E ) P ( F ) = 0.15
P ( F ) = 0.5
P ( E F ) = 0.1
The most important distribution for a continuous random variable is the normal distribution.
It is given by a function of the form
1 x
1
f ( x) =
e 2
2
defined for any real number x. This equation represents a family of functions that depend on
(mean) and (standard deviation).
The probabilities for a normal distribution are given by the area under the curve, and they are
found using definite integrals:
b
P (a x b) = f ( x)
a
bell-shaped
symmetric about
99.95% of the values are within 3.5 standard deviations of the mean
When we are given the probability (or percentage of values) for a normally distributed
random variable, we can find the particular value that gives that probability using the inverse
of the normal distribution.
On the GDC the inputs are invNorm(p, , ). This gives the value of k for P ( X < k ) = p .
Example: X is normally distributed with mean 32 and standard deviation 3. Find the value of
k so that P ( X < k ) = 0.37 .
P ( X > k ) = .07
P ( X < k ) = .93
The inverse of the normal distribution can also be used in problems where we know
probabilities but and are unknown. To do this, we must first convert the x-values into
x
z-values using the formula z =
. This is known as standardizing.
22
40
40 = 25.8
= 65.8
The binomial probability distribution applies to a random variable with the following
characteristics:
The hardest thing about the binomial probability distribution is determining when to use it, so
it is important to understand these characteristics!
10
3
7
P ( X = 3) = ( 16 ) ( 65 ) 0.155
3
Probabilities for binomial distributions can also be found using the GDC. For X ~ B ( n, p )
there are two buttons to choose from:
to find P ( X = r ) use binompdf(n, p, r)
Example: Ms. Carey rolls her six-sided Bicycle die ten times. Find the probability that
she rolls bicycle no more than three times.
Binomial with n = 10 and p = 16 .
P ( X 3) = binomcdf (10,
1
6
, 3) 0.930
Note that the binomcdf button on the GDC only gives probabilities for X less than or equal
to r. If you need to find P ( X > r ) or P ( X r ) you must use complementary events.
Example: Ms. Carey rolls her six-sided Bicycle die ten times. Find the probability that
she rolls bicycle at least three times.
Binomial with n = 10 and p = 16 .
P ( X 3) = 1 P ( X 2 ) = 1 binomcdf (10,
1
6
, 2 ) 0.224
Example: Five percent of the eggs produced on a farm are brown. Find the probability that,
in a case of 240 eggs, there are more than ten brown eggs.
Binomial with n = 240 and p = 0.05 .