Describing Data Numerical
Describing Data Numerical
Business and
Economics
CHAPTER 3
Describing Data
Numerically
Central Variation
Tendency
Arithmetic Rang
Mean e
Media Interquartile
n Range
Mode Varianc
e
Standard
Deviation
Coefficient of
Variation
Measures of Central Tendency
Overview
Central Tendency
x i
x i1
n
Arithmetic Midpoint of Most
average ranked frequently
values observed
value
The Arithmetic Mean is It is calculated by
the most widely used measure summing the values
of location and shows the and dividing by the
central value of the data. number of values.
A verag e
The major characteristics of the mean are: J oe
where
µ is the population mean
N is the total number of observations.
X is a particular value.
indicates the operation of adding.
Population Mean
A Parameter is a measurable characteristic of a
population.
X
56,000 ... 73,000
48,500
N 4
Example 1
For ungrouped data, the Sample Mean
is the sum of all the sample values
divided by the number of sample values:
X
X
n
Sample Mean
A statistic is a measurable characteristic of a sample.
A sample of
five 14.0,
executives
15.0,
received the
following 17.0,
bonus last 16.0,
year ($000): 15.0
Example 2
Properties of the Arithmetic Mean
Every set of interval-level and ratio-level data has a
mean.
All the values are included in computing the mean.
A set of data has a unique mean.
The mean is affected by unusually large or small
data values. (Example- 3, 4, 5, 7, 8, 45)
The arithmetic mean is the only measure of location
where the sum of the deviations of each value from
the mean is zero.
( X X ) (3 5) (8 5) (4 5) 0
Example 3
Mean as a balance point
The Weighted Mean of a set of
numbers X1, X2, ..., Xn, with
corresponding weights w1, w2, ...,wn,
is computed from the following
formula:
( w1 X 1 w2 X 2 ... wn X n )
Xw
( w1 w2 ...wn )
Weighted Mean
During a one hour period on a
hot Saturday afternoon cabana
boy Chris served fifty drinks.
He sold five drinks for $0.50,
fifteen for $0.75, fifteen for
$0.90, and fifteen for $1.10.
Compute the weighted mean of
the price of the drinks.
The Median
Finding the Median
The location of the median:
n 1
Median position position in the ordered data
2
If the number of values is odd, the median is the
middle number.
If the number of values is even, the median is the
average of the two middle numbers.
Where,
L is the lower limit of the class containing the
median.
n is the total number of frequencies.
f is the frequency in the median class.
CF is the cumulative number of frequencies in all the
classes preceding the class containing the median.
i is the width of the class in which the median lies.
Mode of Grouped Data
Symmetric distribution: A distribution having the
same shape on either side of the center
Mean=Median=Mode
M ea n
M ed ia n
M ode
The Relative Positions of the Mean, Median, and Mode: Symmetric Distribution
Positively Skewed: Mean and median are to the right of the mode.
Mean>Median>Mode
M ode M ea n
M ed ia n
The Relative Positions of the Mean, Median, and Mode:
Right Skewed Distribution
Negatively Skewed: Mean and Median are to the left of the Mode.
Mean<Median<Mode
M ea n M ode
M ed ia n
GM n ( X 1)( X 2 )( X 3)...( Xn )
GM (1.05)(1.15)
1.09886
The GM gives a more conservative
profit figure because it is not
heavily weighted by the rate of 15
percent.
Example 7
Example: The profit earned by Atkins
Construction Company on four recent
project were 30%, 20%, -40%, 200%.
Example 7
Another use of the Grow th in Sales 1999-2004
geometric mean is to 50
Sales in Millions($)
40
increase in sales, 30
production or other 20
10
business or economic 0
series from one time 1999 2000 2001 2002 2003 2004
835,000
GM 8 1 .0127
755,000
Example 8
Dispersion refers to the deviation
(i.e. deviation of the values in a
set of data from a central value.)
Measures of Dispersion
Why study dispersion?
Variation
0 1 2 3 4 5 6 7 8 9 10 11 12
13 14
Range = 14 - 1 = 13
The following represents the current year’s Return on
Equity of the 25 companies in an investor’s portfolio.
Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4
Range = 5 - 1 = 4
,5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4
Range = 120 - 1 =
,120 119
Mean The main features of the
Deviation mean deviation are:
The arithmetic All values are used in the
mean of the
calculation.
absolute values
It is not unduly influenced by
of the
deviations from large or small values.
The absolute values are
the arithmetic
mean. difficult to manipulate.
MD = X -X
n
Mean Deviation
The weights of a sample of crates containing books
for the bookstore (in pounds ) are:
103, 97, 101, 106, 103
Find the mean deviation.
X = 102
Drawback:
Absolutes values are difficult to work with.
Example 7
Variance: the arithmetic mean of the squared
deviations from the mean. It is non-negative
and is zero if all observations are the same.
original units.
All values are used in the calculation.
Population Variance
Population Variance formula:
= (X - ) 2
N
X is the value of an observation in the population
mu is the arithmetic mean of the population
N is the number of observations in the population
2
Most commonly used measure of
variation
Shows variation about the mean
Has the same units as the original
data Variance and standard deviation
Sample variance (s )
2 (X - X ) 2
s2 = n -1
2
s s
Sample variance and standard deviation
The hourly wages earned by a sample of five students are:
$7, $5, $11, $8, $6.
Find the sample variance and standard deviation.
X 37
X 7.40
n 5
2
s s 5.30 2.30
Chebyshev’s theorem: For any set of
observations, the minimum proportion of the values
that lie within k standard deviations of the mean is at
least:
1
1 2
k
where k is any constant greater than 1.
Chebyshev’s theorem
1. A population data set of size N = 500
has mean μ = 5.2 and standard
deviation σ = 1.1. Find the minimum
number of observations :
between 3 and 7.4;
between 1.9 and 8.5.
Chebyshev’s theorem
The Empirical Rule
68%
μ
μ 1σ
The Empirical Rule
95% 99.7%
μ 2σ μ 3σ
The Empirical Rule
Remember, Range/6 = SD
Mean & SD of Grouped Data