0% found this document useful (0 votes)
135 views15 pages

Chapter 3 PDF

The document discusses measures of dispersion and location from a chapter in Psychological Statistics. It defines the coefficient of variation as a measure of spread that describes variability relative to the mean. It is unitless, so can compare data sets with different units or means. Small CV indicates data is more homogeneous and clustered around the mean. The document provides an example to calculate CV for age and salary data. It also discusses using kurtosis and skewness to describe the shape of a distribution. Kurtosis measures peakedness and can be leptokurtic (peaked), mesokurtic (intermediate), or platykurtic (flat). Skewness measures symmetry and whether the tail is on the left or right side of
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
135 views15 pages

Chapter 3 PDF

The document discusses measures of dispersion and location from a chapter in Psychological Statistics. It defines the coefficient of variation as a measure of spread that describes variability relative to the mean. It is unitless, so can compare data sets with different units or means. Small CV indicates data is more homogeneous and clustered around the mean. The document provides an example to calculate CV for age and salary data. It also discusses using kurtosis and skewness to describe the shape of a distribution. Kurtosis measures peakedness and can be leptokurtic (peaked), mesokurtic (intermediate), or platykurtic (flat). Skewness measures symmetry and whether the tail is on the left or right side of
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

PSY 303 – PSYCHOLOGICAL STATISTICS

CHAPTER 3
MEASURES OF DISPERSION AND LOCATION

3.6 Coefficient of Variation - is a measure of spread that describes the amount of variability
relative to the mean. Because the coefficient of variation is unitless, you can use it instead
of the standard deviation to compare the spread of data sets that have different units or
different means.
Note: Small measure of variation indicates
• More homogeneous
• Clustered closely around the mean
• Less variable
• More consistent
• More uniformly distributed
𝑠
Formula: 𝐶𝑉 = (100), where s = standard deviation and 𝑥̄ = mean
𝑥̄

Example1. The average age of the engineers at VSAS Pipeline Corporation is 30 years
with a standard deviation of 3. The average monthly salary of the engineers is
P45,000 with a standard deviation of P3,150. Determine the coefficient of variation
of age and salary.
Solution:
AGE: Given 𝑥̄ 1 = 30 𝑠1 = 3
𝑠 3
𝐶𝑉1 = 𝑥̄1 (100) = (30) (100) = (0.1)(100) = 𝟏𝟎%
1

MONTHLY SALARY: Given 𝑥̄ 2 = 𝑃45,000 𝑠2 = 𝑃3,150


𝑠2 3,150
𝐶𝑉2 = 𝑥̄ (100) = (45,000) (100) = (0.07)(100) = 𝟕%
2

Therefore: AGE is more variable than the MONTHLY SALARY

Example2. Consider the set of contestants. Compare the variation of the two.
Set A 6 7 8 7 9
Set B 6 8 5 9 7

(Set A) x 𝒙 − 𝒙̄ (𝒙 − 𝒙̄ )𝟐
6 -1.40 1.96
7 -0.40 0.16
8 0.60 0.36
7 -0.40 0.16
9 1.60 2.56
∑ 𝑥 = 37 ∑(𝑥 − 𝑥̄ )2 = 5.20

∑ 𝑥 37
𝑥̄1 = = = 𝟕. 𝟒𝟎
𝑛 5

1
PSY 303 – PSYCHOLOGICAL STATISTICS

∑(𝑥−𝑥̄)2 5.20 5.20


𝑠1 = √ = √5−1 = √ = 𝟏. 𝟏𝟒
𝑛−1 4

𝑠 1.14
𝐶𝑉1 = 𝑥̄1 (100) = (7.40) (100) = (0.1541)(100)
1

𝑪𝑽𝟏 = 𝟏𝟓. 𝟒𝟏%


(Set B) x 𝒙 − 𝒙̄ (𝒙 − 𝒙̄ )𝟐
6 -1 1
8 1 1
5 -2 4
9 2 4
7 0 0
∑ 𝑥 = 35 ∑(𝑥 − 𝑥̄ )2 = 10

∑ 𝑥 35
𝑥̄ 2 = = =𝟕
𝑛 5
∑(𝑥−𝑥̄)2 10 10
𝑠2 = √ =√ =√ = 𝟏. 𝟓𝟖
𝑛−1 5−1 4

𝑠 1.58
𝐶𝑉2 = 𝑥̄2 (100) = ( ) (100) = (0.2257)(100)
2 7
𝑪𝑽𝟐 = 𝟐𝟐. 𝟓𝟕%

Therefore: Set A is less variable than Set B

2
PSY 303 – PSYCHOLOGICAL STATISTICS

Name: ________________________________ Date: _________


Section: _______________________________ Score: ________

EXERCISES 3.5 & 3.6

1. Given the grouped frequency distribution. Find the following


a. Midhinge b. IQR c. QD

AGE f
11 – 20 5
21 – 30 7
31 – 40 12
41 – 50 22
51 – 60 8
61 – 70 4
71 – 80 2
TOTAL

2. In a subdivision with more than a hundred home owners, a distribution of two different
units is given to compare the dispersion of weights with the dispersion of heights. The
mean heights is 5.7ft with s=0.9ft and the mean weight is 72.5kg with s=8.1kg. Compare
the dispersion in the heights and weights.

3
PSY 303 – PSYCHOLOGICAL STATISTICS

3.7 Kurtosis – it is a statistical measure used to describe the distribution of observed data
around the mean. It measures the relative peakedness or flatness of a distribution.
𝑛(𝑛+1) 𝑥−𝑥̄ 4 3(𝑛−1)2
Formula: 𝑘𝑢𝑟𝑡 = {[(𝑛−1)(𝑛−2)(𝑛−3)] [∑ ( ) ]} − (𝑛−2)(𝑛−3)
𝑠

Types of Kurtosis
A. Mesokurtic – are intermediate distribution which are neither too peaked nor too
flat. The values are immediately distributed about the center. Its kurtosis is zero
(kurt=0).
B. Leptokurtic – are distributions where values clustered heavily or pile up in the
center. There is tall distribution with narrow humps and long and high tails. Its
kurtosis is positive (kurt>0) and it denotes a high degree of peakedness.
C. Platykurtic – are flat distributions with values more evenly distributed about the
center with broad humps and short tails. Its kurtosis is negative (kurt<0) and it
denotes a low degree of peakedness.

Figure: Types of Kurtosis

Example1: Ages of Previous Customers


4
𝑥−𝑥 𝑥−𝑥
CI f 𝑥 𝑥−𝑥 ( ) ( )
𝑠 𝑠
18 – 26 2 22 -32.18 -2.5004 39.0875
27 – 35 1 31 -23.18 -1.8011 10.5233
36 – 44 5 40 -14.18 -1.1018 1.4737
45 – 53 8 49 -5.18 -0.4025 0.0262
54 – 62 15 58 3.82 0.2968 0.0078
63 – 71 6 67 12.82 0.9961 0.9845
72 – 80 3 76 21.82 1.6954 8.2621
𝑥−𝑥 4
TOTAL n = 40 ∑( ) =60.3651
𝑠
Solution:

Recall:
∑ 𝑓𝑥 2,167
𝑥̄ = = = 𝟓𝟒. 𝟏𝟖
𝑛 40
∑ 𝑓 (𝑥−𝑥̄ )2 6,461.7760 6,461.7760
𝑠=√ 𝑛−1
=√ 40−1
=√ 39
= 𝟏𝟐. 𝟖𝟕

4
PSY 303 – PSYCHOLOGICAL STATISTICS

𝑛(𝑛 + 1) 𝑥 − 𝑥̄ 4 3(𝑛 − 1)2


𝑘𝑢𝑟𝑡 = {[ ] [∑ ( ) ]} −
(𝑛 − 1)(𝑛 − 2)(𝑛 − 3) 𝑠 (𝑛 − 2)(𝑛 − 3)
40(40 + 1) 3(40 − 1)2
𝑘𝑢𝑟𝑡 = {[ ] [60.3651]} −
(40 − 1)(40 − 2)(40 − 3) (40 − 2)(40 − 3)
40(41) 3(39)2
𝑘𝑢𝑟𝑡 = {[ ] [60.3651]} −
(39)(38)(37) (38)(37)
1,640 3(1,521)
𝑘𝑢𝑟𝑡 = {[ ] [60.3651]} −
54,834 1,406
4,563
𝑘𝑢𝑟𝑡 = {[0.0299][60.3651]} −
1,406
𝑘𝑢𝑟𝑡 = 1.8049 − 3.2454
𝑘𝑢𝑟𝑡 = −1.4405
𝒌𝒖𝒓𝒕 = −𝟏. 𝟒𝟒
𝒌𝒖𝒓𝒕 < 𝟎
PLATYKURTIC

5
PSY 303 – PSYCHOLOGICAL STATISTICS

Name: ________________________________ Date: _________


Section: _______________________________ Score: ________

EXERCISES 3.7

1. Given the grouped frequency distribution. Find the value of kurtosis.


4
𝑥−𝑥 𝑥−𝑥
AGE f 𝑥 𝑥−𝑥 ( ) ( )
𝑠 𝑠
11 – 20 5
21 – 30 7
31 – 40 12
41 – 50 22
51 – 60 8
61 – 70 4
71 – 80 2
TOTAL

6
PSY 303 – PSYCHOLOGICAL STATISTICS

3.8 Skewness - Degree to which a statistical distribution is not in balance around the mean
(is asymmetrical or lopsided), a perfectly symmetrical distribution having a value of 0.
Distributions with extreme values (outliners) above the mean have positive skew, and
the distributions with outliners below the mean have negative skew.
Coefficient of Skewness – measures the general shape of the distribution or the
lack of symmetry of a distribution.

Formulas:
3(𝑥̄−𝑥̃)
𝑠𝑘 = (Pearson’s Coefficient of Skewness)
𝑠
𝑛 𝑥−𝑥̄ 3
𝑠𝑘 = (𝑛−1)(𝑛−2) [∑ ( ) ] (Software Coefficient of Skewness)
𝑠

Types of Distribution

A. Negatively Skewed Distribution – The mass of the data values fall to the right of
the mean and group at the upper end of the distribution with the tail to the left.
The mean is to the left of the median and the mode is to the right of the median.
B. Symmetrical Distribution – The data values are evenly distributed on both sides
of the mean. The distribution is unimodal and the mean, median and mode are
equal and are at the center of the distribution
C. Positively Skewed Distribution – Most of the values in the data fall to the left of
the mean and group at the lower end of the distribution with the tail to the right.
The mean is to right of the median and the mode is to the left of the median.

Figure: Types of Distribution

7
PSY 303 – PSYCHOLOGICAL STATISTICS

Example1: Ages of Previous Customers. Determine the shape of the distribution.


a. use Pearson’s Coefficient of Skewness
b. use Software Coefficient of Skewness

3
𝑥−𝑥 𝑥−𝑥
CI f 𝑥 𝑥−𝑥 ( ) ( )
𝑠 𝑠
18 – 26 2 22 -32.18 -2.5004 -15.6325
27 – 35 1 31 -23.18 -1.8011 -5.8427
36 – 44 5 40 -14.18 -1.1018 -1.3375
45 – 53 8 49 -5.18 -0.4025 -0.0652
54 – 62 15 58 3.82 0.2968 0.0261
63 – 71 6 67 12.82 0.9961 0.9883
72 – 80 3 76 21.82 1.6954 4.8732
𝒙−𝒙 𝟑
TOTAL n = 40 ∑( ) = -16.9903
𝒔

Solution:

Recall:
∑ 𝑓𝑥 2,167
𝑥̄ = = = 𝟓𝟒. 𝟏𝟖
𝑛 40

∑ 𝑓 (𝑥 − 𝑥̄ )2 6,461.7760 6,461.7760
𝑠=√ =√ =√ = 𝟏𝟐. 𝟖𝟕
𝑛−1 40 − 1 39
𝑛
−𝑐𝑓 20−16 4 36
𝑥̃ = 𝐿𝐵 + [ 2 𝑓 ] ⋅ 𝑖 = 53.5 + [ ] (9) = 53.5 + [ ] (9) = 53.5 + = 53.5 + 2.40
15 15 15

̃ = 𝟓𝟓. 𝟗𝟎
𝒙

3(𝑥̄−𝑥̃) 3(54.18−55.90) 3(−1.72) −5.16


a. 𝑠𝑘 = = = =
𝑠 12.87 12.87 12.87

𝒔𝒌 = −𝟎. 𝟒𝟎 Negatively Skewed Distribution

𝑛 𝑥−𝑥̄ 3 40
b. 𝑠𝑘 = [ ∑( ) ] = [(40−1)(40−2)] [−16.9903]
(𝑛−1)(𝑛−2) 𝑠

40 40
=[ ] [−16.9903] = [ ] [−16.9903]
(39)(38) 1,482
= [0.027][−16.9903] = −0.4587
𝒔𝒌 = −𝟎. 𝟒𝟔 Negatively Skewed Distribution

8
PSY 303 – PSYCHOLOGICAL STATISTICS

Name: ________________________________ Date: _________


Section: _______________________________ Score: ________

EXERCISES 3.8

1. Given the grouped frequency distribution. Find the value of Pearson’s Coefficient of
Skewness

AGE f
11 – 20 5
21 – 30 7
31 – 40 12
41 – 50 22
51 – 60 8
61 – 70 4
71 – 80 2
TOTAL

9
PSY 303 – PSYCHOLOGICAL STATISTICS

3.9 Outliers - A value that "lies outside" (is much smaller or larger than) most of the other
values in a set of data.
- it is an observation that lies an abnormal distance from other values
in a random sample from a population.
- Extremely high or extremely low values in the data set.

Formulas: 𝑄1 − 1.5(𝐼𝑄𝑅 ) and 𝑄3 + 1.5(𝐼𝑄𝑅 )

Example1: Using the ungrouped data of the Ages of Previous Customers. Check
the data set for outliers.
59 50 52 38 80 62 77 56
60 61 58 62 51 36 54 18
71 54 44 52 26 63 58 56
41 34 61 50 60 53 62 62
53 43 63 71 65 79 45 66

Solution: Arrangement of data in ascending order


18 41 50 53 58 61 62 71
26 43 51 54 58 61 63 71
34 44 52 54 59 62 63 77
36 45 52 56 60 62 65 79
38 50 53 56 60 62 66 80

n = 40, even
𝑘𝑛 1
𝑄𝑘 = +
4 2
(1)(40) 1 40 1
𝑄1 = + = + = 10 + 0.5 = 10.5
4 2 4 2
50 + 50 100
𝑄1 = = = 𝟓𝟎
2 2
(3)(40) 1 120 1
𝑄3 = + = + = 30 + 0.5 = 30.5
4 2 4 2
62 + 62 124
𝑄3 = = = 𝟔𝟐
2 2

𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 62 − 50
𝑰𝑸𝑹 = 𝟏𝟐

𝑎. 𝑄1 − 1.5(𝐼𝑄𝑅 ) = 50 − 1.5(12) = 50 − 18 = 𝟑𝟐

𝑏. 𝑄3 + 1.5(𝐼𝑄𝑅 ) = 62 + 1.5(12) = 62 + 18 = 𝟖𝟎

OUTLIERS are 18 and 26

Example2: The Age of 12 patients that entered the hospitals in the morning of a day are
47, 80, 42, 53, 36, 75, 30, 61, 22, 43, 85, and 72. Check the data set for outliers.

Solution: Arrangement of data in ascending order


22, 30, 36, 42, 43, 47, 53, 61, 72, 75, 85 n = 11 odd

10
PSY 303 – PSYCHOLOGICAL STATISTICS

𝑘 (𝑛 + 1)
𝑄𝑘 =
4
(1)(11 + 1) (1)(12)
𝑄1 = =
4 4
12
=
4
𝑄1 = 3𝑟𝑑
𝑸𝟏 = 𝟑𝟔

(3)(11 + 1) (3)(12)
𝑄3 = =
4 4
36
=
4
𝑄3 = 9𝑡ℎ
𝑸𝟑 = 𝟕𝟐

𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 72 − 36
𝑰𝑸𝑹 = 𝟑𝟔

𝑎. 𝑄1 − 1.5(𝐼𝑄𝑅 ) = 36 − 1.5(36) = 36 − 54 = −𝟏𝟖

𝑏. 𝑄3 + 1.5(𝐼𝑄𝑅 ) = 72 + 1.5(36) = 72 + 54 = 𝟏𝟐𝟔

NO OUTLIERS

11
PSY 303 – PSYCHOLOGICAL STATISTICS

Name: ________________________________ Date: _________


Section: _______________________________ Score: ________

EXERCISES 3.9

1. During the first 11 weeks of world economic crisis, the weekly average hours worked in
manufacturing jobs were 45, 46, 40, 39, 30, 36, 38, 41, 42, 48, and 35 respectively. Check
for outliers of the given set of data.

12
PSY 303 – PSYCHOLOGICAL STATISTICS

3.10 Boxplot (Box-and-Whisker Plot)


- is a graphical interpretation of statistical data based on the minimum, first quartile, median,
third quartile, and maximum. The term "box plot" comes from the fact that the graph looks
like a rectangle with lines extending from the top and bottom (whatis.techtarget.com). A type
of graph used to display patterns of quantitative data. A boxplot splits the data set
into quartiles. The body of the boxplot consists of a "box" (hence, the name), which goes from
the first quartile (Q1) to the third quartile (Q3). Within the box, a vertical line is drawn at the
Q2, the median of the data set. Two horizontal lines, called whiskers, extend from the front
and back of the box. The front whisker goes from Q1 to the smallest non-outlier in the data
set, and the back whisker goes from Q3 to the largest non-outlier (stattrek.com).

Xlowest Xhighest

Q1 Q2 = 𝑥̃ Q3

Boxplot or Box and Whisker chart

Example: Using the ungrouped data of the Ages of Previous Customers. Make a boxplot.
Solution: Given 𝑄1 = 50 𝑄3 = 62 OUTLIERS are 18 and 26
Xlowest = 34 Xhighest = 80

𝑘𝑛 1
𝑄𝑘 = +
4 2

(2)(40) 1 80 1
𝑄2 = + = + = 20 + 0.5 = 20.5
4 2 4 2

56 + 58 114
𝑄2 = = = 𝟓𝟕
2 2

50 57 62

18 26 34 80
Xlowest Xhighest

Q1 Q2 = 𝑥̃ Q3

15 20 25 30 35 40 45 50 55 60 65 70 75 80 85

Box-plot of the Ages of previous customers

13
PSY 303 – PSYCHOLOGICAL STATISTICS

Name: ________________________________ Date: _________


Section: _______________________________ Score: ________

EXERCISES 3.10

1. Consider the data of scores 3, 5, 9, 13, 15, 17, 19, 21, 23, and 25. Construct a boxplot.

2. Twelve randomly selected psychology students were asked to state the number of hours
they slept last Sunday. The resulting data are 6, 7, 8, 10, 5, 4, 11, 12, 11, 10, 8 and 7.
Construct a boxplot.

14
PSY 303 – PSYCHOLOGICAL STATISTICS

CHAPTER TEST (Problem Set 3)

Name: ________________________________ Date: _________


Section: _______________________________ Score: ________

1. the following data give the weight (in pounds) lost by 8 randomly selected new member
of a psychology club at the end of their first 3 months of membership. Complete the table
and find the following:

x 𝑥 − 𝑥̄ |𝑥 − 𝑥̄ |
13
14
17
20
25
18
16
15

a. Range
b. Average Deviation
c. Standard Deviation using the second formula

2. Given the grouped frequency distribution. Find the following

AGE f
18 – 20 3
21 – 23 6
24 – 26 8
27 – 29 5
30 – 32 4
33 – 35 4
TOTAL

a. Range
b. Average Deviation
c. Standard Deviation using the first formula
d. Variance using the shortcut formula
e. 𝑃80 , 𝐷5 and 𝑄3
f. Midhinge, IQR and QD
g. Kurtosis
h. Pearson’s Coefficient of Skewness
i. Construct a Box and Whisker plot

3. ASIAN company manufactures computer terminals. The following data are the number of
computer terminals produced at a company for a sample of fifteen days: 7, 20, 15, 20, 22,
17, 22, 18, 19, 12, 15, 16, 17, 13, and 22. Check for outliers of the given set of data and
Construct a boxplot.

15

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy