0% found this document useful (0 votes)
21 views75 pages

2 Frequency-Distribution

The document discusses the importance of frequency tables in statistical investigations, explaining how they organize raw data into classes with corresponding frequencies for easier analysis. It covers concepts such as class intervals, class boundaries, relative frequency, and cumulative frequency distributions, along with examples and calculations. Additionally, it introduces measures of central tendency and dispersion, emphasizing their role in summarizing data distributions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views75 pages

2 Frequency-Distribution

The document discusses the importance of frequency tables in statistical investigations, explaining how they organize raw data into classes with corresponding frequencies for easier analysis. It covers concepts such as class intervals, class boundaries, relative frequency, and cumulative frequency distributions, along with examples and calculations. Additionally, it introduces measures of central tendency and dispersion, emphasizing their role in summarizing data distributions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 75

Prepared by:

CHRISTOPHER N. SATIADA,CST,MAT
FREQUENCY TABLES
In every statistical investigation, it is radically difficult to
examine raw data ( disarranged collected data). Presenting the
data in an array ( an organized set of data in ascending or
descending form) and grouping them in tabular form, lessens
tedious work in statistics.
Presenting the data in a tabular form gives quick information
and a description of the data. These tables are called
statistical tables, which are frequency tables, contingency tables,
and stem-and-leaf design tables.
A frequency distribution or frequency table
is an arrangement of a set of data divided
into classes or categories together with the
corresponding frequencies (number of
observed values).
EXAMPLES:
1. Results in tossing a coin 20 times: HTHHHTTTTHTHTHTTHHHH

Coin Face Tally Frequency


Head (H) IIIII – IIIII – I 11
Tail (T) IIIII – IIII 9
Total 20
2. Ages ( in years) of 30 politicians

28 28 38 48 26 32 45 36 29 29 35 49 56 34 32
24 35 42 35 52 39 37 32 50 27 37 48 37 31 39

Age Tally Frequency


24 – 29 IIIII – II 7
30 – 35 IIIII – III 8
36 – 41 IIIII – II 7
42 – 47 II 2
48 – 53 IIIII 5
54 – 59 I 1
TOTAL 30
3
Grouped data are data arranged in a
frequency distribution.
Class intervals or simply classes are the
ranges of quantitative observations in a
frequency distribution.
3
Class limits are the end numbers in a
class interval.
In the class interval 24 – 29, 24 is the
lower limit and 29 is the upper limit.
3The class boundaries are also called
the absolute class limits. To find class
boundaries for a class limit of whole
numbers, subtract 0.5 from the lower-
class limit and add the same number to
the upper-class limit.
3
Class Size ( c ) or class width is
obtained by subtracting the lower-class
boundary (LB) from the upper-class
boundary (UB).
𝒄 = 𝑼𝑩 − 𝑳𝑩
3 The class widths may be regular or
irregular.
Regular class widths show equal widths
for all classes while irregular class
widths are not equal for all classes.
Class Mark (X) is the midpoint of the
3 class interval and is obtained by
getting the average of the upper limit
(UL) and the lower limit (LL). This can
be determined by using boundaries
instead of limits.
𝑼𝑳 + 𝑳𝑳 𝑼𝑩 + 𝑳𝑩
𝑿= 𝑿=
𝟐 𝟐
3
RELATIVE FREQUENCY
DISTRIBUTION
The relative frequency or percent
frequency of each class is obtained by
multiplying the quotient of the frequency
and the total frequency (n or N) by 100.
𝒇
𝒓𝒇 𝒐𝒓 %𝒇 = 𝟏𝟎𝟎
𝒏
CUMULATIVE FREQUENCY
DISTRIBUTION
There are two cumulative frequency
distributions (cf ), the less then (<cf ) and
the greater than (>cf ) cumulative
distributions.
RELATIVE CUMULATIVE
3
FREQUENCY DISTRIBUTION
The relative cumulative frequency is
obtained by taking each cumulative
frequency in percent form
𝒄𝒇
𝒓𝒄𝒇 𝒐𝒓 %𝒄𝒇 = 𝟏𝟎𝟎
𝒏
3
Step 1. Determine the range ( R ) of the set
of data. The range is equal to the difference
between the highest value (HV) and the
lowest value (LV) of the set of data.
𝑹 = 𝑯𝑽 − 𝑳𝑽
3
Step 2. Divide the range by the desired
number of classes to obtain the class size or
class width.
𝑹
𝒄=
𝒅𝒆𝒔𝒊𝒓𝒆𝒅 𝒏𝒐. 𝒐𝒇 𝒄𝒍𝒂𝒔𝒔𝒆𝒔
3 up the value of c to the decimal places like the observed
Round
values. Rounding up numbers is different from rounding off
numbers.
Rounding up numbers means adding 1 unit to the desired
decimal place whenever there is an excess. To do this, consider
the following:
1.255 rounded up to two decimal places will be 1.26
4.682 rounded up to one decimal place will be 4.7
2.35651 rounded up to three decimal places will be 2.357
3Step 3. List the class intervals starting with the
lowest value ( this will be the lower limit of the
lowest class) or the highest value (this will be the
upper limit of the highest class). Add the class
width to the first lower limit to obtain the next
lower-class limit or subtract the class width from
the upper limit of the highest class to obtain the
next upper-class limit.
3
Step 4. Make the tally.

Step 5. Write the corresponding frequency


of each class interval.
EXAMPLE:
The following are the numbers of recruits of the 25
salespersons in an insurance company in a year.
Construct a frequency distribution with 4 class intervals.
14 15 20 25 18
22 15 18 17 19
12 8 10 14 16
18 22 27 25 24
16 18 22 19 20
S 𝟏𝟗
𝑹 = 𝟐𝟕 – 𝟖 = 𝟏𝟗
O 𝒄=
𝟒
= 𝟒. 𝟕𝟓 ≈ 𝟓
L
U Class Interval
(no. of recruits) Tally frequency
T 8 – 12 IIII 4
I 13 – 17 IIIII – II 7
18 – 22 IIIII – IIIII 10
O 23 – 27 IIII 4
N Total 25
:
SOLVE THE FOLLOWING PROBLEMS.
1. Construct a frequency distribution of the given data with 6 classes.
Determine the class marks, class boundary, less than and greater than
cumulative frequency, relative frequency, and relative cumulative
frequency.
Running Times ( in minutes ) of 35 players
25.6 28.7 30.4 29.2 22.7 24.9 23.5
21.6 29.5 27.1 28.4 29.3 24.8 26.8
28.3 28.4 27.6 24.4 25.6 27.7 29.1
24.0 26.3 28.0 27.5 30.1 31.0 29.5
23.5 23.4 27.3 28.9 27.9 24.6 21.9
HV = _____ LV = ______ R (Range) = __________ c = __________

Class
Interval Tally f X Class Boundary rf <cf >cf

TOTAL
2. Answer the following questions. Refer to the frequency distributions
constructed in #1.
a.What is the class interval with the highest frequency?
b.What is the representative running time of the lowest class interval?
c.How many players have running times less than 26.4 min.?
d.How many percent of the players belong to the lowest class interval?
e.If the minimum time requirement to be considered in the
competition is less than 28 min., how many players will be included?
f. How many percent of the players are included in the competition?
g.How many will not be included in the competition?
Introduction:
* Measures of central tendency are statistical measures that
describe the position of a distribution.
* They are also called statistics of location and are the
complement of statistics of dispersion, which provide information
concerning the variance or distribution of observations.
* In the univariate context, the mean, median, and mode are the
most used measures of central tendency.
* computable values on a distribution that discuss the behavior of
the center of a distribution.
Measures of Central Tendency
The value or the figure which represents the whole series is
neither the lowest value in the series nor the highest it lies
somewhere between these two extremes.
1. The average represents all the measurements made on a
group and gives a concise description of the group.
2. When two are more groups are measured, the central
tendency provides the basis of comparison between them.
Definition
Simpson and Kafka defined it as “ A measure of
central tendency is a typical value around which
other figures congregate”
Waugh has expressed “An average stands for the
whole group of which it forms a part yet
represents the whole”.
Class Interval f
85 – 99 9
70 – 84 23
55 – 69 14
40 – 54 6
25 – 39 4
10 – 24 4
Total 60
Class Interval f X fX d fd ∑𝑓𝑥 3945
𝑋= =
85 – 99 9 92 828 1 9 𝑛 60
70 – 84 23 77 1771 0 0 = 𝟔𝟓. 𝟕𝟓
55 – 69 14 62 868 -1 -14 or
40 – 54 6 47 282 -2 -12 ∑𝑓𝑑
𝑋 =𝐴+ 𝑐
25 – 39 4 32 128 -3 -12 𝑛
−45
10 – 24 4 17 68 -4 -16 = 77 + (15)
60
Total 60 ∑𝒇𝑿 = 𝟑𝟗𝟒𝟓 ∑𝒇𝑿 = −𝟒𝟓
= 𝟔𝟓. 𝟕𝟓
Class Interval f
85 – 99 9
70 – 84 23
55 – 69 14
40 – 54 6
25 – 39 4
10 – 24 4
Total 60
Class
Interval f cf
85 – 99 9 60
70 – 84 2 51 Where half of the data falls.
55 – 69 14 28 The sum of the frequencies is
40 – 54 6 14 lower than the median class.
25 – 39 4 8
10 – 24 4 4
Total 60
𝑵
− ∑𝒇𝒎−𝟏
𝑳 = 𝟔𝟗. 𝟓 𝑿=𝑳+ 𝟐 𝒄
∑𝒇𝒎−𝟏 = 𝟒 + 𝟒 + 𝟔 + 𝟏𝟖 = 𝟐𝟖
𝒇𝒎

𝒇𝒎 = 𝟐𝟑 𝟔𝟎
− 𝟐𝟖
𝑿 = 𝟔𝟗. 𝟓 + 𝟐 𝟏
𝒄 = 𝟖𝟒. 𝟓 − 𝟔𝟗. 𝟓 = 𝟏𝟓 𝟐𝟑
𝑿 = 𝟕𝟎. 𝟖
Class Interval f
85 – 99 9
70 – 84 23
55 – 69 14
40 – 54 6
25 – 39 4
10 – 24 4
Total 60
𝒅𝟏 = 𝟐𝟑 − 𝟏𝟒 = 𝟗
𝒅𝟐 = 𝟐𝟑 − 𝟗 = 𝟏𝟒
Class Interval f 𝑳 = 𝟔𝟗. 𝟓
85 – 99 9 𝒄 = 𝟖𝟒. 𝟓 − 𝟔𝟗. 𝟓 = 𝟏𝟓
70 – 84 23 modal class
55 – 69 14 ∼ 𝒅𝟏
𝑿=𝑳+ 𝒄
40 – 54 6 𝒅𝟏 + 𝒅𝟐
25 – 39 4 ∼ 𝟗
10 – 24 4 𝑿 = 𝟔𝟗. 𝟓 + 𝟏𝟓 = 𝟕𝟓. 𝟑𝟕
𝟗 + 𝟏𝟒
Total 60
Find the mean, median, and mode of the given distribution.

Class f X <cf fX d fd
50 – 58 2 -3
59 – 67 5 -2
68 – 76 14 -1
77 – 85 18 0
86 – 94 7 1
94 – 103 4 2
TOTAL
The measures of dispersion or measures of
variability indicate to what extent the
individual observation is dispersed or spread
out around the mean. Data items that are
clustered closely around the mean, more
homogenous, less variable, and more
consistent possess small variability.
The range is the difference between the highest
value and the lowest value in the set of ungrouped
data. It is the difference between the upper limit of
the largest class and the lower limit of the smallest
class for the grouped data.
EXAMPLES:
1. The average grade of a sample of students in a class are 84.3,
78.4, 85.9, 87.2, 79.4, 90.2, and 87.3. Determine the range.

HV = 90.2 LV = 78.6

R = HV – LV = 90.2 – 78.6 = 11.6


2. Compare the values of the range of the following data

Round 1 2 3 4 5 6 HV LV R
Mikee 84 90 85 87 92 86 92 84 8
Dinky 72 96 96 86 89 90 96 72 24

Answer: The range running time of Dinky is much


longer than the range running time of Mikee.
3. Determine the range of the following frequency distribution.

Class Interval f
85 – 89 9
70 – 84 23 𝑹 = 𝑼𝑳 − 𝑳𝑳
55 – 69 14 = 𝟗𝟗 − 𝟏𝟎
40 – 54 6 = 𝟖𝟗
25 – 39 4
10 – 24 4
Total 60
Determine the variance of each of the following data

1)8,9,2,12,16,10, and 13 (from a sample)


2)-4,-11,7,8,5,0,1 (from a sample)
3)3.4,5.7,8.4,2.5,5.8 and 6.7 ( from a population)

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy