Mat 152 Sas#10
Mat 152 Sas#10
Introduction
Collecting and presenting data is very important, knowing that all scientific facts are supported by true
data. In statistics, one of the most important aspects of the statistical process for the researcher is to gather
correct data and present it in a way that readers can understand easily. Data can be presented in various models
such as textual, tabular, and graphical displays.
In this module, you will learn to describe and analyze ungrouped and grouped data in textual or tabular
form. You will also explore organizing ungrouped data into frequency distribution table format. The knowledge
you will gain here will be necessary for interpreting data in the future.
B. MAIN LESSON
Content Notes
Directions: In this activity, you need to read and underline or highlight keywords for you to keep in mind some
essential terms/ words used in the content notes.
PRESENTATION OF DATA
This refers to the organization of data into tables, graphs, or charts that logical and statistical
conclusions can be derived from the collected measurements.
Example:
Of the 150-sample interviewed, the following complaints were noted: 27 for lack of books in the library,
25 for the dirty playground, 20 for lack of laboratory equipment, and 17 for a not well-maintained university
building.
2. Tabular Presentation
- Method of presenting data using the statistical table.
- A systematic organization of data in columns and rows.
Example:
Table Heading
Box Head
Death Rates
Year Rural Urban Combined
Source Note
3. Graphical Presentation
Before we get to start constructing frequency distribution, we must define some terms that are essential
to understand deeper the nature of data that are displayed in a frequency distribution.
a) Frequency Distribution is the organization of data in a tabular form, using mutually exclusive classes
showing the number of observations in each.
b) Raw Data is the data collected in original form.
c) Range is the difference between the highest value and lowest values in a distribution.
d) Class Limits are the highest or lowest values describing a class.
e) Class Boundaries is the upper and lower values of a class for group frequency distribution whose
values has additional decimal place more than the class limits and end with the digit 5.
f) Interval is the distance between the class lower boundary and upper boundary, denoted by the symbol
i.
g) Frequency (f) is the number of values in a specific class of a frequency distribution.
h) Percentage is obtained by multiplying the relative frequency by 100%
i) Cumulative Frequency (cf) is the sum of the frequencies accumulated up to the upper boundary of a
class in a frequency distribution.
j) Midpoint is the point halfway between the class limits of each class and is representative of the data
within that class.
Example:
In a quiz, the marks obtained by 20 students out of 30 are given as:
12 15 15 29 30 21 30 30 15 17
19 15 20 20 16 21 23 24 23 21
Step 1: Arrange the data in ascending order for easy counting in the next few steps.
12 15 15 15 15 16 17 19 20 20
21 21 21 23 23 24 29 30 30 30
Step 3: Decide the approximate number of classes: Must be between 5 and 20. H.A. Sturges has given a
formula for determining the approximation number of classes (K).
Formula:
𝐊 = (1 + 3.322log𝑁)
Solution:
𝐊 = (1 + 3.322log𝑁)
= 1 + (3.322) (log 20)
= 1 + (3.322) (1.301029…)
= 1 + 4.32202…
= 5.32202…
𝐊=5
Step 5: Create the Class Intervals (CI). Set the lowest number as the starting point. It consists of end numbers
called lower and upper limits.
Add the lowest number to the number of classes (K) to get the upper limit.
12 + 5 = 17
18 + 5 = 23
24 + 5 = 29
30 + 5 = 35
30 – 35
Step 6: Class frequency (f) is the number of data that belong to its class interval. In the example, since the
data is arranged already, counting is very easy.
Distribute the data into respective classes: The total of the frequency column must be equal to
the number of observations or sample size (n).
12 15 15 15 15 16 17 19 20 20
21 21 21 23 23 24 29 30 30 30
Lower Class Boundary (LCB) – is the middle value between the lower-class limit and the upper-
class limit of the preceding class.
Upper Class Boundary (UCB) – is the middle value between the upper-class limit and the lower-
class limit of the next class.
Upper Limit
Lower Limit
Step 8: Determine the classmark. This is the average or midpoint of the upper limit and lower limit.
Classmark (CM) can be found by adding the lower and upper limits and then dividing them by 2.
𝑢𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 + 𝑙𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
2
Cumulative frequency “less than”(<CF) – is obtained by adding the frequencies successively from
the lowest to the highest interval.
Cumulative frequency “greater than”(>CF) – is obtained by adding the frequencies successively
from the highest to the lowest class interval.
Class
Frequency (f) <CF
Interval (CI)
12 – 17 7 7 Copy the first frequency, which is 7.
Class
Frequency (f) >CF
Interval (CI)
Copy the total frequency to the first >cf, then minus the first
12 – 17 7 20
>cf to the first f, to get the second >cf
18 – 23 8 13 Minus the second >cf to the second f to get the third >cf
24 – 29 2 5 Minus the third >cf to the third f to get the fourth >cf
30 – 35 3 3
Total n = 20
Step 10: Determine the relative frequency of each class interval by dividing the frequency of the interval by
the total number of observations.
Relative frequency percentage (RF%) is the frequency divided by the total frequency and
multiplied by 100 to express it in percentage form.
𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
𝑅𝑓 = 𝑡𝑜𝑡𝑎𝑙 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑥 100
Part A. Table Analysis: One of the simplest methods used to analyze data and display the data is in tabular
form. In a table, you get a systematic arrangement of rows and columns. Evaluate the table below and answer
the few questions that follow.
Table 2. TIMSS Average mathematics scores of 8th-grade students by education system: 2015
Rank Education system Average score
1 Singapore 621
2 Korea, Rep. of 606
3 Chinese Taipei-CHN 599
4 Hong Kong-CHN 594
5 Japan 586
6 Russian Federation 538
7 Kazakhstan 528
8 Canada 527
9 Ireland 523
10 United States 518
10 England-GBR 518
11 Slovenia 516
12 Hungary 514
13 Norway 512
14 Lithuania 511
14 Israel 511
15 Australia 505
16 Sweden 501
17 Italy 494
17 Malta 494
18 New Zealand 493
19 Malaysia 465
19 United Arab Emirates 465
20 Turkey 458
* TIMSS scale center point 500
SOURCE: International Association for the Evaluation of Educational Achievement (IEA), Trends in International
Mathematics and Science Study (TIMSS), 2015. (*cropped to top 25 only)
Questions:
1. How many countries are above the TIMSS average scale? _________________
2. How many Asian countries are in the top 20? _________________
3. How many points does Singapore exceeds from TIMSS average? _________________
15 15 16 17 17 17 19 19 20 21
21 22 23 23 25 27 27 27 28 28
29 29 29 30 32 33 33 34 34 34
34 35 35 36 36 37 38 38 40 41
41 41 41 42 42 42 44 45 46 46
TOTAL
C. LESSON WRAP-UP
Answer Key
Part A.
1. 18 4. 11th
2. 10 5. Ireland
3. 121 points
Part B.
R = 46 – 15 = 31 K = 6.644 ≈ 7 C = 31/7 = 4.429 ≈ 4
Class Tally f LCB UCB CM <CF >CF Relative
Interval Frequency (%)
15 – 18 IIIII-I 6 14.5 18.5 16.5 6 50 12
19 – 22 IIIII-I 6 18.5 22.5 20.5 12 44 12
23 – 26 III 3 22.5 26.5 24.5 15 38 6
27 – 30 IIIII-IIII 9 26.5 30.5 28.5 24 35 18
31 – 34 IIIII-II 7 30.5 34.5 32.5 31 26 14
35 – 38 IIIII-II 7 34.5 38.5 36.5 38 19 14
39 – 42 IIIII-III 8 38.5 42.5 40.5 46 12 16
43 – 46 IIII 4 42.5 46.5 44.5 50 4 8
TOTAL 50 100%