Case Study Dbm30033 Group1 - 98
Case Study Dbm30033 Group1 - 98
LECTURER
SIR SHAHZAN HAFIZ BIN
SAMSUDIN
07DKM21F2009
MUHAMMAD HAFIZ BIN DUHA
07DKM21F2045
MUHAMMAD NIKMAN BIN
MERJUNI
07DKM21F2029
ANDI AHMAD SYAZWAN BIN ATTO
07DKM21F2027
MOHAMMAD KHUZAINI
TABLE OF CONTENT
1. Introduction 1-2
2. Data 3
6. Conclusion 15
- Recommendation
7. Reference 16
1.Introduction
First of all, sustainable technology is a general term that describes innovations that consider
natural resources and foster economic and social development. The goal of this technology is to
drastically reduce environmental and ecological risks and create sustainable products. The
development of electric and hybrid vehicles is one of the sustainable products which is the
manufacturer's effort to reduce environmental risks. In addition, hybrid electric vehicle is a type
of hybrid vehicle that combines an internal combustion engine with an electric propulsion
system. The presence of an electric power system aims to achieve either better fuel economy
compared to conventional vehicles or better performance. There are different types of HEVs,
and the degree to which they function as electric vehicles also varies. The most common form of
HEV is the hybrid electric car, although hybrid electric trucks (pickups and tractors) and buses
also exist. Therefore,we chose the title which is the number of electric and hybrid vehicle
registrations in Malaysia because of the benefits of using it as a sustainable technology for the
environment in Malaysia.
Into the next point, the techniques for assembling, organising, and analysing data are part of the
mathematical discipline of statistics, which can be used to draw insightful conclusions. Its
investigations and analyses often fall into one of two primary categories: descriptive statistics or
inferential statistics. When analysing data for descriptive statistics, no inferences are drawn
from the data. The data is presented via tables and graphs. The qualities of the data are described
in straightforward terms. The topics addressed on a daily basis include accidents, product prices,
corporate profits, earnings, epidemics, sports statistics, and demographic data. Inferential
statistics is a scientific discipline that uses mathematical techniques to generate predictions and
projections by analysing the existing data. It fulfils people's needs.
1.2. Central Tendency
In statistics, a central tendency (or measure of central tendency) is a central or typical value for a
probability distribution. Colloquially, measures of central tendency are often called averages.
The term central tendency dates from the late 1920s. The most common measures of central
tendency are the arithmetic mean, the median, and the mode. A middle tendency can be
calculated for either a finite set of values or for a theoretical distribution, such as the normal
distribution. Occasionally authors use central tendency to denote "the tendency of quantitative
data to cluster around some central value. The central tendency of a distribution is typically
contrasted with its dispersion or variability; dispersion and central tendency are the often
characterized properties of distributions. Analysis may judge whether data has a strong or a
weak central tendency based on its dispersion
2
2.Data
2.1. Data below show the total registration of electric and hybrid vehicle in malaysia
𝑟𝑎𝑛𝑔𝑒
𝐶=
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑐𝑙𝑎𝑠𝑠, 𝑘
739
=
7
= 105.57
= 106
3
3.4. Frequency table
No. of total
registration Frequency Lower Upper Midpoint Cumulative
boundary boundary frequency
138-243 10 137.5 243.5 190.5 10
244-349 12 243.5 349.5 296.5 22
350-455 1 349.5 455.5 402.5 23
456-561 3 455.5 561.5 508.5 26
562-667 3 561.5 667.5 614.5 29
668-773 1 667.5 773.5 720.5 30
774-879 2 773.5 879.5 826.5 32
4
4.MEASURES OF CENTRAL TENDENCY AND DISPERSION
4.1. Mean, x
No. of total
f x fx
registration
138-243 10 190.5 1905
244-349 12 296.5 3558
350-455 1 402.5 402.5
456-561 3 508.5 1525.5
562-667 3 614.5 1843.5
668-773 1 720.5 720.5
774-879 2 826.5 1653
∑ f =32 ∑ fx =11608
∑ 𝒇𝗑̅
mean, 𝗑̅ =
∑ 𝒇
11 608
=
32
= 362.72
5
4.2. Median, m
No. of total
registration Frequency, f Cumulative Lower boundary
frequency, cf
138-243 10 10 137.5
244-349 12 22 243.5
350-455 1 23 349.5
456-561 3 26 455.5
562-667 3 29 561.5
668-773 1 30 667.5
774-879 2 32 773.5
Lm = 243.5
𝑁 32
= = 16
2 2
F = 10
fm = 12
C = 106
(𝑁 ) − 𝐹
𝑚𝑒𝑑𝑖𝑎𝑛, 𝑚 = 𝐿𝑚 + [( 2 ) 𝐶]
𝑓𝑚
32
( ) − 10
2 ) 106]
𝑚𝑒𝑑𝑖𝑎𝑛, 𝑚 = 243.5 + [(
12
= 243.5 + 53
= 296.5
6
4.3. Mode, m0
No. of total
registration Frequency, f Cumulative Lower
frequency, cf boundary
138-243 10 10 137.5
Highest
244-349 12 22 243.5
frequency
350-455 1 23 349.5
456-561 3 26 455.5
562-667 3 29 561.5
668-773 1 30 667.5
774-879 2 32 773.5
𝐿𝑚0 = 243.5
𝑑1 = 12 − 10 = 2
𝑑2 = 12 − 1 = 11
𝐶 = 106
𝑑1
𝑀𝑜𝑑𝑒, 𝑚0 = 𝐿𝑚0 + [(
𝑑 1+ 𝑑2 ) 𝐶]
= 243.5 + [( 2 ) 106]
2 + 11
= 243.5 + 16.3
= 259.8
7
4.4. Mean deviation, E
No. of total
f x fx 𝗑̅ − 𝗑̅ |𝗑̅ − 𝗑̅| |𝗑̅ − 𝗑̅
|𝒇
registration
138-243 10 190.5 1905 -172.22 172.22 1722.2
244-349 12 296.5 3558 -66.22 66.22 794.64
350-455 1 402.5 402.5 39.78 39.78 39.78
456-561 3 508.5 1525.5 145.78 145.78 437.34
562-667 3 614.5 1843.5 251.78 251.78 755.34
668-773 1 720.5 720.5 357.78 357.78 357.78
774-879 2 826.5 1653 463.78 463.78 927.56
∑ f =32 ∑ fx = = 5034.64
11 608
11 608
𝑚𝑒𝑎𝑛, 𝑥̅ =
32
𝑥̅ = 362.72
∑|𝑥̅ − 𝑥̅ |𝑓
𝐸=
∑𝑓
5034.64
𝐸=
32
𝐸 = 157.33
8
4.5. Variance, 𝑆2
∑|𝑥̅ − 𝑥̅|2𝑓
𝑆2 =
∑𝒇
1162926.09
𝑆2 =
32
𝑆2 = 36341.44
i. Standard deviation, S
𝑆 = √𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
𝑆 = √36341.44
𝑆 = 190.6343
9
ii. First Quartile, Q1
𝐾
𝑄𝑘 = × 𝑡𝑜𝑡𝑎𝑙 𝑑𝑎𝑡𝑎, 𝑁
4
1
𝑄1 = × 32
4
𝑄1 = 8
𝐾𝑁 − 𝐹
𝑄1 = 𝐿𝑄1 + [( 4 ) 𝐶]
𝑓𝑄1
8−0
𝑄1 = 137.5 + [( ) 106]
10
𝑄1 = 137.5 + 84.8
𝑄1 = 222.3
10
iii. Third Quartile, Q3
𝐾
𝑄𝑘 = × 𝑡𝑜𝑡𝑎𝑙 𝑑𝑎𝑡𝑎, 𝑁
4
3
𝑄3 = × 32
4
𝑄3 = 24
𝐾𝑁
𝑄3 = 𝐿𝑄3 + [( 4 − 𝐹) 𝐶]
𝑓𝑄1
24 − 23
𝑄3 = 455.5 + [( ) 106]
3
𝑄3 = 455.5 + 35.3
𝑄3 = 490.8
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
𝐼𝑄𝑅 = 490.8 − 222.3
𝐼𝑄𝑅 = 268.5
11
v. 9th Decile, D9
𝐾
𝐷𝑘 = × 𝑡𝑜𝑡𝑎𝑙 𝑑𝑎𝑡𝑎, 𝑁
10
9
𝐷9 = × 32
10
𝐷9 = 28.8
𝐾𝑁
𝐷9 = 𝐿𝐷9 + [( 10 − 𝐹) 𝐶]
𝑓𝐷9
28.8 − 26
𝐷9 = 561.5 + [( ) 106]
3
𝐷9 = 561.5 + 98.9
𝐷9 = 660.4
12
5. GRAPH HISTOGRAM AND OGIVE
5.1 HISTOGRAM
No. of total
registration Frequency Lower Upper Midpoint
boundary boundary
138-243 10 137.5 243.5 190.5
244-349 12 243.5 349.5 296.5
350-455 1 349.5 455.5 402.5
456-561 3 455.5 561.5 508.5
562-667 3 561.5 667.5 614.5
668-773 1 667.5 773.5 720.5
774-879 2 773.5 879.5 826.5
13
5.2 OGIVE
No. of total
registration Frequency Lower Upper Midpoint Cumulative
boundary boundary frequency
138-243 10 137.5 243.5 190.5 10
244-349 12 243.5 349.5 296.5 22
350-455 1 349.5 455.5 402.5 23
456-561 3 455.5 561.5 508.5 26
562-667 3 561.5 667.5 614.5 29
668-773 1 667.5 773.5 720.5 30
774-879 2 773.5 879.5 826.5 32
14
6. CONCLUSION
In conclusion, there are similarities and slight differences in the number of results found in the
use of both methods, namely the formula method and the graph method. For the histogram
graph method, the central tendency that needs to be measured is the mode where the number of
modes found on the histogram graph is 259.4 while the number of modes for the formula
method is 259.8. therefore, the total difference for the two methods for the mode is 0.4. Also,
for the ogive graph, the central tendency to be measured is the median, first quartile, third
quartile and ninth decile. the median number for the formula method is 296.5 while the median
number on the graph has the same result. Therefore, the median sum for both methods is the
same. Next, the number of the first quartile obtained on the graph is 222.3 which is also the
same number on the formula method. Next, for the number of the third quartile obtained on the
graph is 492.6 compared to the number found in the formula method which is 490.5 which
causes a total difference of 2.1 in both methods. the last is the ninth decile where the number
obtained on the graph is 662.2 while the number obtained on the formula method is 660.4.
therefore, the total difference in results in both methods is as much as 1.8.
6.1 RECOMMENDATION
The recommendation that we can give to reduce the difference or amount from both the graph
method and the formula, the part that needs to be improved is in the graph method where to find
the same amount, the graph needs to be drawn correctly for example in the ogive graph where
each curve needs to be drawn neatly and marked with carefully so that both methods still share
the same amount of data even if there is a slight difference in the number of decimals only.
15
7 . REFERENCE
1) https://en.wikipedia.org/wiki/Central_tendency
2) https://en.m.wikipedia.org/wiki/Hybrid_electric_vehicle
3) https://www.rubicon.com/sustainability-hub/articles/what-is-sustainable-
technology/#:~:text=Sustainable%20technology%20is%20an%20umbrella,to%2
0create%20a%20sustainable%20product
4) https://www.data.gov.my/data/ms_MY/dataset?q=Kenderaan%20elektrik
16