Data Analytics Ass Group-4 Updated

This document presents definitions and examples of calculating the mean, median, and mode of both ungrouped and grouped data sets. It defines the mean as the average value found by summing all values and dividing by the total count. The median is defined as the middle value of a data set arranged in order. The mode is the most frequently occurring value. Examples are provided to demonstrate calculating these measures for both ungrouped raw data and grouped frequency data. The appropriate use of each measure depending on the data distribution is also discussed.

Uploaded by

ndifon.titian
REPUBLIC OF CAMEROON / REPUBLIQUE DU CAMEROUN
Peace - Work - Fatherland / Paix - Travail - Patrie

UNIVERSITY OF BAMENDA / UNIVERSITE DE BAMENDA

NATIONAL HIGHER POLYTECHNIC INSTITUTE (N.A.H.P.I) / ECOLE SUPERIEURE NATIONALE POLYTECHNIQUE (E.S.N.P)

BAMBILI–BAMENDA

TOPIC: CALCULATING THE MEAN, MEDIAN, AND MODE OF GROUPED AND UNGROUPED DATA
COURSE INSTRUCTOR: Dr. NDUKUM PASCALINE

PRESENTED BY : GROUP 4

LEVEL 400

SCHOOL : NAHPI

DEPARTMENT : COME

COURSE TITLE: DATA ANALYSIS

COURSE CODE: COME4108


Name Matricule

RON VAN NTUMSI NGANG UBa21E0436

SAHLA SANDRINE NYUYKIKFE UBa21E0056

SHENYUY GOODNESS-DIVINE TATA UBa21E0057

SIBEN CLETUS WIYSANYUY UBa21E0058

SIGNE JOSUE EMMANUEL UBa21E0059

SUKA NIBA NDE UBa21E0437

TAMABANG REMY GAFORBE UBa21E0438

TCHUIFO AKO ERASTUS UBa21E0439

TEBIT ODILIA ANWEI UBa21E0440

TENENG CEDRIC MULUH UBa21E0060

TIFUHAGOH SHAQUILLE FORD MBA UBa21E0061

TUMASANG MANOEL ACHIBE UBa21E0062

WANDJI JUNIOR FERNANDEZ AWAFOR UBa21E0064

WANGO ADAMU NJINJOH UBa21E0065

WUYIKA FABRICE KIDZENWA UBa21E0067

YVAN HAROLD KWEMO TCHIENGA UBa21E0068

NDIFON TITIANA SIH UBa23E3040

TCHINDA TCHOFFO MEGANE UBa23E3041

MASSODA TATIANA LOIS NGO UBa20E0037


DEFINITION OF MEAN, MEDIAN AND MODE
Mean
The mean, also known as the average, is a measure of central tendency in a data set. To
calculate the mean, you add up all the values in the data set and then divide the sum by the
number of values.

In general, the mean of the set $\{x_1, x_2, x_3, \dots, x_n\}$ is

$$\bar{x} = \frac{x_1 + x_2 + x_3 + \dots + x_n}{n}$$

For example, consider the data set {2, 4, 4, 4, 5, 5, 7, 9}. The mean would be calculated as follows:

$$\bar{x} = \frac{2 + 4 + 4 + 4 + 5 + 5 + 7 + 9}{8} = \frac{40}{8} = 5$$

So, the mean of this data set is 5. The mean provides a measure of central tendency and is useful for
summarizing a set of values with a single representative value.
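
The definition above can be sketched in a few lines of Python. This is only an illustration; the function name `mean` is chosen here and does not come from the exercise.

```python
def mean(values):
    """Arithmetic mean: the sum of the values divided by how many there are."""
    if not values:
        raise ValueError("the mean of an empty data set is undefined")
    return sum(values) / len(values)


data = [2, 4, 4, 4, 5, 5, 7, 9]
print(mean(data))  # 5.0
```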

Median
The median is another measure of central tendency in a data set. To find the median, you first need to
arrange the values in the data set in ascending order. Then, if the number of values (n) is odd, the
median is the middle value. If the number of values is even, the median is the average of the two middle
values.

For example, consider the data set {4, 9, 4, 5, 2, 5, 4, 7}. After arranging the values in ascending order,
you get {2, 4, 4, 4, 5, 5, 7, 9}. Since there are 8 values (an even number), the median is the average of the
two middle values, which are 4 and 5:

$$\text{Median} = \frac{4 + 5}{2} = \frac{9}{2} = 4.5$$

So, in this case, the median of the data set is 4.5. The median is less sensitive to extreme values than the
mean and can be a useful measure of central tendency, especially when dealing with skewed
distributions or data sets with outliers.
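
The same procedure (sort, then take the middle value or the average of the two middle values) can be sketched in Python. The function name `median` is an illustrative choice.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if n is even."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle value exists.
        return ordered[mid]
    # Even count: average the two values on either side of the middle.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([4, 9, 4, 5, 2, 5, 4, 7]))  # 4.5
```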

Mode

The mode of a data set is the value (or values) that appear most frequently. In other words, it is the
value that occurs with the highest frequency in a data set.

A data set may have one mode, more than one mode, or no mode at all.
For example, consider the data set {2, 4, 4, 4, 5, 5, 7, 9}. In this case, the mode is 4 because it appears
more frequently than any other value.

In a situation where there are multiple values with the same highest frequency, the data set is said to be
multimodal, and it has more than one mode. If no value is repeated, the data set is considered to have no
mode.

Unlike the mean and median, the mode does not always lie near the center of the data. It simply identifies
the most frequently occurring value or values in a data set.
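
A minimal Python sketch of these cases (one mode, several modes, no mode) is given here. The helper name `modes` is hypothetical, and returning an empty list when no value repeats follows the convention stated in the text.

```python
from collections import Counter


def modes(values):
    """Return every value that occurs with the highest frequency, in sorted order."""
    counts = Counter(values)
    if not counts:
        return []
    highest = max(counts.values())
    if highest == 1:
        # No value is repeated, so the data set is treated as having no mode.
        return []
    return sorted(value for value, count in counts.items() if count == highest)


print(modes([2, 4, 4, 4, 5, 5, 7, 9]))  # [4]
print(modes([1, 1, 2, 2, 3]))           # [1, 2]
print(modes([1, 2, 3]))                 # []
```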

Ungrouped Data
Data set from page 551, exercise 3.

Determine for the 10 measurements of lengths shown below:
(a) the arithmetic mean, (b) the median, (c) the mode, and (d) the standard deviation.
28 m, 20 m, 32 m, 44 m, 28 m, 30 m, 30 m, 26 m, 28 m and 34 m

Mean

$$\bar{x} = \frac{\sum x}{n} = \frac{\text{sum of given data}}{\text{number of given data}}$$

$$\bar{x} = \frac{28 + 20 + 32 + 44 + 28 + 30 + 30 + 26 + 28 + 34}{10} = \frac{300}{10}$$

$$\bar{x} = 30 \text{ m}$$

Median
Arranging the data in ascending order:
20 m, 26 m, 28 m, 28 m, 28 m, 30 m, 30 m, 32 m, 34 m, 44 m

Since the data set has an even number of values (10), the median is the mean of the two middle values, the 5th and 6th.

Median = (28 + 30) / 2

Median = 29 m
Mode
The value with the highest number of occurrences is 28 m, which appears three times.

Hence, Mode = 28 m
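
The three results above can be cross-checked with Python's standard `statistics` module. Part (d) of the exercise, the standard deviation, is not worked in this section; the last line below uses the population form (`pstdev`), which is an assumption about what the exercise intends, since the sample form (`stdev`) would give a slightly larger value.

```python
import statistics

# The 10 length measurements from the exercise, in metres.
lengths_m = [28, 20, 32, 44, 28, 30, 30, 26, 28, 34]

mean_m = statistics.mean(lengths_m)      # 30
median_m = statistics.median(lengths_m)  # 29.0
mode_m = statistics.mode(lengths_m)      # 28

# Assumption: population standard deviation (divide by n, not n - 1).
std_dev_m = statistics.pstdev(lengths_m)

print(mean_m, median_m, mode_m, std_dev_m)
```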

GROUPED DATA
Data from exercise 4

The heights of 100 people are measured correct to the nearest centimetre with the following results:
150 – 157 5,
158 – 165 18,
166 – 173 42,
174 – 181 27,
182 – 189 8.

Data Organization

Class limits     Class boundaries   Class midpoint   Frequency   f.x      Cumulative
(heights, cm)    (cm)               (x)              (f)                  frequency (cf)

150 - 157        149.5 – 157.5      153.5            5           767.5    5
158 - 165        157.5 – 165.5      161.5            18          2907     23
166 - 173        165.5 – 173.5      169.5            42          7119     65
174 - 181        173.5 – 181.5      177.5            27          4792.5   92
182 - 189        181.5 – 189.5      185.5            8           1484     100

Totals                                               Σf = 100    Σ(f.x) = 17070
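
The columns of this table can be derived from the class limits and frequencies alone. The sketch below assumes, as the table does, that heights are recorded to the nearest centimetre, so each boundary lies 0.5 cm beyond the class limit. The variable names are illustrative.

```python
# (lower limit, upper limit, frequency) for each height class, in cm.
classes = [(150, 157, 5), (158, 165, 18), (166, 173, 42), (174, 181, 27), (182, 189, 8)]

rows = []
cumulative = 0
for lower, upper, freq in classes:
    midpoint = (lower + upper) / 2   # class midpoint x
    cumulative += freq               # running total of frequencies (cf)
    rows.append({
        "boundaries": (lower - 0.5, upper + 0.5),
        "x": midpoint,
        "f": freq,
        "fx": freq * midpoint,
        "cf": cumulative,
    })

total_f = sum(row["f"] for row in rows)
total_fx = sum(row["fx"] for row in rows)

for row in rows:
    print(row["boundaries"], row["x"], row["f"], row["fx"], row["cf"])
print(total_f, total_fx)  # 100 17070.0
```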

Mean

$$\text{Mean} = \frac{\text{sum of (frequency} \times \text{class midpoint)}}{\text{sum of frequencies}} = \frac{\sum (f x)}{\sum f} = \frac{17070}{100}$$

Mean = 170.7 cm
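
As a quick check of this calculation, here is a short Python sketch of the grouped mean. The function name `grouped_mean` is hypothetical. Because each class is represented by its midpoint, the result is an estimate of the mean of the underlying heights.

```python
def grouped_mean(midpoints, frequencies):
    """Estimate the mean of grouped data as sum(f * x) / sum(f)."""
    total_f = sum(frequencies)
    if total_f == 0:
        raise ValueError("total frequency must be positive")
    return sum(f * x for x, f in zip(midpoints, frequencies)) / total_f


midpoints = [153.5, 161.5, 169.5, 177.5, 185.5]
frequencies = [5, 18, 42, 27, 8]
print(grouped_mean(midpoints, frequencies))  # 170.7
```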
Median
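The median class is the first class whose cumulative frequency reaches Σf / 2 = 50, which is 166 - 173 (cf = 65). In this data set it is also the modal class.
Let $L_{cbm}$ = lower class boundary of the median class = 165.5
$Cf_{bmc}$ = cumulative frequency of the class before the median class = 23
$W$ = class width = 8
$f_m$ = frequency of the median class = 42

$$\text{median} = L_{cbm} + \left( \frac{\frac{\sum f}{2} - Cf_{bmc}}{f_m} \right) W$$

$$\text{median} = 165.5 + \left( \frac{\frac{100}{2} - 23}{42} \right) 8$$

median = 170.6 cm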
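
The interpolation can be sketched in Python as follows. The function name `grouped_median` is hypothetical, and the sketch locates the class to interpolate in as the first class whose cumulative frequency reaches half the total, which for this data is 166 - 173.

```python
def grouped_median(boundaries, frequencies):
    """Estimate the median of grouped data by linear interpolation within one class.

    boundaries: list of (lower boundary, upper boundary) pairs, in class order.
    frequencies: matching list of class frequencies.
    """
    total = sum(frequencies)
    if total == 0:
        raise ValueError("total frequency must be positive")
    half = total / 2
    cumulative_before = 0
    for (lower, upper), freq in zip(boundaries, frequencies):
        if cumulative_before + freq >= half:
            width = upper - lower
            return lower + (half - cumulative_before) / freq * width
        cumulative_before += freq
    raise ValueError("no class reaches half of the total frequency")


boundaries = [(149.5, 157.5), (157.5, 165.5), (165.5, 173.5), (173.5, 181.5), (181.5, 189.5)]
frequencies = [5, 18, 42, 27, 8]
print(round(grouped_median(boundaries, frequencies), 1))  # 170.6
```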

Mode
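The modal class is the class with the highest frequency, here 166 - 173 (f = 42).
Let $L_{cbm}$ = lower class boundary of the modal class = 165.5 and $W$ = class width = 8
$\Delta_1$ = difference between the frequency of the modal class and that of the class before it (158 - 165) = 42 − 18
$\Delta_2$ = difference between the frequency of the modal class and that of the class after it (174 - 181) = 42 − 27

$$\text{mode} = L_{cbm} + \left( \frac{\Delta_1}{\Delta_1 + \Delta_2} \right) W$$

$$\text{mode} = 165.5 + \left( \frac{42 - 18}{(42 - 18) + (42 - 27)} \right) 8$$

mode = 170.4 cm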
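
A Python sketch of the grouped mode is given here, using the sum Δ1 + Δ2 in the denominator, which is the form that reproduces the 170.4 cm result. The function name `grouped_mode` is hypothetical, and treating a missing neighbouring class as having frequency 0 is an assumption made for this sketch.

```python
def grouped_mode(boundaries, frequencies):
    """Estimate the mode of grouped data by interpolating inside the modal class."""
    # Index of the modal class (the first one, if several share the highest frequency).
    i = max(range(len(frequencies)), key=lambda k: frequencies[k])
    lower, upper = boundaries[i]
    f_modal = frequencies[i]
    # Assumption: a missing neighbouring class is treated as frequency 0.
    f_before = frequencies[i - 1] if i > 0 else 0
    f_after = frequencies[i + 1] if i < len(frequencies) - 1 else 0
    delta_1 = f_modal - f_before
    delta_2 = f_modal - f_after
    if delta_1 + delta_2 == 0:
        raise ValueError("modal class is not distinct from its neighbours")
    return lower + delta_1 / (delta_1 + delta_2) * (upper - lower)


boundaries = [(149.5, 157.5), (157.5, 165.5), (165.5, 173.5), (173.5, 181.5), (181.5, 189.5)]
frequencies = [5, 18, 42, 27, 8]
print(round(grouped_mode(boundaries, frequencies), 1))  # 170.4
```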

Significance of the Mean, Median and Mode, and How to Choose an Appropriate Measure of Center for a Given Data Set
The mean, median, and mode are measures of central tendency, and each provides insight into different
aspects of a data set. The choice of which measure to use depends on the nature of the data and the
specific characteristics of the distribution.

Mean:
• Significance: The mean is the average of all the values in a data set.
• Use when: The mean is most appropriate when the data is approximately symmetrically
distributed and does not have extreme outliers. It is sensitive to extreme values and may not be
the best measure if the data set is skewed.

Median:
• Significance: The median is the middle value in a data set when it is ordered. It is largely unaffected
by extreme values (outliers) and is a measure of the central position.
• Use when: The median is useful when the data set is skewed, has outliers, or is not normally
distributed. It provides a better representation of central tendency in such cases.

Mode:
• Significance: The mode is the value(s) that occur most frequently in a data set.
• Use when: The mode is suitable for categorical data or discrete data sets. It is also useful when
identifying the most common response or category is important. In some cases, a data set may
have no mode, or it may be multimodal (having more than one mode).

In summary:
Use the mean when the data is approximately symmetric (for example, roughly normally distributed) and there are no significant outliers.

Use the median when the data is skewed or contains outliers, providing a robust measure of central
tendency.

Use the mode when identifying the most frequently occurring category or value is essential, especially
for categorical or discrete data.
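
To illustrate why outliers matter for this choice, the sketch below takes the data set {2, 4, 4, 4, 5, 5, 7, 9} used in the definitions and replaces the largest value with 90. The modified data set is a made-up example, not data from the exercises.

```python
import statistics

original = [2, 4, 4, 4, 5, 5, 7, 9]
# Hypothetical variant: the largest value is replaced by an extreme outlier.
with_outlier = [2, 4, 4, 4, 5, 5, 7, 90]

for name, data in (("original", original), ("with outlier", with_outlier)):
    print(name, statistics.mean(data), statistics.median(data), statistics.mode(data))
# original 5 4.5 4
# with outlier 15.125 4.5 4
```

The mean moves from 5 to 15.125, while the median and mode are unchanged, which is why the median is preferred for skewed data or data with outliers.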
