Central of Tendency: Febiola Genesis Lee Khovtsheva Getsemanita Yanita Silitonga
Febiola Genesis
Lee Khovtsheva Getsemanita
Yanita Silitonga
■ This formula is usually written in a slightly different manner using the Greek capital
letter Σ, pronounced "sigma", which means "sum of...":
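Assuming the formula in question is the sample mean of n scores x_1, ..., x_n (the symbols here are assumed for illustration, not taken from the slides), the sigma form can be sketched as:

```latex
\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{x_1 + x_2 + \cdots + x_n}{n}
```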
MEDIAN
The median is the middle score for a set of data that has been arranged in
order of magnitude. The median is less affected by outliers and skewed data than the mean. In
order to calculate the median, suppose we have the data below:
65 55 89 56 35 14 56 55 87 45 92
We first need to rearrange that data into order of magnitude (smallest first):
14 35 45 55 55 56 56 65 87 89 92
Our median mark is the middle mark, in this case 56 (the 6th score). It is the middle
mark because there are 5 scores before it and 5 scores after it. This works fine when you
have an odd number of scores, but what happens when you have an even number of scores?
What if you had only 10 scores? In that case, you take the middle two scores and
average them. So, if we look at the example below:
65 55 89 56 35 14 56 55 87 45
We again rearrange that data into order of magnitude (smallest first):
14 35 45 55 55 56 56 65 87 89
Only now we have to take the 5th and 6th scores in our data set (55 and 56) and average them to get a
median of 55.5.
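The two cases above can be checked with a short Python sketch. The function name `median` and the variable names are chosen here for illustration; the scores are the ones from the slides.

```python
def median(scores):
    """Return the median of a non-empty list of numbers."""
    ordered = sorted(scores)          # arrange in order of magnitude
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd number of scores: the single middle score.
        return ordered[mid]
    # Even number of scores: average the two middle scores.
    return (ordered[mid - 1] + ordered[mid]) / 2


odd_scores = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]
even_scores = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45]

print(median(odd_scores))   # 56
print(median(even_scores))  # 55.5
```

Python's standard library offers `statistics.median`, which should give the same results; the hand-written version is shown only to mirror the steps in the text.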
MODE
The mode is the most frequent score in our data set. On a bar chart or histogram, it
corresponds to the highest bar. You can, therefore, sometimes consider the
mode as being the most popular option. An example of a mode is presented below:
Normally, the mode is used for categorical data where we wish to know which is the most
common category, as illustrated below:
We can see above that the most common form of transport, in this particular data set, is the
bus. However, one of the problems with the mode is that it is not unique, so it leaves us with
problems when we have two or more values that share the highest frequency, such as
below:
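Both situations, a single mode and a tie, can be sketched in Python with `statistics.multimode` (available from Python 3.8). The survey answers below are hypothetical and were made up for this illustration; they are not the data behind the slides.

```python
from statistics import multimode

# Hypothetical survey answers: one category is clearly the most common.
transport = ["bus", "car", "bus", "train", "walk", "bus", "car"]
single = multimode(transport)
print(single)  # ['bus']

# Hypothetical answers where two categories share the highest frequency.
tied_transport = ["bus", "car", "bus", "car", "train"]
tied = multimode(tied_transport)
print(tied)  # ['bus', 'car']
```

`multimode` returns every value that reaches the highest frequency, which makes the non-uniqueness of the mode visible instead of silently picking one value.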
SUMMARY OF WHEN TO USE THE MEAN, MEDIAN AND MODE
Type of variable    Best measure of central tendency
Nominal             Mode
Ordinal             Median
DECILES
Decile rankings are just another way to categorize data. Which system you use is usually
a judgment call. For example, if you wanted to display class rankings on a pie chart,
using deciles would make more sense than percentiles. That's because a pie chart with
10 categories would be much easier to read than a pie chart with 100 categories.
In a similar way, the deciles of a distribution are the nine values that split the data
set into ten equal parts.
You should not try to calculate deciles from small data sets. A single class of
marks is too small to give useful values, since the extreme deciles are very
variable. However, the deciles can be useful descriptions for larger data sets
such as national distributions for marks from standard tests.
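As a rough sketch, the nine decile cut points can be obtained in Python with `statistics.quantiles` (Python 3.8+). The marks below are a hypothetical data set of 100 values invented for this example, and the exact cut points depend on the interpolation method the function uses.

```python
from statistics import quantiles

# Hypothetical marks: the integers 1 to 100 (illustration only).
marks = list(range(1, 101))

# n=10 asks for the cut points that split the data into ten equal parts.
deciles = quantiles(marks, n=10)

print(len(deciles))  # 9 cut points
print(deciles)
```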
PERCENTILES
In a similar way, the percentiles of a distribution are the 99 values that split the data set
into a hundred equal parts. These percentiles can be used to categorise the individuals
into percentile 1, ..., percentile 100. A very large data set is required before the extreme
percentiles can be estimated with any accuracy. (The 'random' variability in marks is
especially noticeable in the extremes of a data set.)
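The same idea can be sketched for percentiles: compute the 99 cut points, then place an individual in one of the 100 groups. The helper `percentile_group` and the data set of 1000 marks are hypothetical, introduced here only for illustration.

```python
from bisect import bisect_left
from statistics import quantiles


def percentile_group(cut_points, value):
    """Return the group number (1 to 100) that `value` falls into."""
    # The number of cut points below `value`, plus one, is its group.
    return bisect_left(cut_points, value) + 1


# Hypothetical marks: the integers 1 to 1000 (illustration only).
marks = list(range(1, 1001))

# n=100 asks for the 99 cut points that split the data into 100 equal parts.
cut_points = quantiles(marks, n=100)

print(len(cut_points))                    # 99
print(percentile_group(cut_points, 1))    # 1
print(percentile_group(cut_points, 1000)) # 100
```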
RESOURCES
- https://statistics.laerd.com/statistical-guides/measures-central-tendency-mean-mode-median.php
- https://www.statisticshowto.datasciencecentral.com/what-are-quartiles/