Assessment For Learning Unit 10: Technological Based Quantitative and Qualitative Analysis of Learning Outcomes
In Table, the numbers of schools have been shown according to the management of schools. The
schools have been classified into 4 categories, namely, Government Schools, Local Body
Schools, Private Aided Schools and Private Unaided Schools. A given school belongs to exactly one
of the four categories. Such data are known as Categorical or Qualitative Data. Here the category
or the quality referred to is management. Thus categorical or qualitative data result from
information which has been classified into categories. Such categories are listed alphabetically or
in order of decreasing frequencies or in some other conventional way. Each piece of data clearly
belongs to one classification or category.
We frequently come across categorical or qualitative data in the form of schools categorized
according to Boys, Girls and Co-educational; Students' Enrolment categorized according to SC,
ST, OBC and 'Others'; number of persons employed in various categories of occupations, and so
on.
Let us consider another set of data given in Table
Number of Schools according to Enrolment
Enrolment      No. of Schools
Up to 50       6
51 - 100       15
101 - 200      12
201 - 300      8
Above 300      4
Total          45
In Table, numbers of schools have been shown according to the enrolment of students in
the school. Schools with enrolment varying in a specified range are grouped together, e.g. there are
15 schools where the students enrolled are any number between 51 and 100. As the grouping is
based on numbers, such data are called Numerical or Quantitative Data. Thus, numerical or
quantitative data result from counting or measuring. We frequently come across numerical data in
newspapers, advertisements etc. related to the temperature of the cities, cricket averages, incomes,
expenditures and so on.
Table
From Table, one can easily comprehend the distribution of marks e.g. 10 students have
scores from 25 to 29, while only 7 students have a score lower than 50% etc.
Various terms related to the tabulation of data are being discussed below:
Table shows the marks arranged in descending order of magnitude and their corresponding
frequencies. Such a table is known as frequency distribution. A grouped frequency distribution
has a minimum of two columns - the first has the classes arranged in some meaningful order, and
a second has the corresponding frequencies. The classes are also referred to as class intervals. The
range of scores or values in each class interval is the same. In the given example the first class
interval is from 45 to 49 having a range of 5 marks i.e. 45, 46, 47, 48, and 49. Here 45 is the lower
class limit and 49 is the upper class limit. As discussed earlier the score of 45 may be anywhere
from 44.5 to 45.5, so the exact lower class limit is 44.5 instead of 45. Similarly, the exact upper
class limit is 49.5 instead of 49. The range of the class interval is 49.5 - 44.5 = 5 i.e. the difference
between the upper limit of class interval and the lower limit of class interval.
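The relation between stated and exact class limits can be checked with a small Python sketch. The helper name `exact_limits` is only illustrative, and it assumes that scores are recorded to the nearest whole number, as in the example above.

```python
def exact_limits(lower, upper, unit=1):
    """Return the exact limits of a class whose stated limits are lower..upper.

    Assumes scores are recorded to the nearest `unit`, so each stated
    limit extends half a unit on either side.
    """
    half = unit / 2
    return lower - half, upper + half


low, high = exact_limits(45, 49)
width = high - low  # size (range) of the class interval
print(low, high, width)  # 44.5 49.5 5.0
```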
For the presentation of data in the form of a frequency distribution for grouped data, a
number of steps are required. These steps are:
1. Selection of non-overlapping classes.
2. Enumeration of data values that fall in each class.
3. Construction of the table.
Let us consider the score of 120 students of class X of a school in Mathematics, shown in
Table.
Table, Mathematics score of 120 class X Students
71 85 41 88 98 45 75 66 81 38 52 67 92 62 83 49 64 52 90 61 58 63 91 57 48
75 89 73 64 80 67 76 65 76 65 61 68 84 72 57 77 63 52 56 41 60 55 75 53 45
37 91 57 40 73 66 76 52 88 62 78 68 55 67 39 65 44 47 58 68 42 90 89 39 69
48 82 91 39 85 44 71 68 56 48 90 44 62 47 83 80 96 69 88 24 44 38 74 93 39
72 56 46 71 80 46 54 77 58 81 70 58 51 78 64 84 50 95 87 59
The preferred lengths of class interval are 2, 3, 5, 10 and 20. Here if we take class length of 10
then the number of class intervals will be 62/10 = 6.2 or 7, which is less than the desired number of
classes. If we take class length of 5 then the number of class intervals will be 62/5 = 12.4 or 13,
which is desirable.
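The arithmetic above amounts to dividing the range by the class length and rounding up. A minimal sketch, in which the function name `number_of_classes` is an illustrative assumption and 62 is the range quoted in the text:

```python
import math


def number_of_classes(data_range, class_length):
    """Smallest number of classes of the given length that covers the range."""
    return math.ceil(data_range / class_length)


for length in (10, 5):
    print(length, number_of_classes(62, length))
# 10 7
# 5 13
```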
Now, where to start the first class interval? The highest score of 98 is included in each of the
three class intervals of length 5, i.e. 94 - 98, 95 - 99 and 96 - 100. We choose the interval 95 - 99 as
the score 95 is a multiple of 5. So the 13 classes will be 95 - 99, 90 - 94, 85 - 89, 80 - 84, ..., 35 - 39.
Here, we have two advantages. One, the mid points of the classes are whole numbers, which
sometimes you will have to use. Second, when we start with the multiple of the length of class
interval, it is easier to mark tallies. When the size of class interval is 5, we start with 0, 5, 10, 15,
20 etc.
To know about these advantages, you may try the other combinations also e.g. 94 -98, 89 -
93, 84 - 88, 79 -83 etc. You will observe that marking tallies in such classes is a bit more difficult.
You may also take the size of the class interval as 4. There you will observe that the mid points are
not whole numbers. So, while selecting the size of the class interval and the limits of the classes,
one has to be careful.
After writing the 13 class intervals in descending order and putting tallies against the
concerned class interval for each of the scores, we present the frequency distribution as shown in
Table 12.8.
Let us take the first score in the first row i.e. 71. The score of 71 is in the class interval 70 - 74
(70, 71, 72, 73, 74), so a tally (/) is marked against 70 - 74. The second score in the first row is
85, which lies in the class interval 85 - 89 (85, 86, 87, 88, 89), so a tally (/) is marked against 85 - 89.
Similarly, by taking all the 120 scores, tallies are put one by one. While marking the tallies, put
your finger on the scores, as a mistake can reduce the whole process to naught. The total tallies
should be 120 i.e. the total number of scores. When against a particular class interval there are four
tallies (////) and you have to mark the fifth tally, cross the four tallies with a diagonal stroke to make it 5. So while
marking the tallies we make clusters of 5 tallies. By counting the number of tallies, the
frequencies are recorded against each of the class intervals. This completes the construction of the table.
In Table, the exact limits of class interval 95 - 99 are 94.5 and 99.5, as the score of 95 ranges
from 94.5 to 95.5 and the score of 99 ranges from 98.5 to 99.5, making the exact range from 94.5
to 99.5. As discussed earlier the data are continuous based on the nature of the variable. The class
interval, though customarily arranged in descending order, can also be arranged in ascending order.
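The tallying procedure can be sketched in Python. This is only an illustration: it uses the first row of the score table (reading "8138" as the two scores 81 and 38), and the helper name `class_of` is an assumption, not part of the text.

```python
from collections import Counter

# First row of the score table (assuming "8138" stands for 81 and 38).
scores = [71, 85, 41, 88, 98, 45, 75, 66, 81, 38, 52, 67, 92,
          62, 83, 49, 64, 52, 90, 61, 58, 63, 91, 57, 48]

WIDTH = 5  # length of each class interval


def class_of(score, width=WIDTH):
    """Return (lower, upper) stated limits; classes start at multiples of width."""
    lower = (score // width) * width
    return lower, lower + width - 1


# Each score contributes one tally to its class.
frequency = Counter(class_of(s) for s in scores)

# Print the classes in descending order, as is customary.
for lower, upper in sorted(frequency, reverse=True):
    f = frequency[(lower, upper)]
    print(f"{lower:>3} - {upper:<3} {'/' * f:<6} {f}")

print("Total", sum(frequency.values()))  # Total 25
```

The total of the frequencies should equal the number of scores entered, which is a quick check against tallying mistakes.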
Concept
You as a teacher might be coming across a variety of data pertaining to students' achievement
or other characteristics, both of individuals or groups of individuals. We may often be interested in
having a concise description of the performance of the group as a whole. In case there are more
than one group one may like to compare the groups in terms of their typical performance. Such
descriptions of group performances are known as measures of central tendency. Let us assume that
we have got the scores of students of three sections of class IX with 40 students each in these
sections. We may compute an index of the sets of scores of 40 students in each section which would
represent the average performance of the three sections in a given subject. Such an index would be
a measure of central tendency. It can very well be used to understand the nature of scores in each
section and for making inter-group comparisons. The most commonly used measures of central
tendency are
➢ Mode
➢ Median
➢ Mean.
THE MODE
Data obtained on the nominal scale is of classificatory type and mostly qualitative. We can
count the number of cases in each category and obtain the frequencies. We may then be interested
in noting down the class which is most populous or popular. We frequently deal with 'scores' in
measurement in education. The score obtained by the largest number of individuals is the mode of
that group of scores. For example, if in a section of 40 students of class IX the number of students
obtaining the score of 55 is the highest, 55 would be called the mode of the scores for that section.
Generally such values are seen to be centrally located, with other values in either direction having
relatively lower frequencies. Thus the mode presents a rough estimate of the most typical or the
average score in a group of values. It is not essential to have precise scores of all the individuals of
the group for finding out mode.
For continuous variables mode provides a quick measure which is less precise and less
dependable as compared to other measures of central tendency. If you draw a frequency polygon
or a histogram, you will notice the maximum height of this point or the bar.
Sometimes the scores of a group tend to concentrate on two distinctly separate places on the
scale. In such a situation the distribution is said to be bimodal and the value or score with highest
frequency cannot be said to be the mode. You may examine the following histogram and frequency
polygon.
In the above histogram and frequency polygon you may notice that the distribution has two
peaks, one at score 6 and the other at score 14. Obviously 14 cannot be the only mode here. Hence
it represents a bimodal distribution having two modes at 6 and 14. Some distributions can even be
multimodal i.e. having more than two modes. We may define Mode as the point on the scale of
measurement with largest frequency in relation to other frequency values in the neighborhood.
95 - 99 4
90 - 94 8
85 - 89 5
80 - 84 2
In the given distribution there are maximum (8) frequencies in the class interval 90 - 94. So
the midpoint, i.e. 92, is the mode.
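As a rough sketch, the same crude mode can be found by picking the class with the largest frequency and taking its midpoint. The variable names are illustrative.

```python
# Class intervals (stated limits) and frequencies from the distribution above.
distribution = {(95, 99): 4, (90, 94): 8, (85, 89): 5, (80, 84): 2}

# The modal class is the class with the largest frequency.
modal_class = max(distribution, key=distribution.get)
lower, upper = modal_class
crude_mode = (lower + upper) / 2  # midpoint of the modal class

print(modal_class, crude_mode)  # (90, 94) 92.0
```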
• When the most typical value is wanted as a measure of central tendency. For instance, the
most liked boy in the class, the most popular belief of students about vocational courses etc.,
etc.
• When a quick and approximate measure of central tendency is required.
• When data is incomplete or the distribution is skewed, where most of the values are towards
the extremes.
Limitations of mode
Mode has the limitations associated with the scale of measurement for which it stands. Mode
can obviously not be subjected to further statistical analysis. It remains as only a rough estimate.
Sometimes we may come across bimodal distributions (having two modes) and we do not easily
find one composite measure. You may examine the following two situations and appreciate the
limitations of mode:
Situation I: The scores of students in History for Class VII A are as follows:
22, 37, 45, 66, 32, 64, 65, 67, 66, 67, 65, 67, 38, 66, 66, 65, 32, 66, 67, 65, 64, 64, 67, 52,
47, 67, 68, 67, 70
Situation II: The scores of students in Maths for Class IX A are as follows:
18, 20, 23, 24, 24, 25, 24, 24, 24, 30, 35, 40, 46, 48, 50, 56, 62, 62, 62, 62, 60, 47, 38, 62,
62, 24, 28, 62, 80
An inspection of situation I gives the mode of 67 while the adjacent scores of 64, 65 and 66
seem to be equally potent to become mode. In situation II you notice a bimodal distribution having
two modes at 24 and 62 as both seem to be equally frequent in their own places. We may thus
conclude that mode is only a crude measure which can be of value when a quick and rough estimate
of central tendency is required.
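The two situations can be inspected with a short Python sketch that counts how often each score occurs. The list names are illustrative; the scores are those listed above.

```python
from collections import Counter

history_vii_a = [22, 37, 45, 66, 32, 64, 65, 67, 66, 67, 65, 67, 38, 66, 66,
                 65, 32, 66, 67, 65, 64, 64, 67, 52, 47, 67, 68, 67, 70]
maths_ix_a = [18, 20, 23, 24, 24, 25, 24, 24, 24, 30, 35, 40, 46, 48, 50,
              56, 62, 62, 62, 62, 60, 47, 38, 62, 62, 24, 28, 62, 80]

for name, scores in (("Situation I", history_vii_a), ("Situation II", maths_ix_a)):
    counts = Counter(scores)
    # most_common(2) lists the two most frequent scores with their counts
    print(name, counts.most_common(2))
# Situation I [(67, 7), (66, 5)]
# Situation II [(62, 7), (24, 6)]
```

In Situation II a strict count puts 62 ahead of 24 by a single case, so reporting only the most frequent score would hide the second peak.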
THE MEDIAN
When data have been arranged in rank order the measure of central tendency may be found
by locating a point that divides the whole distribution into two equal halves. Thus median may be
defined as the point on the scale of measurement below and above which lie exactly 50 percent of
the cases. Median can therefore be found for truncated (incomplete) data provided we know the
total number of cases and their possible placements on the scale. It may be noted that median is
defined as a point and not as a score or any particular measurement.
In the above example there are a total of 40 cases. We have to find a point below and above
which lie 20 cases. There are 13 cases in top 3 class intervals and 19 cases in the bottom four class
intervals. The point segregating the values into two halves may be found in class interval 30 - 34
which has 8 cases in it. It is thus called the Median class. Assuming that these 8 frequencies are
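evenly distributed within the class interval 30 - 34 (exact limits 29.5 to 34.5), we may find the
median point which has to be 1 case above 29.5 (or 7 cases below 34.5).
There are 8 cases covering a space of 5 units so one case would take 5/8 of a unit. Hence the
median lies 5/8 of a unit above 29.5. In general,
$$\text{Median} = L + \frac{N/2 - f_b}{f} \times i$$
where L = exact lower limit of the median class, N = total number of cases, f_b = number of cases
below the median class, f = frequency of the median class and i = size of the class interval.
Using this formula for the previous example you can see that
$$\text{Median} = 29.5 + \frac{20 - 19}{8} \times 5 = 29.5 + \frac{5}{8} = 29.5 + 0.625 = 30.125 \approx 30.13$$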
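The interpolation can be written as a small function. This is a sketch only; the parameter names are illustrative and follow the usual reading of the symbols L, N, fb, f and i in the formula.

```python
def grouped_median(lower_exact, n, cum_below, freq, width):
    """Median of a grouped distribution by linear interpolation.

    lower_exact : exact lower limit of the median class (L)
    n           : total number of cases (N)
    cum_below   : number of cases below the median class (fb)
    freq        : frequency of the median class (f)
    width       : size of the class interval (i)
    """
    return lower_exact + (n / 2 - cum_below) / freq * width


median = grouped_median(lower_exact=29.5, n=40, cum_below=19, freq=8, width=5)
print(median)  # 30.125
```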
Limitations of median
Median is not dependent on all the observations and ignores their numerical values. It cannot
be used as the centre of gravity of the distribution. Also, it cannot be used for inferential statistical
analyses.
THE MEAN
Mean is calculated when the data are complete and presented on equal interval scale. It is
most popularly known as the 'Arithmetic Mean'. Mean provides an accurate description of the
sample and indirectly, that of the population. It is the sum of measurements divided by their
number.
$$\text{Mean} = \frac{\sum X}{N}$$
Where ∑X = Sum of all values
N = Number of cases
• Mean of a distribution of scores may be defined as the point on the scale of measurement
obtained by dividing the sum of all the scores by the number of scores.
Calculating mean for ungrouped data
When raw data are given the Mean is computed by adding all these values and dividing by
the total number.
Example : Compute Mean for the scores given below
25,36,18,29,30,41,49,26,16,27
$$\text{Mean} = \frac{\sum X}{N} = \frac{25+36+18+29+30+41+49+26+16+27}{10} = \frac{297}{10} = 29.7 \quad \text{(Answer)}$$
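The same calculation in Python, as a minimal sketch with illustrative variable names:

```python
scores = [25, 36, 18, 29, 30, 41, 49, 26, 16, 27]

total = sum(scores)   # sum of all values
n = len(scores)       # number of cases
mean = total / n

print(total, n, mean)  # 297 10 29.7
```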
Score (X)       18   20   24   35   42   48   50
Frequency (f)    2    4    3    8    6    4    3
$$\text{Mean} = \frac{\sum fX}{N}$$
Where X = Score
f = frequency
N = ∑f = total number of cases (frequencies)
Score (X)    f     fX
18           2     36
20           4     80
24           3     72
35           8     280
42           6     252
48           4     192
50           3     150
             N = 30    ∑fX = 1062
$$\text{Mean} = \frac{\sum fX}{N} = \frac{1062}{30} = 35.4$$
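A short Python sketch of the frequency-weighted mean, using the scores and frequencies listed above (variable names are illustrative):

```python
scores = [18, 20, 24, 35, 42, 48, 50]   # X
frequencies = [2, 4, 3, 8, 6, 4, 3]     # f

n = sum(frequencies)                                       # N = sum of f
sum_fx = sum(f * x for x, f in zip(scores, frequencies))   # sum of fX
mean = sum_fx / n

print(n, sum_fx, mean)  # 30 1062 35.4
```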
When grouped frequency distribution is given, the Mean is calculated using the above formula i.e.
$$\text{Mean} = \frac{\sum fX}{N}$$
Here an assumption is made that all frequencies are concentrated at the midpoint of the class
interval. So mid points of class intervals are used for scores.
Limitations of mean
Sometimes mean of a distribution is highly misleading especially when some of the
observations are too large or too small as compared to the others. Suppose you want to study the average
class size and there are 5 classes with 100 - 150 students, 10 classes having 50 to 100 students and
35 classes having 30 to 50 students each. Then the Mean of 55.5 would not represent the typical
case. Even within a class if 5 students' scores are 12, 15, 20, 25 and 100, the Mean of 34.4 can be
misleading. There are situations where mean may not provide meaningful information.
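Both figures can be reproduced with a brief sketch. It assumes, as for any grouped mean, that each group of classes is represented by the midpoint of its range; the median is shown alongside only for comparison.

```python
from statistics import mean, median

# Class-size example: midpoints stand in for each group of classes.
midpoints = [125, 75, 40]     # midpoints of 100-150, 50-100 and 30-50
class_counts = [5, 10, 35]    # number of classes in each group
grouped_mean = sum(f * x for x, f in zip(midpoints, class_counts)) / sum(class_counts)
print(grouped_mean)  # 55.5

# Five scores with one extreme value.
scores = [12, 15, 20, 25, 100]
print(mean(scores), median(scores))  # 34.4 20
```

Four of the five scores lie below the mean of 34.4, while the median of 20 sits in the middle of the group.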
The Characteristics of Mean, Median and Mode have been discussed in the preceding
sections besides mentioning some situations where they can be appropriately used. Mean, Median
and Mode differ from each other on various counts. These should be used as per the nature of the
data, indicated by the scale of measurement used, and the purpose in hand. However, the mean is a
more precise, reliable and stable measure. Its use should be avoided when data are skewed or
truncated. If some decision is to be taken on the face value of data, mode is the best measure. But
to have a suitable measure when data are incomplete or skewed, Median may be preferred. If further
statistical analysis is to be carried out we should go for Mean. It will not be desirable to consider
any one of them to be superior or inferior in all situations as it is rather contextual. We should
consult an expert if required.
Concept
The averages are representatives of a frequency distribution. But they fail to give a complete
picture of the distribution. They do not tell anything about the scatterness of observations within
the distribution.
Suppose that we have the distribution of the yields (kg per plot) of two paddy varieties from 5 plots
each. The distribution may be as follows
Variety I 45 42 42 41 40
Variety II 54 48 42 33 30
It can be seen that the mean yield for both varieties is about 42 kg (42 kg and 41.4 kg respectively), but we cannot say that the
performances of the two varieties are the same. There is greater uniformity of yields in the first variety
whereas there is more variability in the yields of the second variety. The first variety may be
preferred since it is more consistent in yield performance. From the above example it is obvious
that a measure of central tendency alone is not sufficient to describe a frequency distribution. In
addition to it we should have a measure of scatterness of observations. The scatterness or variation
of observations from their average is called dispersion. There are different measures of
dispersion like the range, the quartile deviation, the mean deviation and the standard deviation.
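A minimal Python sketch comparing the two varieties, using the yields as printed above. It shows nearly equal means alongside very different spreads (largest value minus smallest value).

```python
from statistics import mean

yields = {
    "Variety I": [45, 42, 42, 41, 40],
    "Variety II": [54, 48, 42, 33, 30],
}

for variety, values in yields.items():
    spread = max(values) - min(values)  # largest minus smallest yield
    print(variety, mean(values), spread)
```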
RANGE
This is the simplest possible measure of dispersion and is defined as the difference between
the largest and smallest values of the variable.
• In symbols, Range = L – S
Where L = Largest value.
S = Smallest value.
In individual observations and discrete series, L and S are easily identified.
In continuous series, the following two methods are followed.
Method 1
L = Upper boundary of the highest class
S = Lower boundary of the lowest class.
Method 2
Example 1
The yields (kg per plot) of a cotton variety from five plots are 8, 9, 8, 10 and 11. Find the range.
Solution
L=11, S = 8.
Range = L – S = 11- 8 = 3
Example 2
Solution
L = Upper boundary of the highest class = 75
S = Lower boundary of the lowest class = 60
Range = L – S = 75 – 60 = 15
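Both examples can be reproduced with two small helper functions. The function names are illustrative assumptions; `class_range` follows Method 1 and takes the boundaries quoted in Example 2.

```python
def value_range(values):
    """Range of individual observations: largest minus smallest."""
    return max(values) - min(values)


def class_range(upper_boundary_highest, lower_boundary_lowest):
    """Range of a continuous series (Method 1): boundary to boundary."""
    return upper_boundary_highest - lower_boundary_lowest


print(value_range([8, 9, 8, 10, 11]))  # 3
print(class_range(75, 60))             # 15
```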
Merits of range
1. It is simple to understand.
2. It is easy to calculate.
3. In certain types of problems like quality control, weather forecasts, share price analysis, etc.,
range is most widely used.
Demerits of range
1. It is very much affected by the extreme items.
2. It is based on only two extreme observations.
3. It cannot be calculated from open-end class intervals.
4. It is not suitable for mathematical treatment.
5. It is a very rarely used measure.
STANDARD DEVIATION
It is defined as the positive square-root of the arithmetic mean of the squares of the deviations
of the given observations from their arithmetic mean.
The standard deviation is denoted by s in case of sample and Greek letter σ (sigma) in case
of population.
The formula for calculating standard deviation is as follows
Example
Raw Data
The weights of 5 ear-heads of sorghum are 100, 102,118,124,126 gms. Find the standard
deviation.
Solution
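A hedged sketch of the calculation in Python. Following the definition given above, the first result divides the sum of squared deviations by N. The second uses the divisor N - 1, which is the common convention for a sample and is shown here as an assumption, since the divisor intended for this example is not stated.

```python
from math import sqrt
from statistics import pstdev, stdev

weights = [100, 102, 118, 124, 126]  # grams

n = len(weights)
mean = sum(weights) / n                              # 114.0
sum_sq_dev = sum((x - mean) ** 2 for x in weights)   # 600.0

sd_population = sqrt(sum_sq_dev / n)     # divisor N, as in the definition
sd_sample = sqrt(sum_sq_dev / (n - 1))   # divisor N - 1 (assumed sample convention)

print(round(sd_population, 3), round(sd_sample, 3))  # 10.954 12.247
```

The standard library functions `statistics.pstdev` and `statistics.stdev` give the same two values.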
Example
Continuous distribution
The frequency distribution of seed yield of 50 sesamum plants is given below. Find the standard
deviation.
Merits of standard deviation
1. It is rigidly defined and its value is always definite and based on all the observations and the
actual signs of deviations are used.
2. As it is based on arithmetic mean, it has all the merits of arithmetic mean.
3. It is the most important and widely used measure of dispersion.
4. It is possible for further algebraic treatment.
5. It is less affected by the fluctuations of sampling and hence stable.
6. It is the basis for measuring the coefficient of correlation and sampling.
VARIANCE
COEFFICIENT OF VARIATION
If we want to compare the variability of two or more series, we can use C.V. The series or
groups of data for which the C.V. is greater indicate that the group is more variable, less stable, less
uniform, less consistent or less homogeneous. If the C.V. is less, it indicates that the group is less
variable or more stable or more uniform or more consistent or more homogeneous.
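As an illustration, the comparison can be sketched in Python. The formula is not stated in the text, so this assumes the usual definition of C.V. as the standard deviation expressed as a percentage of the mean, with the divisor N for the standard deviation. The data are the paddy yields from the dispersion example.

```python
from statistics import mean, pstdev


def coefficient_of_variation(values):
    """C.V. in percent: standard deviation relative to the mean (assumed definition)."""
    return pstdev(values) / mean(values) * 100


variety_1 = [45, 42, 42, 41, 40]
variety_2 = [54, 48, 42, 33, 30]

cv1 = coefficient_of_variation(variety_1)
cv2 = coefficient_of_variation(variety_2)
print(round(cv1, 2), round(cv2, 2))
```

The second variety has the larger C.V., which matches the earlier observation that its yields are less consistent.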
QUARTILE DEVIATION
Concept
Quartiles: You know that median is that value of the variate which divides the total
frequency of the distribution into two equal parts. Quartiles may be defined as those values of the
variate which divide the total frequency into four equal parts. Quartiles are denoted by Q1, Q2 and
Q3. Q2 is the same as median. The value of the variate which divides the lower half below the
median, into two equal parts on the basis of frequency, is called Lower Quartile. It is denoted by
Q1. Similarly, when the upper half which is above the median, is divided on the basis of frequency
into two equal parts, the value of the variate is called Upper Quartile, denoted by Q3.
The difference between the two quartiles i.e. Q3-Q1, represents inter-quartile range. Half of
this difference, which is semi-interquartile range is called quartile deviation. Thus quartile
deviation, which is denoted by Q, is given by
$$Q = \frac{Q_3 - Q_1}{2}$$
For computing the value of Q, we have to first find the values of Q1 and Q3, i.e. we have to find
the points on the scale of measurement up to which 25% and 75% of the cases lie respectively. The
process of calculation of Q1 and Q3 is similar to the process of calculating the median, the only
difference being that for the median we consider N/2 cases, while for the Q1 and Q3 we have to
take N/4 and 3N/4 cases respectively.
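The procedure can be sketched with one interpolation function that takes the required fraction of cases (1/4, 1/2 or 3/4). The function name and the frequency distribution below are hypothetical and serve only to illustrate the steps.

```python
def point_below(fraction, lower_limits, freqs, width):
    """Point on the scale below which `fraction` of the cases lie.

    lower_limits : exact lower limits of the classes, in ascending order
    freqs        : frequency of each class
    width        : size of the class interval
    Assumes cases are evenly spread within each class.
    """
    target = fraction * sum(freqs)   # N/4, N/2 or 3N/4 cases
    cum_below = 0
    for lower, f in zip(lower_limits, freqs):
        if f > 0 and cum_below + f >= target:
            return lower + (target - cum_below) / f * width
        cum_below += f
    raise ValueError("fraction must lie between 0 and 1")


# Hypothetical distribution of 40 cases (not from the text).
lower_limits = [9.5, 14.5, 19.5, 24.5, 29.5, 34.5, 39.5, 44.5]
freqs = [2, 4, 6, 7, 8, 6, 4, 3]

q1 = point_below(0.25, lower_limits, freqs, 5)
q3 = point_below(0.75, lower_limits, freqs, 5)
q = (q3 - q1) / 2   # quartile deviation
print(round(q1, 2), round(q3, 2), round(q, 2))  # 22.83 37.0 7.08
```

Calling the same function with a fraction of 0.5 gives the median, which shows that the three calculations differ only in the number of cases counted off.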
Let us first discuss the use and limitations of quartile deviation as a measure of dispersion.
Quartile deviation is easy to calculate and interpret. It is independent of the extreme values, so it is
more representative and reliable than range. Wherever median is preferred as a measure of central
tendency, quartile deviation is preferred as measure of dispersion. However, like median, quartile
deviation is not amenable to algebraic treatment, as it does not take into consideration all the values
of the distribution.
While interpreting the value of quartile deviation it is better to have the values of Median,
Q1 and Q3 along with Q. If the value of Q is more, then the dispersion will be more, but again the
value depends on the scale of measurement. Two values of Q are to be compared only if the scale
used is the same. Q measured for scores out of 20 cannot be compared directly with Q for scores
out of 50. If median and Q are known, we can say that 50% of the cases lie between ‘Median – Q’
and ‘Median + Q’. These are the middle 50% of the cases. Here, we come to know about the range
of only the middle 50% of the cases. How the lower 25% of the cases and the upper 25% of the
cases are distributed, is not known through this measure. Sometimes, the extreme cases or values
are not known, in which case the only alternative available to us is to compute median and quartile
deviation as the measures of central tendency and dispersion. Through median and quartiles we can
infer about the symmetry or skewness of the distribution.
CONCEPT OF PERCENTILES
In case of median, total frequency is divided into two equal parts: in the case of quartiles,
total frequency is divided into four equal parts: similarly in case of percentiles, total frequency is
divided into 100 equal parts. Percentiles are denoted by P1, P2, P3, ..., P100. Thus, percentiles may
be defined as those values of the variate which divide the total frequency into 100 equal parts. So,
there are 1 percent of the cases below the point P1, 2 percent of the cases below the point P2, and
so on. As discussed earlier, Median is represented by P50 and the two quartiles Q1 and Q3 are
represented by P25 and P75 respectively. Similarly, first, second, third,…..ninth deciles are
represented by P10,P20,P30,…..P90 respectively.
Calculation of percentiles
For calculating the values of percentiles, we have to find the points on the scale of
measurement upto which the specified percent of cases lie. The process of calculating the
percentiles wherein we take into consideration the specified percent of cases is similar to that of
calculating the quartiles. Thus,
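$$R = \frac{P}{100} \times (N + 1)$$
where R is the rank (position) of the required percentile in the ordered data, P is the percentile
and N is the number of cases.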
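A minimal sketch of the rank formula applied to ungrouped scores. Two things here are assumptions rather than part of the text: when R is not a whole number the value is found by linear interpolation between the two neighbouring ranks, and R is kept within 1 to N. The scores are reused from the mean example purely for illustration.

```python
def percentile(sorted_scores, p):
    """Value at the p-th percentile using the rank R = p/100 * (N + 1).

    Assumption: a fractional rank is resolved by linear interpolation
    between the neighbouring ranks, and R is clamped to the range 1..N.
    """
    n = len(sorted_scores)
    r = p / 100 * (n + 1)
    r = min(max(r, 1), n)         # keep the rank inside the data
    lower_rank = int(r)           # whole part of the rank
    fraction = r - lower_rank
    value = sorted_scores[lower_rank - 1]
    if fraction and lower_rank < n:
        value += fraction * (sorted_scores[lower_rank] - value)
    return value


scores = sorted([25, 36, 18, 29, 30, 41, 49, 26, 16, 27])
print(percentile(scores, 50))  # 28.0
print(percentile(scores, 25))  # 23.25
```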
Interpretation of percentiles
Percentiles are more frequently used in testing and interpreting test scores. For any
standardized tests, percentile norms are reported with the test, so that the obtained test results may
be interpreted properly. If the percentile rank of an individual is 60, we come to know that 60% of
the students have scored less than that individual. If only the score of an individual is given, it is
difficult to judge the performance. It can be judged only with reference to a particular group.
However, with the help of the cumulative percentage curve, we can find the percentile rank of that
individual and judge the performance on that basis.
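The idea of a percentile rank can be sketched as the percentage of the reference group scoring below a given score. The function name and the reference group are illustrative assumptions.

```python
def percentile_rank(score, group_scores):
    """Percentage of the group scoring below `score`."""
    below = sum(1 for s in group_scores if s < score)
    return 100 * below / len(group_scores)


group = [25, 36, 18, 29, 30, 41, 49, 26, 16, 27]  # illustrative reference group
print(percentile_rank(30, group))  # 60.0
```

The same score of 30 would receive a different percentile rank in a stronger or weaker group, which is why the rank must always be read with reference to a particular group.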
Limitations of percentiles
The mastery of an individual is not judged by the use of percentiles, as the same person in a
poor group will show better rank and in an excellent group will show comparatively poorer rank.
Also, as in the case of simple ranks, the differences in percentile ranks at different intervals are not equal.
As an example, P100 – P90 is not comparable to P50 – P40. The position of a student on total
achievement cannot be calculated from percentiles given in several tests.
The distance of a score from a central point is called a deviation. The simplest way to take
into consideration the variation of all the values in a distribution is to find the mean of all the
deviations of these values from a selected point of central tendency. Usually, the deviation is taken
from the mean of the distribution. The average of the deviations of all values from the arithmetic
mean is known as mean deviation or average deviation.
The measure of central tendency is such a point on the scale of measurement on both sides
of which there are a number of values. So, the deviations from this point will be in opposite
directions, both positive and negative. If the score is denoted by X and the mean by M, then X - M
denotes the deviation of the score from the mean. The deviation, where mean is greater than the
score, will be negative. By definition of the mean, as measure of central tendency, the algebraic
sum of all these deviations will come out to be zero, as the positive and negative deviations cancel out.
To avoid this problem, the absolute values of these deviations, i.e. |X - M|, irrespective of their sign,
are taken into consideration. Thus,
$$\text{Mean Deviation} = \frac{\sum |X - M|}{N}$$
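A short sketch of the calculation, dividing the sum of absolute deviations by the number of cases as the definition of an average implies. The scores are reused from the mean example for illustration only.

```python
scores = [25, 36, 18, 29, 30, 41, 49, 26, 16, 27]

n = len(scores)
m = sum(scores) / n                     # mean M = 29.7
deviations = [x - m for x in scores]    # X - M, positive and negative

signed_sum = sum(deviations)            # effectively zero
mean_deviation = sum(abs(d) for d in deviations) / n

print(round(mean_deviation, 2))  # 7.44
```

The signed deviations add up to zero apart from floating-point rounding, which is why absolute values are needed.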
First of all, we should know when and where to use Mean Deviation as a measure of
dispersion. Mean Deviation is the simplest measure of dispersion that takes into account all the
values in a given distribution. It is easily comprehensible even by a person not well versed in
statistics, but it has some limitations also. First, as it takes into account the absolute values of
deviations, without considering the sign of the deviation, it is unwieldy in mathematical operations.
So, it is used only as a descriptive measure of variability. Second, it is influenced by extreme values.
But this influence is less than the influence on some other measures of dispersion which also take
into consideration all the values.
For interpreting the mean deviation, it is always better to look into it along with the mean
and the number of cases. Mean is required because the mean and the mean deviation are
respectively the point and the distance on the same scale of measurement. Without mean, the mean
deviation cannot be interpreted, as there is no clue for the scale of measurement or the unit of
measurement. The number of cases is important because the measure of dispersion depends on it.
For a smaller number of cases, the measure is likely to be larger.
Carefully look at the following hypothetical frequency distribution, which a teacher has
obtained after examining 150 students of class IX on a mathematics achievement test (see Table).
If you compute the values of Mean, Median and Mode, you will find that these three are
approximately the same (M = Md = Mo = 52).
This 'bell' shaped curve is technically known as the Normal Probability Curve or simply the Normal
Curve, and the corresponding frequency distribution of scores, having equal values of all three
measures of central tendency, is known as the Normal Distribution.
This normal curve has great significance in mental and educational measurement. In
measurement of behavioral aspects, the normal probability curve has been often used as reference
curve.
The reasons why distributions exhibit skewness and kurtosis are numerous and often
complex, but a careful analysis of the data can often throw some light on the asymmetry. Some of
the common causes are:
Normal Curve has great significance in the mental measurement and educational evaluation.
It gives important information about the trait being measured.
❖ A good deal of evidence has accumulated to show that the normal distribution provides a good fit
to, or describes, the frequencies of occurrence of many variables and facts in (i) biological
statistics e.g. sex ratio in births in a country over a number of years, (ii) the anthropometrical
data e.g. height, weight, (iii) wages and output of large numbers of workers in the same
occupation under comparable conditions, (iv) psychological measurements e.g. intelligence,
reaction time, adjustment, anxiety and (v) errors of observations in Physics, Chemistry and
other Physical Sciences.
❖ The Normal Distribution is of great value in educational evaluation and educational
research, when we make use of mental measurement. It may be noted that normal
distribution is not an actual distribution of scores on any test of ability or academic
achievement, but is instead, a mathematical model. The distribution of test scores approaches
the theoretical normal distribution as a limit, but the fit is rarely ideal and perfect.
There are a number of applications of the normal curve in the field of educational measurement
and evaluation. These are:
❖ to determine the percentage of cases (in a normal distribution) within given limits or scores
❖ to determine the percentage of cases that are above or below a given score or reference point
❖ to determine the limits of scores which include a given percentage of cases
❖ to determine the percentile rank of a student in his own group
❖ to find out the percentile value of a student's percentile rank
❖ to compare the two distributions in terms of overlapping
❖ to determine the relative difficulty of test items, and
❖ to divide a group into sub-groups according to certain ability and assign the grades.
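The first few applications can be sketched with the `NormalDist` class from Python's standard `statistics` module. The mean of 52 follows the example above; the standard deviation of 10 is a hypothetical value chosen only for illustration.

```python
from statistics import NormalDist

# Mean of 52 follows the example in the text; the standard deviation of 10
# is a hypothetical value chosen only for illustration.
scores = NormalDist(mu=52, sigma=10)

within = scores.cdf(62) - scores.cdf(42)   # proportion of cases between 42 and 62
below_65 = scores.cdf(65)                  # proportion of cases below a score of 65
p90 = scores.inv_cdf(0.90)                 # score below which 90% of the cases lie

print(round(within * 100, 1), round(below_65 * 100, 1), round(p90, 1))
# 68.3 90.3 64.8
```

Under these assumptions a student scoring 65 would have a percentile rank of about 90, and about 68% of the cases fall within one standard deviation of the mean.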
To illustrate what we mean by a relationship between two variables, let us use the example
cited in 16.1 i.e. the scores of 5 students in mathematics and physics. What pattern do you find in
the data? You may notice that in general those students who score well in mathematics also get
high scores in physics. Those who are average in mathematics get just average scores in physics
and those who are poor in mathematics get low scores in physics. In short, in this case there is a
tendency for students to score at par on both variables. Performance on the two variables is related;
in other words the two variables are related, hence co-vary.
If the change in one variable appears to be accompanied by a change in the other
variable, the two variables are said to be co-related and this inter-dependence is called
correlation.
CO-EFFICIENT OF CORRELATION
The co-efficient of correlation is always symbolized either by r or ρ (rho). The notation 'r' is
known as product moment correlation co-efficient or Karl Pearson's Coefficient of Correlation. The
symbol 'ρ' (rho) is known as Rank Difference Correlation Coefficient or Spearman's Rank
Correlation Coefficient.
Types of correlation
In a bivariate distribution, the correlation may be:
1. Positive, Negative or Zero; and
2. Linear or Curvilinear (Non-Linear)
$$\rho = 1 - \frac{6\sum D^2}{N(N^2 - 1)}$$
Where: ρ = The Spearman's Rank Co-efficient of Correlation
D = Difference between paired Ranks
N = Number of subjects or items ranked
Example
In a speech contest, Prof. Mehrotra and Prof. Shukla judged ten pupils. Their judgments
were in ranks, which are presented below. Determine the extent to which their judgments were in
agreement.
Table
$$\rho = 1 - \frac{6 \sum D^2}{N(N^2 - 1)} = 1 - \frac{6 \times 28}{10(10^2 - 1)} = 1 - \frac{168}{990} = 1 - 0.17 = +0.83$$
The value of the co-efficient of correlation is +.83. This shows a high degree of agreement
between the two judges.
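The arithmetic above can be checked with a minimal Python sketch. The function names are illustrative, and because the table of ranks is not reproduced here, the two lists of ranks at the end are hypothetical and are not the judges' actual rankings. The formula as written also assumes there are no tied ranks.

```python
def spearman_rho_from_d2(sum_d2, n):
    """Spearman's rho from the sum of squared rank differences and N."""
    return 1 - (6 * sum_d2) / (n * (n ** 2 - 1))


def spearman_rho(ranks_x, ranks_y):
    """Spearman's rho from two lists of untied ranks for the same subjects."""
    if len(ranks_x) != len(ranks_y):
        raise ValueError("both judges must rank the same number of subjects")
    # D is the difference between the paired ranks of each subject.
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_x, ranks_y))
    return spearman_rho_from_d2(sum_d2, len(ranks_x))


# Values quoted in the worked example: sum of D squared = 28, N = 10.
rho_example = spearman_rho_from_d2(28, 10)
print(round(rho_example, 2))  # 0.83

# Hypothetical ranks given by two judges to five pupils (illustration only).
judge_a = [1, 2, 3, 4, 5]
judge_b = [2, 1, 3, 5, 4]
rho_hypothetical = spearman_rho(judge_a, judge_b)
print(round(rho_hypothetical, 2))  # 0.8
```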
The most often used and most precise coefficient of correlation is known as Pearson's
Product-Moment Coefficient. It is computed when the data are expressed in interval or ratio form and
the X and Y variables have a linear relationship. Here a linear relationship means that, if we plot the
paired scores on a graph with the X variable on the X-axis and the Y variable on the Y-axis, the
points should lie along a straight line.
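The formula used for computing Pearson's coefficient of correlation is:
$$r = \frac{N \sum XY - \sum X \sum Y}{\sqrt{\left[N \sum X^2 - (\sum X)^2\right]\left[N \sum Y^2 - (\sum Y)^2\right]}}$$
where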
r = Pearson's Coefficient of Correlation
∑X = Sum of the Scores of X Variable
∑Y = Sum of the Scores of Y Variable
∑X² = Sum of the Squared X Scores
∑Y² = Sum of the Squared Y Scores
∑XY = Sum of the Product of Paired X and Y Scores
N = Number of Paired Scores
Example
The scores given below were obtained on an Intelligence Test and Algebra Test by 10
students of class VIII. Compute Pearson's Coefficient of Correlation.
Table
The steps in computing 'r' from ungrouped scores may be outlined thus:
Step 1 : Find the sum of the scores of the X and Y variables, i.e. ∑X and ∑Y.
Step 2 : Square each score of the X variable and find their sum, i.e. ∑X² (Col. 4)
Step 3 : Square each score of the Y variable and find their sum, i.e. ∑Y² (Col. 5)
Step 4 : Multiply the X scores and Y scores in the same rows, and enter these
products in the column XY, i.e. Col. 6; and get the sum of XY, i.e. ∑XY
Step 5 : Put all the values of N, ∑X, ∑Y, ∑X², ∑Y² and ∑XY in the formula, and simplify.
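The five steps can be sketched in Python as follows. This is a minimal illustration that assumes the usual raw-score form of the formula, r = (N∑XY − ∑X∑Y) / √([N∑X² − (∑X)²][N∑Y² − (∑Y)²]). The function name and the two short lists of scores are hypothetical; they are not the Intelligence Test and Algebra Test scores of the example.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson's r from ungrouped paired scores, following Steps 1 to 5."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired lists with at least two scores")
    n = len(x)
    sum_x = sum(x)                               # Step 1: sum of X
    sum_y = sum(y)                               # Step 1: sum of Y
    sum_x2 = sum(a * a for a in x)               # Step 2: sum of squared X
    sum_y2 = sum(b * b for b in y)               # Step 3: sum of squared Y
    sum_xy = sum(a * b for a, b in zip(x, y))    # Step 4: sum of products XY
    # Step 5: substitute the sums in the raw-score formula and simplify.
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    if denominator == 0:
        raise ValueError("r is undefined when one variable has no variability")
    return numerator / denominator


# Hypothetical paired scores of five students (illustration only).
x_scores = [1, 2, 3, 4, 5]
y_scores = [2, 4, 5, 4, 5]
r_value = pearson_r(x_scores, y_scores)
print(round(r_value, 2))  # 0.77
```

For these hypothetical scores the sums are N = 5, ∑X = 15, ∑Y = 20, ∑X² = 55, ∑Y² = 86 and ∑XY = 66, so r = 30 / √1500, which is about .77.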
Learners should also be aware of the following factors which influence the size of the
coefficient of correlation and can lead to misinterpretation:
❖ The size of "r" is very much dependent upon the variability of measured values in the
correlated sample. The greater the variability, the higher will be the correlation, everything
else being equal.
❖ The size of "r" is altered, when an investigator selects an extreme group of subjects in order
to compare these groups with respect to certain behavior. "r" obtained from the combined
data of extreme groups would be larger than the "r" obtained from a random sample of the
same group.
❖ Adding or dropping extreme cases can change the size of "r".
Adding extreme cases may increase the size of the correlation, while dropping the
extreme cases will lower the value of "r".
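The effect of extreme cases can be seen in a small Python sketch. The scores below are hypothetical and were chosen only to make the point visible; the function uses the deviation form of Pearson's formula, which is algebraically equivalent to the raw-score form.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson's r from paired scores, using deviations from the means."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sum_dxdy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_dx2 = sum((a - mean_x) ** 2 for a in x)
    sum_dy2 = sum((b - mean_y) ** 2 for b in y)
    return sum_dxdy / sqrt(sum_dx2 * sum_dy2)


# Hypothetical scores of seven students; the first and last are extreme cases.
x_all = [10, 40, 45, 50, 55, 60, 90]
y_all = [12, 48, 42, 55, 47, 58, 88]

r_all = pearson_r(x_all, y_all)                  # extreme cases included
r_middle = pearson_r(x_all[1:-1], y_all[1:-1])   # extreme cases dropped

print(round(r_all, 3), round(r_middle, 3))
```

With the two extreme students included the coefficient is close to .98; with them dropped, the remaining five students give a coefficient of about .61, even though the middle scores are unchanged.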
Correlation is one of the most widely used analytic procedures in the field of Educational
Measurement and Evaluation. It not only describes the relationship of paired variables, but it is also
useful in:
• Prediction of one variable (the dependent variable) on the basis of the other variable (the
independent variable).
• Determining the reliability and validity of the test or the question paper.
• Determining the role of various correlates to a certain ability.
• Factor analysis techniques for determining the factor loadings of the underlying variables in
human abilities.
The data which has been shown in the tabular form may be displayed in pictorial form by
using a graph. A well-constructed graphical presentation is the easiest way to depict a given set of
data.
Types of graphical representation of data
Here only a few of the standard graphic forms of representing the data are being discussed
as listed below:
• Histogram
• Bar Diagram or Bar Graph
• Frequency Polygon
• Cumulative Frequency Curve or Ogive
Histogram
The most common form of graphical presentation of data is the histogram. For plotting a
histogram, one has to take a graph paper. The values of the variable are taken on the horizontal
axis/scale known as the X-axis and the frequencies are taken on the vertical axis/scale known as the
Y-axis. For each class interval (C.I.) a rectangle is drawn with the base equal to the length of the class
interval and the height according to the frequency of the C.I. When the C.I.s are of equal length, which
would generally be the case in the type of data you are likely to handle in school situations, the
heights of the rectangles must be proportional to the frequencies of the class intervals. When the C.I.s
are not of equal length, the areas of the rectangles must be proportional to the frequencies indicated
(most likely you will not face this type of situation). As the C.I.s for any variable are in continuity,
the bases of the rectangles also extend from one boundary to the other in continuity. These
boundaries of the C.I.s are indicated on the horizontal scale. The frequencies for determining the
heights of the rectangles are indicated on the vertical scale of the graph.
Let us prepare a histogram for the frequency distribution of the mathematics scores of 120 Class
X students (Table).
For this, on the horizontal axis of the graph one has to mark the boundaries of the class
intervals, starting from the lowest, which is 34.5 to 39.5. So the points on the X-axis will be 34.5, 39.5,
44.5, 49.5, ..., 99.5. Now on the vertical axis of the graph, the frequencies from 1 to 14 are to be
marked. The height of the graphical presentation is usually taken as 60 to 75% of the width. Here,
we take 1 cm on the X-axis representing 5 scores and 1 cm on the Y-axis representing a frequency of 2.
For plotting the first rectangle, the base to be taken is 34.5 - 39.5 and the height is 7; for the second,
the base is 39.5 - 44.5 and the height is 8, and so on.
The histogram will be as shown in Figure.
Fig.: Distribution of Mathematics Scores
Let us re-group the data of Table 12.8 by having the length of class intervals as 10, as shown
in Table.
Class Interval Frequency
90 - 99 11
80 - 89 18
70 - 79 20
60 - 69 25
50 - 59 21
40 - 49 18
30 - 39 7
Total 120
To plot the histogram, we mark the boundaries of the class intervals on the X-axis. Here the
points will be 29.5, 39.5, 49.5, ..., 99.5. On the Y-axis, the frequencies to be marked are from
1 to 25. On the X-axis, a distance of 1 cm represents a score of 10, while on the Y-axis, 1 cm represents a
frequency of 5. The histogram will be as shown in Figure
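As a rough check on the plotting data, the class boundaries and bar heights for the re-grouped distribution can be produced with a short Python sketch. The function name and the text-only display are illustrative choices; the sketch assumes whole-number scores, so each boundary lies 0.5 beyond the stated class limit.

```python
# Re-grouped distribution: (lower limit, upper limit, frequency).
intervals = [
    (30, 39, 7), (40, 49, 18), (50, 59, 21), (60, 69, 25),
    (70, 79, 20), (80, 89, 18), (90, 99, 11),
]


def class_boundaries(rows):
    """Return the points to mark on the X-axis, from lowest to highest."""
    lower_boundaries = [low - 0.5 for low, _, _ in rows]
    last_upper_boundary = rows[-1][1] + 0.5
    return lower_boundaries + [last_upper_boundary]


boundaries = class_boundaries(intervals)
total = sum(freq for _, _, freq in intervals)
print(boundaries)
print("Total:", total)

# A text-only histogram: one '#' per case, so bar length is proportional
# to the frequency of the class interval.
for low, high, freq in intervals:
    print(f"{low - 0.5:5.1f} - {high + 0.5:5.1f} | {'#' * freq} ({freq})")
```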
If we compare the two figures, we find that the second figure is simpler than the first. The first
figure is complex because it has more class intervals. If we further increase the number of class
intervals, the figure obtained will be still more complex. So for plotting the histogram for given
data, we usually prefer to have fewer class intervals.
Bar Diagram or Bar Graph
For a discrete variable the unit of measure on the horizontal axis is not important. Neither
are the classes related to each other. So the bars are equally spaced and are of equal width on the
horizontal axis. However, the heights of the bars are proportionate to the respective frequencies. Bar
graphs are frequently used for pictorial presentation of discrete data. Even if two variables are used
simultaneously, bar graphs may be quite effective. For example, if along with the total
number of schools (management-wise) the numbers of boys' schools, girls' schools and co-ed schools
are also to be indicated, then this can be done on the same graph paper by using different colours,
each indicating one category. For each management there will be 4 bars having different
colours indicating the different categories.
Frequency polygon
For plotting a frequency polygon, as in case of histogram, the values of the variable are taken
on the horizontal axis of the graph and the frequencies are taken on the vertical axis of the graph.
In the case of a frequency polygon, one has to indicate the mid points of the C.I.s on the horizontal
axis, instead of indicating the boundaries of the intervals. Here the mid points of the intervals just
before the lowest interval and just after the highest interval are also to be indicated. Now by taking
the mid points one by one, the points above them are to be plotted corresponding to the frequencies
of the intervals. In case of the two additional mid points, the frequency being zero, the points to be
plotted are on the X-axis itself. The adjoining points so plotted are to be joined by straight line
segments.
Let us again consider the frequency distribution of mathematics scores shown in Table 12.9
and prepare the frequency polygon for the same. The mid points of the C.I.s are respectively
34.5, 44.5, 54.5, ..., 94.5. The two additional mid points required are 24.5 and 104.5. Now on the
horizontal axis of the graph locate the points 24.5, 34.5, 44.5, ..., 94.5, 104.5 as shown in Figure.
Take the points above the plotted points by taking the heights as 0, 7, 18, 21, 25, 20, 18, 11 and
0 respectively. Join these points in a sequence. The frequency polygon obtained will be as shown
in Figure.
Alternatively, if we join the mid points of the tops of the rectangles of the histogram and extend
the line to one interval on either end of the figure with zero frequency, the figure so obtained will be
the frequency polygon.
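The points of the frequency polygon described above can be generated with a minimal Python sketch. The function name is illustrative, and the sketch assumes class intervals of equal length with whole-number limits.

```python
# Distribution of mathematics scores: (lower limit, upper limit, frequency).
intervals = [
    (30, 39, 7), (40, 49, 18), (50, 59, 21), (60, 69, 25),
    (70, 79, 20), (80, 89, 18), (90, 99, 11),
]


def polygon_points(rows):
    """Return (mid point, frequency) pairs, closed with zero-frequency ends."""
    length = rows[0][1] - rows[0][0] + 1          # e.g. 30-39 has length 10
    mids = [(low + high) / 2 for low, high, _ in rows]
    freqs = [freq for _, _, freq in rows]
    # One extra mid point before the lowest and after the highest interval,
    # both with frequency zero, so that the polygon touches the X-axis.
    xs = [mids[0] - length] + mids + [mids[-1] + length]
    ys = [0] + freqs + [0]
    return list(zip(xs, ys))


points = polygon_points(intervals)
for mid_point, frequency in points:
    print(mid_point, frequency)
```

Joining consecutive points with straight line segments gives the polygon; the first and last points are (24.5, 0) and (104.5, 0).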
The primary purpose of the frequency polygon is to show the shape of the distribution. When
two or more frequency distributions are to be compared, the relative frequency polygons are
constructed against the same set of axes. Any difference in the shape of these distributions becomes
visible. In this respect the frequency polygon has an advantage over the histogram.
For getting the cumulative frequency of a C.I., we take the cumulative frequency up to the
previous interval and add the frequency of that interval to it. Here C.F. indicates that up to 39.5
there are 7 cases, up to 49.5 there are 25 cases, up to 59.5 there are 46 cases, and so on. The difference
between the construction of the frequency polygon and the ogive is that for the frequency polygon, one
takes the mid points of the C.I.s on the horizontal axis, while for the ogive one takes the upper boundaries of
the C.I.s on the horizontal axis. Again, on the vertical axis, in the case of the ogive one takes the cumulative
frequency/cumulative percentage instead of the frequency only. The cumulative frequency curve or
Ogive for the data given in Table 12.10 will be as shown in Fig.
Fig. : Cumulative Frequency Curve or Ogive
In Fig., the curve starts from 29.5 (0 Cumulative Frequency) and moves up to 99.5 (120
C.F.). In this case the points have been joined in a sequence with a smooth curve, instead of
straight line segments. From the ogive we can easily find a point on the horizontal axis up to which the
specified number of cases or the specified percentage of cases will be available. The only difference
between the cumulative frequency curve and the ogive is that for the cumulative frequency curve, on the
vertical axis, we take cumulative frequencies, while in the case of the ogive we also have to take cumulative
percentages.
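The cumulative frequencies, the cumulative percentages and the reading of a score from the ogive can be sketched in Python. This is only an approximation of reading the graph: the function name is illustrative, and the sketch joins the points with straight line segments rather than the smooth curve of the figure.

```python
from itertools import accumulate

# Frequencies of the intervals 30-39, 40-49, ..., 90-99 and their upper boundaries.
frequencies = [7, 18, 21, 25, 20, 18, 11]
upper_boundaries = [39.5, 49.5, 59.5, 69.5, 79.5, 89.5, 99.5]

cumulative = list(accumulate(frequencies))
total = cumulative[-1]
cumulative_pct = [round(100 * cf / total, 1) for cf in cumulative]

# The ogive starts at the lowest boundary (29.5) with zero cumulative frequency.
ogive_points = [(29.5, 0)] + list(zip(upper_boundaries, cumulative))


def score_below_which(cases, points):
    """Score up to which the given number of cases lies (linear interpolation)."""
    for (x0, c0), (x1, c1) in zip(points, points[1:]):
        if c0 <= cases <= c1:
            return x0 + (cases - c0) / (c1 - c0) * (x1 - x0)
    raise ValueError("cases must lie between 0 and the total frequency")


print(cumulative)
print(cumulative_pct)
middle_score = score_below_which(total / 2, ogive_points)  # 60 of the 120 cases
print(round(middle_score, 1))
```

For this distribution, half of the 120 cases lie below a score of about 65.1 when the points are joined by straight lines.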
CCE - The scientific method tries to determine the strengths and weaknesses of students,
improve students' acquisition levels, and strengthen school teamwork and societal co-operation.
Students' interaction and behaviour are well taken care of along with academics. At the core of the
new educational vision is the objective of making the learning process joyful for the child.
When the child takes greater responsibility for his/her own learning and is given the
freedom to experiment and explore, the learning process can be made exciting and
meaningful to each learner. Thus learning can be de-stressed by introducing alternatives to
homework in smaller classes and by gradual elimination of the pass/fail criterion along with the
introduction of grades.
Inferences
Generally, inference means a conclusion made on the basis of evidence or reasoning. In
assessment, observations occur when we can see something happening. In contrast, inferences
are what we figure out based on an experience. Helping students understand when information is
implied, or not directly stated, will improve their skill in drawing conclusions and making inferences.
These skills will be needed for all sorts of school assignments, including reading, science and social
studies. Inferential thinking is a complex skill that will develop over time with experience.
Diagnosis
No one source of data can be sufficient to assess what a pupil knows about school-related
content. What is called for is a triangulation of several kinds of data drawn from various types of
tests: standardized tests of achievement and aptitude, teacher-made quizzes, observations of
behavior, and the like. Diagnosis does not necessarily mean prescription unless the data collected
have demonstrated high reliability and validity.
Feedback
Good feedback generally focuses on behavior or the outcomes of behavior rather than on the
inherent characteristics of the person concerned. It leaves that person feeling positive and able to
move forward. The timing of the feedback is important. It needs to be given as soon as possible
after the event. The greater the delay, the less likely it is that the student will find it useful or be
able or inclined to act on it.
The test scores help the students to know their strengths and weaknesses in the respective
subjects. They provide feedback to the students. They also provide a basis for checking the adequacy of
their own progress in a particular subject, as well as their study habits, interests, home influence etc.,
which influence their performance.
Based on the test scores, teachers may draw inferences about the success of the instructional process
adopted by them. They may also provide more appropriate instructional guidance for individual students or
the class as a whole.
Portfolio
E-Portfolio Assessment
Portfolio creation is the responsibility of the learner, with teacher guidance and support and
often with the involvement of peers and parents.
A portfolio provides samples of the student’s work which show growth over time.
The criteria for selecting and assessing the portfolio contents must be clear to the teacher
and the students at the beginning of the process.
The entries in the portfolio can demonstrate learning and growth in all learning
competencies.
Strengths of E-Portfolio Assessment
Embedded in instruction
Portfolios provide teachers with a tool for showing what, how, and how well students learn
both intended and incidental outcomes. They provide students and teachers with creative,
systematic, and visionary ways to learn, assess, and report skills, processes, and knowledge.
Promotes a shift in ownership; students take an active role in examining what they have done
and what they want to accomplish.
Offers the possibility of assessing the more complex and important aspect of a learning area
or subject matter; and
Covers a broad scope of knowledge and information from many different people involved
in the assessment of students’ learning and achievement.
Limitations of E-Portfolio Assessment
It can be very time consuming for teachers to organize and evaluate the content of portfolios.
A portfolio can be just a miscellaneous collection of artifacts that do not show patterns of
growth and achievement.
Data from portfolio assessments can be difficult to analyze or aggregate to show change.
Cover Letter
Table of Contents
Entries
Dates
Drafts
Reflections
Descriptive Model
Evaluative Model
Meaning
• A Rubric is “a scoring tool that lists the criteria for a piece of work”. (Goodrich H.)
• A Rubric is a scoring guide used to evaluate a student’s performance based on the sum of a
full range of criteria rather than a single numerical score.
• A Rubric is a formative type of assessment because it becomes an ongoing part of the whole
teaching and learning process.
• Rubrics improve student performance by clearly showing the student how their work will be
evaluated and what is expected.
• Rubrics help students become better judges of the quality of their own work.
• Rubrics promote student awareness about the criteria to use in assessing peer performance.
• Rubrics provide useful feedback to the teacher regarding the effectiveness of the instruction.
• Rubrics reduce the amount of time teachers spend evaluating student work.
The example in the table lists the criteria and gradations of quality for verbal, written, or graphic
reports on students' inventions: for instance, inventions designed to ease the westward journey for
19th century pioneers, or to solve a local environmental problem, or to represent an
imaginary culture and its inhabitants, or anything else students might invent.
The rubric could easily include criteria related to presentation style and effectiveness, the
mechanics of written pieces, and the quality of the invention itself.
The four columns to the right of the criteria describe varying degrees of quality, from
excellent to poor. As concisely as possible, these columns explain what makes a good piece of work
good and a bad one bad.