Sample
Sample
TECHNIQUES
Structure
8.0 Objectives
8.1 Introduction
8.2 Concept of Measurement
8.3 Measurement Issues in Research
8.4 Scales of Measurement
8.5 Criteria for Good Measurement
8.5.1 Reliability
8.5.2 Validity
8.5.3 Practicality
8.6 Errors in Measurements
8.7 Scaling Techniques
8.7.1 Comparative Scaling Techniques
8.7.2 Non-Comparative Scaling Techniques
8.8 Let Us Sum Up
8.9 Key Words
8.10 Some Useful Books
8.11 Answers or Hints to Check Your Progress Exercises
8.0 OBJECTIVES
After going through this Unit, you will be able to:
• state the concept of measurement and its need;
• explain the various scales of measurement;
• discuss the various scaling techniques; and
• identify the criteria for good measurement.
8.1 INTRODUCTION
Measurement is the foundation of scientific enquiry. As we know that research
begins with a ‘problem’ or topic. Thinking about a problem results in
identifying concepts that capture the phenomenon being examined. Concepts
are mental ideas representing the phenomenon. The process of
conceptualization involves defining the concepts abstractly in theoretical terms.
Operationalisation of concepts for research purpose involves moving from the
abstract to the empirical level. This underlines the need to measure the various
attributes of the people, the characteristics of objects or phenomenon. Further
issues like decent work, human wellbeing, happiness, quality of education etc.
which have several qualitative and quantitative dimensions are emerging
important issues of research in economics. Such issues are being probed
53
Research Design and through development of indicators and composite indexes which necessarily
Measurement involve measurement of such indicators. Hence, a student of economics is
expected to be well versed in the measurement scales, criteria of good
measurement, and important scaling techniques. This unit throws light on these
issues. Let us begin to discuss the concept of measurement.
1) Nominal Scale
A qualitative scale without order is called nominal scale. In nominal scale
numbers are used to name identity or classify persons, objects, groups,
gender, industry type. Even if we assign unique numbers to each value, the
numbers do not really mean anything. This scale neither has any specific
order nor it has any value. In case of nominal measurement, statistical
analysis is attempted in terms of counting or frequency, percentage,
proportion, mode or coefficient of contingency. Addition, subtraction,
multiplication and division are not possible under this scale.
2) Ordinal scale
A qualitative scale with order is called ordinal scale. In ordinal scale
numbers denote the rank order of the objects or the individuals. Numbers
are arranged from highest to lowest or lowest to highest. For example,
students may be ranked Ist, 2nd and 3rd in terms of their academic
achievement. The statistical operations which can be applied in ordinal
measurement are median, percentiles and rank correlation coefficient.
Ordinal scales do not provide information about the relative strength of
ranking. This scale does not convey that the distance between the different
rank values is equal. Ordinal scales are not equal interval measurement.
Further this scale does not incorporate absolute zero point.
3) Interval scale
Interval scale includes all the characteristics of the nominal and ordinal
scale of measurement. In interval scale numerically equal distance on the
scale indicate equal distances in the attributes of the object being measured.
For example, a scale represent marks of students using the attributes range
0 to 10, 10 to 20, 20 to 30, 30 to 40 and 40 to 50, and so forth. The mid
point of each range (ie. 5, 15, 25, 35 and 45 etc.) are equidistance from
each other. The data obtained from an interval scale is known as interval
data. The appropriate measures used in this scale are: arithmetic mean,
standard deviation, Karl Pearson’s coefficient of correlation and tests like t-
test and f-test. We cannot apply coefficient of variation in the interval scale.
55
Research Design and 4) Ratio Scale
Measurement
Ratio scale has all the characteristics of nominal, ordinal and interval scale.
It also possesses a conceptually meaningful zero point in which there is a
total absence of the characteristic being measured. Ratio scales are common
among physical sciences rather than among social sciences. The examples
of ratio scales are the measures of height, weight, distance and so on. All
measures of central tendencies that can be used in this scale include
geometric and harmonic means.
You will notice in the above tables that only the ratio scale meets the criteria for
all four properties of scales of measurement. Interval and Ratio scale data are
sometimes referred to as parametric and Nominal and Ordinal data are referred to
as non-parametric.
Examples of Scales of Measurement –
Scale Example Statistics
Nominal Gender Frequency, percentage, proportion Mode,
Yes-no Coefficient of contingency, chi-square (x2)
Students roll number
Objects/Groups
Ordinal Class Rank, Median, percentile, rank, correlation,
Socio Economic status Range
Academic Achievement
Interval Student grade point, Mean, correlation, Range, Standard
Temperature, Calendar deviation, rank order variance, Karl
dates, Rating Scale pearson’s correlation, t-test, f-test etc.
Ratio Weight, Height, Salary, Mean, Median, Mode, Range, Variance,
Frequency of buying a Standard deviation, coefficient of variation,
product rank order variance, Karl pearson’s
correlation, t-test, f-test
56
Measurement and
8.5 CRITERION FOR GOOD MEASUREMENT Scaling Techniques
8.5.1 Reliability
Reliability refers to the degree to which the measurement or scale is consistent
or dependable. If we use same construct again and again for measurement, it
would lead to same conclusion. Reliability is consistency in drawing conclusion.
. . ..
. . ..
Reliable and Valid Valid but not Reliable Reliable but not valid
8.5.2 Validity
Validity refers to the extent an instrument or scale tests or measures what it
intends to measure. This means validity is the extent to which differences
found with a measuring instrument reflect true differences among those being
tested. This can be done by seeking other relevant evidences. There are three
types of validity: content validity; criterion validity and construct validity.
Content validity indicates the extent to which it provides coverage of the issues
under study. Criterion validity examines how well a given measure relates to
one or more external criterion based on empirical observations. And, construct
validity explains the variation observed in tests conducted on several
individuals. Construct validity is closely related with factor analysis.
8.5.3 Practicality
From practical view point, measure should be economical, convenient and
interpretable. The measure should not be lengthy and difficult. The measure
can be done by highly specialized persons.
• Unreliability
57
Research Design and Unreliability refers to inconsistency in what the measure produces under
Measurement repeated uses. If any measure, on average give some score for a case on
variable and gives other score, when it is used again, it is said to be unreliable.
58
Measurement and
8.7 SCALING TECHNIQUES Scaling Techniques
Scaling Techniques
Comparative Non-comparative
Scaling Techniques Scaling Techniques
Example
Rank the following factors in order of your importance in choosing mobile
services (Assign 1 to most important and 5 to least important factor. Please do
not repeat the ranks.)
Factor: Price, connectivity, call drop, Internet, Download speed.
Rank :……… ……… …… ……. ……….
There are various comparative scaling techniques. Three most commonly used
comparative scaling techniques are discussed here.
a) Paired comparison
b) Rank order
c) Constant sum
a) Paired Comparison
The paired comparison is most commonly used scaling technique. This
method is simply a binary choice. In this method respondents choose the
stimulus or items in each pair that has the greater magnitude on the choice
dimension they are instructed to use. This method is used when the study
requires to distinguish between the two objects. The data obtained in this
method is ordinal in nature.
59
Research Design and Example
Measurement
In a study of consumer preferences for the two brands of milk product i.e.
Amul and Mother Dairy, a consumer is asked to indicate which one of the
two brands he would prefer for personal use:
1) Which milk product of the following you prefer on the basis of taste.
Please tick mark (√).
2) Which milk product will you prefer on the basis of packaging? Please
tick mark (√).
3) Which product will you prefer on the basis of price? Please tick mark (√).
n [n - 1]
Number of comparison = ;
2
n= number of objects.
b) Rank Order
This method is very popular among researchers and provides ordinal data.
In this method, respondents are provided various objects and asked to rank
the objects in the list. Rank order method is less time consuming. In this
method, if there are n objects, only (n-1) decisions need to be made.
Respondent can easily understand the instructions for ranking. This
technique produces ordinal data.
Example
Rank the various brands of mobile phone in order of preference. The most
preferred can be ranked 1, the next as 2 and so on. The least preferred will
have the last rank. No two brands should receive the same rank number.
60
Measurement and
Scaling Techniques
c) Constant Sum
In constant sum scaling, respondents are asked to allocate a constant sum
of units to a specific set of objects on the basis of pre-defined criterion. If
an object is not important, the respondent can allocate zeropoint and if an
object is most important may allocate maximum points out of the fixed
points. The total fixed points are 100. The total may be taken as some
other value depending on the study.
Example
Allocate preferences regarding a movie based on various predefined
criteria. Respondents were asked to rate each criteria in such a way that the
total becomes 100.
The data obtained in this method is considered an ordinal scale. The advantage
of this method is that it provides fine discrimination among objects without
consuming too much time. However, allocation over large number of objects
may create confusion for the respondent. This method cannot be used as a
response strategy with illiterate people and children.
61
Research Design and 8.7.2 Non Comparative Scaling Techniques
Measurement
Non-comparative scaling techniques involve scaling of objects independent to
some specific standard. Respondent evaluate only one object at a time. For
example, in a study of consumer preferences for different telecom services, a
consumer may be asked to rate a list of factors that he/she would consider
while choosing a particular telecom company.
Example
Please rate the following factors you think important in choosing mobile
service. Rate 1 to least important and 5 to most important (rating could be on
any scale. In this example, we have used a 5 point scale).
Example
How would you rate a magazine with regard to its quality.
Presentati Most_________________x__________________Least
on Style
A very large number of ratings are possible if the respondents are literate
to understand and accurately differentiate the objects. The data generated
from continuous rating scale can be treated as numeric and interval data.
The researcher can divide the line into as many categories as desired and
assign scores based on the categories under which the ratings fall.
62
ii) Itemized Rating Scale Measurement and
Scaling Techniques
In this scale, respondents are provided with a scale having numbers/
descriptions associated with each category. The respondents are asked to
select the best fitting category with the object.The commonly used
itemised rating scale are:
Some of the important advantages of this scale are: (i) it is relatively easy to
construct as it can be performed without a panel of judges, (ii) it is considered
more reliable because respondents answer each statement included in the
63
Research Design and instrument, (iii) it can easily be used in respondent centred and stimulus
Measurement centred studies, (iv) this takes less time to construct. However, there are several
limitations of this type of scale i.e. (i) with this scale, we can simply examine
whether respondents are more or less favourable to a topic but it is difficult to
tell how much more or less they are, (ii) This does not rise more than to a
ordinal scale structure, (iii) the total score of an individual respondent has little
clear meaning since a given total score can be secured by a variety of answer
patterns.
+3 +2 +1 0 -1 -2 -3
Easy to read ____ : ____ : ____:____:_√__:____:____ Hard to read
Modern ____ : ____ : ____:____:_√__:____:____ Old fashion
Rational ____ : ____ : ____:____:____:____:____ Emotional
Unreliable ____ : ____ : ____:____:____:____:____ Reliable
Worthless ____ : ____ : ____:____:____:____:____ Valuable
Unorganized ____ : ____ : ____:____:____:____:____ Organized
Orthodox ____ : ____ : ____:____:____:____:____ Liberal
Vain ____ : ____ : ____:____:____:____:____ Modest
Like Likert Type Scale, this scale has also several advantages: (i) it is an
efficient and easy way to secure attitude from a large sample, (ii) the total set
of responses (both direction) gives a comprehensive picture of the meaning of
an object as well as a measure of the subject of the rating, (iii) it is a
standarised technique that can be easily repeated.
64
c) Stapel Scale Measurement and
Scaling Techniques
Stapel scales are named after John stapel who developed these scales. Stapel
scale consists of a single criterion in the centre with 10 categories numbered
from -5 to +5 without a neutral point (zero). The scale is usually presented
vertically. Negative rating indicates that the respondent inaccurately describes
the object and positive rating indicates that the respondent describes the object
accurately.
Example
Rate the Departmental store by marking (√) on the following factors. +5
indicate that the factor is most accurate for you and -5 indicate that the factor is
most inaccurate for you.
+5 -5
+4 -4
+3 -3
+2 -2
+1 -1
Services Products
-1 +1
-2 +2
-3 +3
-4 +4
-5 +5
In this scale, data generated is interval data. In this method, data can be
collected through telephonic interview. The data obtained from Stapel
scale can be analysed in the same way as semantic differential scale.
Example
A) Single Category Scale
1) Do you own a house?
Yes No
65
Research Design and 2) Do you own a car?
Measurement
Yes No
Yes No
b) Multiple Category
1) Please indicate your income by marking (√) in the income
group you fall
5000 - 10000
10000 - 15000
15000 - 20000
20000 - 25000
Cumulative scale (scalogram analysis) like any other scaling technique has
several advantages as well as limitations. The advantages are – It assures that
only a single dimension of attitude is measured, (ii) Researcher’s subjective
judgement is not allowed to creep in the development of scale since the scale is
determined by the replies of respondents.
66
Its main limitation is that (i) In practice perfect cumulative or unidimesional Measurement and
scale are very rarely found and approximation is used, (ii) Its developmental Scaling Techniques
procedure is cumbersome in comparison to other scaling methods.
……………………………………………………………………………….
……………………………………………………………………………….
……………………………………………………………………………….
……………………………………………………………………………….
3) What are the sources of measurement errors?
……………………………………………………………………………….
……………………………………………………………………………….
……………………………………………………………………………….
68
Measurement and
8.10 SOME USEFUL BOOKS/PAPERS Scaling Techniques
69