Chapter - 5 - Measurement in Research
Introduction
Measurement is the process of observing and recording the observations that are collected
as part of a research effort. There are two major issues that will be considered here.
First, you have to understand the fundamental ideas or theory involved in measuring. In
this chapter, we will focus on how we think about and assess the quality of measurement. In
the section on the concept of validity, the theory of what constitutes a good measure will
be presented. Reliability of measurement refers to consistency or dependability of
measurement, including consideration of true score theory and a variety of reliability
estimators. In the section on levels of measurement, the meaning of the four major levels
of measurement: nominal, ordinal, interval, and ratio will be discussed.
Learning objectives
After completing this chapter students will be able to:
Prior to starting any research project, it is important to determine how you are going to measure
a particular phenomenon. This process of measurement is important because it allows you to
know whether you are on the right track and whether you are measuring what you intend to
measure. Both reliability and validity are essential for good measurement, because they are
your first line of defense against forming inaccurate conclusions (i.e., incorrectly accepting or
rejecting your research hypotheses).
When people think about validity in research, they tend to think in terms of research
components. You might say that a measure is a valid one, that a valid sample was drawn,
or that the design had strong validity, but all of those statements are technically incorrect.
Measures, samples, and designs don't have validity—only propositions can be said to be
valid. Technically, you should say that a measure leads to valid conclusions or that a
sample enables valid inferences, and so on. It is a proposition, inference, or conclusion
that can have validity.
Figure 1, given below, shows that two realms are involved in research. The first,
on the top, is the land of theory. It is what goes on inside your head. It is where you keep
your theories about how the world operates. The second, on the bottom, is the land of
observations. It is the real world into which you translate your ideas: your programs,
treatments, measures, and observations. When you conduct research, you are continually
flitting back and forth between these two realms, between what you think about the world
and what is going on in it. When you are investigating a cause-effect relationship, you
have a theory (implicit or otherwise) of what the cause is (the cause construct). For
instance, if you are testing a new educational program, you have an idea of what it would
look like ideally. Similarly, on the effect side, you have an idea of what you are ideally
trying to affect and measure (the effect construct). But each of these—the cause and the
effect—have to be translated into real things, into a program or treatment and a measure
or observational method. The term operationalization is used to describe the act of
translating a construct into its manifestation. In effect, you take your idea and describe it
as a series of operations or procedures. Now, instead of it being only an idea in your
mind, it becomes a public entity that others can look at and examine for themselves. It is
one thing, for instance, for you to say that you would like to measure self-esteem (a
construct). But when you show a ten-item paper-and-pencil self-esteem measure that you
developed for that purpose, others can look at it and understand more clearly what you
intend by the term self-esteem.
Imagine that you want to examine whether use of a World Wide Web virtual classroom
improves student understanding of course material. Assume that you took these two constructs,
the cause construct (the Web site) and the effect construct (understanding), and operationalized
them, turned them into realities by constructing the Web site and a measure of knowledge of
the course material. Here are the four validity types and the question each addresses:
Conclusion Validity: In this study, is there a relationship between the two variables? In
the context of the example, the question might be worded: in this study, is there a
relationship between the Web site and knowledge of course material? There are several
conclusions or inferences you might draw to answer such a question. You could, for
example, conclude that there is a relationship. You might conclude that there is a
positive relationship. You might infer that there is no relationship. You can assess the
conclusion validity of each of these conclusions or inferences.
Internal Validity: Assuming that there is a relationship in this study, is the relationship a
causal one? Just because you find that use of the Web site and knowledge are
correlated, you can't necessarily assume that Web site use causes the knowledge. Both
could, for example, be caused by the same factor. For instance, it may be that wealthier
students, who have greater resources, would be more likely to have access to a Web site
and would excel on objective tests. When you want to make a claim that your program
or treatment caused the outcomes in your study, you can consider the internal validity
of your causal claim.
Construct Validity: Assuming that there is a causal relationship in this study, can you
claim that the program reflected your construct of the program well and that your
measure reflected well your idea of the construct of the measure? In simpler terms, did
you implement the program you intended to implement and did you measure the
outcome you wanted to measure? In yet other terms, did you operationalize well the
ideas of the cause and the effect? When your research is over, you would like to be able
to conclude that you did a credible job of operationalizing your constructs—you can
assess the construct validity of this conclusion.
External Validity: Assuming that there is a causal relationship in this study between the
constructs of the cause and the effect, can you generalize this effect to other persons,
places, or times? You are likely to make some claims that your research findings have
implications for other groups and individuals in other settings and at other times. When
you do, you can examine the external validity of these claims.
Notice how the question that each validity type addresses presupposes an affirmative
answer to the previous one. This is what I mean when I say that the validity types build
on one another. Figure 2 shows the idea of cumulativeness as a staircase, along with the
key question for each validity type.
Figure 2. The validity staircase, showing the major question for each type of validity
Activity-1-
Assuming that you would like to conduct research on the causal relationship between the
variables in the research question “How will employee motivation influence employee
productivity in an organization?”, identify and discuss the four types of validity and the type of
question each addresses in the course of answering the research question.
Commentary
Considering the relationship between employee motivation and employee performance, you
need to develop a data collection instrument that allows you to show that there is a relationship
between the cause and the effect (conclusion validity) and that the change in the dependent
variable is due to the change in the independent variable (internal validity). The other validity
questions are whether you are able to generalize to the construct, i.e., whether you have
correctly measured the construct (construct validity), and whether you can generalize to other
people and other organizations (external validity).
Terms such as consistency, predictability, dependability, stability, and repeatability are the
terms that come to mind when we talk about reliability. Broadly defined, reliability of a
measurement refers to the consistency or repeatability of the measurement of some
phenomena. If a measurement instrument is reliable, that means the instrument can measure
the same thing more than once or using more than one method and yield the same result.
When we speak of reliability, we are not speaking of individuals; we are actually talking about
scores.
If you think about how we use the word reliable in everyday language, you might get a hint. For
instance, we often speak about a machine as reliable: "I have a reliable car." Or, news people
talk about a "usually reliable source." In both cases, the word reliable usually means dependable
or trustworthy. In research, the term reliable also means dependable in a general sense, but
that's not a precise enough definition. What does it mean to have a dependable measure or
observation in a research context? The reason dependable is not a good enough description is
that it can be confused too easily with the idea of a valid measure. Certainly, when researchers
speak of a dependable measure, we mean one that is both reliable and valid. So we have to be a
little more precise when we try to define reliability.
The observed score is one of the major components of reliability. The observed score is just
that: the score you would observe in a research setting. The observed score is composed of a
true score and an error score. The true score is a theoretical concept. Why is it theoretical?
Because there is no way to really know what the true score is. The true score reflects the true
value of a variable. The error score is the reason why the observed score differs from the true score. The
error score is further broken down into method (or systematic) error and trait (or random)
error. Method error refers to anything that causes a difference between the observed score and
true score due to the testing situation. For example, any type of disruption (loud music, talking,
traffic) that occurs while students are taking a test may cause the students to become distracted
and may affect their scores on the test. On the other hand, trait error is caused by any factors
related to the characteristic of the person taking the test that may randomly affect
measurement. An example of trait error at work is when individuals are tired, hungry, or
unmotivated. These characteristics can affect their performance on a test, making the scores
seem worse than they would be if the individuals were alert, well-fed, or motivated.
Reliability can be viewed as the ratio of the true score over the true score plus the error score,
or:

    Reliability = True score / (True score + Error score)
Okay, now that you know what reliability is and what its components are, you're probably
wondering how to achieve reliability. Simply put, the degree of reliability can be increased by
decreasing the error score. So, if you want a reliable instrument, you must decrease the error.
As previously stated, you can never know the actual true score of a measurement. Therefore, it
is important to note that reliability cannot be calculated; it can only be estimated. The best way
to estimate reliability is to measure the degree of correlation between the different forms of a
measurement. The higher the correlation, the higher the reliability.
Before going on to the types of reliability, I must briefly review the three major aspects of
reliability: equivalence, stability, and homogeneity. Equivalence refers to the degree of
agreement between two or more measures administered at nearly the same time. Stability
requires a distinction between the repeatability of the measurement and that of the
phenomenon being measured; it is assessed by repeating the same measurement over time. Lastly,
homogeneity deals with assessing how well the different items in a measure seem to reflect the
attribute one is trying to measure. The emphasis here is on internal relationships, or internal
consistency.
Types of Reliability
Now back to the different types of reliability. The first type of reliability is parallel forms
reliability. This is a measure of equivalence, and it involves administering two different forms to
the same group of people and obtaining a correlation between the two forms. The higher the
correlation between the two forms, the more equivalent the forms.
The second type of reliability, test-retest reliability, is a measure of stability which examines
reliability over time. The easiest way to measure stability is to administer the same test at two
different points in time (to the same group of people, of course) and obtain a correlation
between the two tests. The problem with test-retest reliability is the amount of time you wait
between testing. The longer you wait, the lower your estimation of reliability.
Finally, the third type of reliability is inter-rater reliability, a measure of homogeneity. With
inter-rater reliability, two people rate a behavior, object, or phenomenon and determine the
amount of agreement between them. To determine inter-rater reliability, you take the number
of agreements and divide them by the number of total observations.
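The agreement calculation just described can be sketched as follows; the two raters and their on-task/off-task codes are hypothetical.

```python
def inter_rater_agreement(rater_a, rater_b):
    """Percent agreement: number of matching ratings / total observations."""
    if len(rater_a) != len(rater_b):
        raise ValueError("both raters must score the same observations")
    agreements = sum(1 for a, b in zip(rater_a, rater_b) if a == b)
    return agreements / len(rater_a)

# Hypothetical ratings of ten observed behaviors by two observers
rater_1 = ["on-task", "off-task", "on-task", "on-task", "off-task",
           "on-task", "on-task", "off-task", "on-task", "on-task"]
rater_2 = ["on-task", "off-task", "on-task", "off-task", "off-task",
           "on-task", "on-task", "on-task", "on-task", "on-task"]

print(inter_rater_agreement(rater_1, rater_2))  # 8 agreements / 10 = 0.8
```

A value close to 1.0 indicates that the two raters are applying the rating scheme consistently.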
A measurement can be reliable, but not valid. However, a measurement must first be reliable
before it can be valid. Thus reliability is a necessary, but not sufficient, condition of validity. In
other words, a measurement may consistently assess a phenomenon (or outcome), but unless
that measurement tests what you want it to, it is not valid.
Remember that when designing a research project, it is important that your measurements are
both reliable and valid. If they aren't, then your instruments are basically useless and you
decrease your chances of accurately measuring what you intended to measure.
Activity-2-
Assuming that you would like to conduct research on the causal relationship between the
variables in the research question “How will employee motivation influence employee productivity
in an organization?”, identify and discuss the three aspects of measurement reliability and the
type of question each addresses in the course of answering the research question.
5.3. Measurement Error
True score theory is a good simple model for measurement, but it may not always be an
accurate reflection of reality. In particular, it assumes that any observation is composed of the
true value plus some random error value; but is that reasonable? What if all error is not
random? Isn't it possible that some errors are systematic, that they hold across most or all of the
members of a group? One way to deal with this notion is to revise the simple true score model
by dividing the error component into two subcomponents, random error and systematic error.
Figure 3 shows these two components of measurement error, what the difference between
them is, and how they affect research.
Random error is caused by any factors that randomly affect measurement of the variable across
the sample. For instance, people's moods can inflate or deflate their performance on any
occasion. In a particular testing, some children may be in a good mood and others may be
depressed. If mood affects the children's performance on the measure, it might artificially inflate
the observed scores for some children and artificially deflate them for others. The important
thing about random error is that it does not have any consistent effects across the entire
sample. Instead, it pushes observed scores up or down randomly. This means that if you could
see all the random errors in a distribution they would have to sum to 0. There would be as many
negative errors as positive ones. (Of course, you can't see the random errors because all you see
is the observed score X. God can see the random errors, but she's not telling us what they are!)
The important property of random error is that it adds variability to the data but does not affect
average performance for the group (Figure 3). Because of this, random error is sometimes
considered noise.
Figure 3. Random error adds variability to a distribution but does not affect central
tendency (the average)
Systematic error is caused by any factors that systematically affect measurement of the
variable across the sample. For instance, if there is loud traffic going by just outside of a
classroom where students are taking a test, this noise is liable to affect all of the children's
scores—in this case, systematically lowering them. Unlike random error, systematic
errors tend to be either positive or negative consistently; because of this, systematic error
is sometimes considered to be bias in measurement (Figure 4 ).
Figure 4. Systematic error affects the central tendency of a distribution
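The contrast between the two error types can be illustrated with a short simulation. The score distribution and the size of the systematic bias are arbitrary assumed values, chosen only to make the pattern visible.

```python
import random
import statistics

random.seed(7)

# Hypothetical true scores for 10,000 test-takers
true_scores = [random.gauss(70, 8) for _ in range(10_000)]

# Random error: pushes individual scores up or down; errors average out to ~0
with_random_err = [t + random.gauss(0, 5) for t in true_scores]

# Systematic error: a constant bias, e.g. traffic noise lowering every score
BIAS = -4  # assumed size of the disruption, for illustration only
with_systematic_err = [t + BIAS for t in true_scores]

print(f"true mean:             {statistics.fmean(true_scores):.1f}")
print(f"random-error mean:     {statistics.fmean(with_random_err):.1f}")   # nearly unchanged
print(f"systematic-error mean: {statistics.fmean(with_systematic_err):.1f}")  # shifted down
print(f"true SD:               {statistics.stdev(true_scores):.1f}")
print(f"random-error SD:       {statistics.stdev(with_random_err):.1f}")   # larger: added noise
```

Random error leaves the group mean essentially intact while inflating the spread (noise); systematic error shifts the whole distribution by the bias amount while leaving the spread alone.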
Reducing Measurement Error
So, how can you reduce measurement errors, random or systematic? One thing you can do is to
pilot test your instruments to get feedback from your respondents regarding how easy or hard
the measure was and information about how the testing environment affected their
performance. Second, if you are gathering measures using people to collect the data (as
interviewers or observers), you should make sure you train them thoroughly so that they aren't
inadvertently introducing error. Third, when you collect the data for your study you should
double-check the data thoroughly. All data entry for computer analysis should be double-
punched and verified. This means that you enter the data twice, the second time having your
data-entry machine check that you are typing the exact same data you typed the first time.
Fourth, you can use statistical procedures to adjust for measurement error. These range from
rather simple formulas you can apply directly to your data to complex modeling procedures for
modeling the error and its effects. Finally, one of the best things you can do to deal with
measurement errors, especially systematic errors, is to use multiple measures of the same
construct. Especially if the different measures don't share the same systematic errors, you will
be able to triangulate across the multiple measures and get a more accurate sense of what's
happening.
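The double-entry (double-punch) check described above can be sketched as a simple comparison of the two entry passes; the data values here are made up for illustration.

```python
def verify_double_entry(first_pass, second_pass):
    """Return the positions at which the two data-entry passes disagree."""
    return [i for i, (a, b) in enumerate(zip(first_pass, second_pass)) if a != b]

entry_1 = [23, 45, 31, 52, 38]   # first keying of the raw data
entry_2 = [23, 45, 13, 52, 38]   # second keying; transposition typo at index 2

mismatches = verify_double_entry(entry_1, entry_2)
print(mismatches)  # flagged positions to re-check against the source records
```

Any flagged position is resolved by going back to the original paper record before analysis, so that simple keying errors do not become measurement error in the data set.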
Activity-3-
Considering the question under Activity 2, attempt to explain the difference between
random error and systematic error in measurement by giving examples in the context of
the given research question, and suggest how to reduce measurement errors.
Commentary
Your ability to answer your research question is partly determined by the quality of your
data, which is in turn influenced by your sampling procedure and by potential random
and systematic errors in measurement. You therefore need to exercise the utmost care in
the data collection process, the method of data collection, and the sampling procedure.
On the surface, measurement may appear to be a very simple process. It is simple as long
as we are measuring objective properties, which are physically verifiable characteristics
such as age, income, number of bottles purchased, store last visited, and so on. However,
researchers often desire to measure subjective properties, which cannot be directly
observed because they are mental constructs such as a person’s attitude or intentions. In
this case, the researcher must ask a respondent to translate his/her mental constructs onto
a continuum of intensity, which is no easy task. To do this, the researcher must develop question
formats that are very clear and that are used identically by the respondents. This process
is known as scale development.
5.4.2. Description
Description refers to the use of a unique descriptor, or label, to stand for each designation
in the scale. For instance, “yes” and “no”, “agree” and “disagree”, and the number of
years of a respondent’s age are descriptors of a simple scale. All scales include
description in the form of characteristic labels that identify what is being measured.
5.4.3. Order
Order refers to the relative sizes of the descriptors. Here, the key word is “relative” and
includes such descriptors as “greater than”, “less than”, or “equal to”. A respondent’s
least preferred brand is “less than” his/her most preferred brand, and respondents who
check the same income category indicate the same amount (“equal to”). Not all scales possess
order characteristics. For instance, is a “buyer” greater than or less than a “non-buyer”?
We have no way of making a relative size distinction.
5.4.4. Distance
A scale has the characteristic of distance when the absolute differences between the
descriptors are known and may be expressed in units. The respondent who purchases
three bottles of diet cola buys two more than the one who purchases only one bottle; a
three-car family owns one more automobile than a two-car family. Note that when the
characteristic of distance exists, we are also given order. We know not only that the
three-car family has “more” cars than the two-car family, but also the distance between
the two (one car).
5.4.5. Origin
A scale is said to have the characteristic of origin if there is a unique beginning or true
zero point for the scale. Thus, 0 is the origin for an age scale just as it is for the number of
miles travelled to the store or the number of bottles of soda consumed. Not all scales have
a true zero point for the property they are measuring. In fact, many scales used by
researchers have arbitrary neutral points, but they do not possess origins. For instance,
when a respondent says “no opinion” to the question “Do you agree or disagree with the
statement ‘The Lexus is the best car on the road today’?”, we cannot say that the person
has a true zero level of agreement.
Perhaps you noticed that the scaling characteristics are cumulative, and that description is
the most basic characteristic, present in every scale. If a scale has order, it also possesses
description. In other words, if a scale has a higher-level property, it also has all
lower-level properties. But the opposite is not true.
Activity-4-
Assume that you are given an assignment by your immediate supervisor to develop a
questionnaire in order to measure the level of customer satisfaction with a given company’s
product and to develop a customer profile. Attempt to develop a measurement scale and
describe the properties of the four types of measurement scales with reference to your assignment.
5.5. Levels of measurement scales
You may ask, “Why is it important to know the characteristics of scales?” The answer is
that the characteristics possessed by a scale determine that scale’s level of measurement.
In turn, which descriptive statistics are most appropriate for your data will depend on the
measurement scale used in collecting information on each particular item. We have four
levels of measurement: nominal, ordinal, interval, and ratio scales.
The table below shows how each scale type differs with respect to the scaling
characteristics we have just discussed:

Scale       Description   Order   Distance   Origin
--------    -----------   -----   --------   ------
Nominal     Yes           No      No         No
Ordinal     Yes           Yes     No         No
Interval    Yes           Yes     Yes        No
Ratio       Yes           Yes     Yes        Yes
Nominal scales are defined as those that use only labels; that is, they possess only the
characteristic of description. Examples include designations of race, religion, type of
dwelling, gender, brand last purchased, or buyer/non-buyer; answers that involve yes-no or
agree-disagree; or any other instance in which the descriptors cannot be differentiated
except qualitatively. If you describe respondents in a survey according to their
occupation (banker, doctor, computer programmer), you have used a nominal scale. Note
that these examples of a nominal scale only label the respondent. There is no ordering
among the categories (i.e., male is not “greater” or “less” than female), and averaging is
not appropriate for this type of data. The measures used to describe this type of data are
the percentages that fall into each category or the mode (the most commonly selected
category).
Nominal scales are for classification – they are not “measures” in the true sense of the
term as they do not represent “quantities”, “magnitudes”, “frequencies”, or the like.
You use nominal scales when the categories are exhaustive (include all alternatives, even
though one choice may be “other”) and mutually exclusive (none fall into more than one
category).
You can summarize these data as the percentage of respondents who fall into each
category or as the mode, which is the term used to describe the most common category
selected. The mode can be used to express the “middle” of the distribution; however, the
frequency distribution (percentages in each category) will generally suffice. There is no
measure of variability for this type of data.
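The nominal summaries described above (frequency distribution and mode) can be computed as follows; the occupation data are hypothetical.

```python
from collections import Counter

# Hypothetical nominal data: occupations of eight survey respondents
occupations = ["banker", "doctor", "banker", "programmer",
               "banker", "doctor", "programmer", "banker"]

counts = Counter(occupations)
mode = counts.most_common(1)[0][0]          # most commonly selected category
percentages = {k: 100 * v / len(occupations) for k, v in counts.items()}

print(mode)         # the mode of the distribution
print(percentages)  # frequency distribution in percent
```

Note that no mean or standard deviation appears here: with nominal data, counting category membership is the only legitimate operation.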
Ordinal scales reflect ordered categories (e.g., small, medium, and large). We can say one
category reflects more of the attribute we are measuring than does another, or that the top
category reflects more than those below it, but we cannot say how much more.
Satisfaction, for example, can be rated from less to more highly satisfied, but we are not
sure if the difference between being “very satisfied” and “satisfied” is really equivalent to
the difference between being “dissatisfied” and being “very dissatisfied”. This limits the
statistics we can use with this type of data. Averages, for example, are not really
appropriate here. As a result, the statistical tests for these types of data tend to use other
approaches, such as looking at rankings across the categories in each group.
Most items on surveys have ordinal scale alternatives as selections that the respondents
can choose from. We generally use 5-point scales that include negative categories (“very
dissatisfied”, “dissatisfied”), a “neutral” midpoint, and positive categories (“satisfied”,
“very satisfied”).
These data can also be summarized as the percentage of respondents who fall into each
category. The median can be used to indicate the centre or mid-point of the distribution,
and the interquartile range can be used as an indication of variability in the data.
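These ordinal summaries (median and interquartile range) can be sketched as follows; the satisfaction ratings, coded 1 (very dissatisfied) through 5 (very satisfied), are hypothetical.

```python
import statistics

# Hypothetical ordinal data: satisfaction coded 1..5 for twelve respondents
ratings = [4, 5, 3, 4, 2, 5, 4, 3, 4, 1, 5, 4]

median = statistics.median(ratings)
q1, q2, q3 = statistics.quantiles(ratings, n=4)  # quartile cut points
iqr = q3 - q1

print(median)  # centre of the distribution
print(iqr)     # interquartile range as a measure of spread
```

The median and IQR only rely on the ordering of the codes, not on the (unknown) distances between them, which is what makes them appropriate for ordinal data.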
Some users of statistics feel comfortable applying statistics for interval scales to these
data if items are summed to produce a total score (e.g. a satisfaction or loyalty index) or
there are a wide range of ordered categories, but purists are uncomfortable with this
approach. There are however, a variety of procedures (referred to as non-parametric
statistics) that can be used to test for the significance of differences between groups or
subgroups or to assess the significance of changes that occur over time.
Interval scales are those in which the distance between each descriptor is known. The
distance is normally defined as one scale unit. For example, a coffee brand rated “3rd” in
taste is one unit away from one rated “4th”. Sometimes the researcher must impose a
belief that equal intervals exist between the descriptors. That is, if you were asked to
evaluate a store’s salespeople by selecting a single designation from a list of “extremely
friendly”, “very friendly”, “somewhat friendly”, “somewhat unfriendly”, “very
unfriendly”, or “extremely unfriendly”, the researcher would probably assume that each
designation was one unit away from the preceding one. In these cases, we say that the
scale is “assumed interval”. As shown in Table 2, such descriptors are evenly spaced
on a questionnaire; as such, the labels connote a continuum and the check lines are equal
distances apart. By wording or spacing the response options on a scale so they appear to
have equal intervals between them, the researcher achieves a higher level of
measurement than ordinal or nominal.
For this type of data, we can compute mean (average) scores or medians and percentiles.
Typically, the median is preferred if the data are skewed (biased towards lower or higher
scores) or the range can go to very high values (as in housing costs or income level) as
the median is less affected by skew and outliers than is the mean. Measures of variance,
the standard deviation, or the median absolute deviation can be computed for sample
means to express the range within which the population mean is likely to lie.
The statistics for testing for group differences or changes over time that are available for
this type of data (e.g. the t-test, the analysis of variance or ANOVA, etc) tend to be more
powerful.
Ratio scales are ones in which a true origin exists, such as an actual number of purchases
in a certain time period, dollars spent, miles travelled, number of children, or years of
college education. This characteristic allows us to construct ratios when comparing
results of the measurement. One person may spend twice as much as another, or travel
one-third as far. Such ratios are inappropriate for interval scales, so we are not allowed to
say that one store is one-half as friendly as another.
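The interval/ratio distinction can be sketched numerically; the spending figures (in Birr) and the friendliness ratings below are hypothetical.

```python
import statistics

# Hypothetical ratio-scaled data: Birr spent per store visit (true zero exists)
spend = [40, 80, 20, 60, 100]
print(statistics.fmean(spend))   # means are meaningful
print(max(spend) / min(spend))   # ratios are meaningful: 5x as much spent

# Hypothetical interval-scaled data: friendliness rating, 1..7, arbitrary zero.
# A store rated 6 is not "twice as friendly" as one rated 3; only the
# difference (6 - 3 = 3 scale units) is interpretable.
friendliness = [6, 3]
print(friendliness[0] - friendliness[1])
```

Because the spending scale has a true zero, the ratio 100/20 says something real about behavior; the friendliness codes have only an arbitrary origin, so only differences between them carry meaning.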
1. Please rank each brand in terms of your preference. Place a “1” by your first choice, a
“2” by your second choice, and so on.
___________ Pepsi
___________ Seven-Up
Loyal Vs Hadiya
Bambis Vs Tana
C. Interval-scaled questions
Lacoste 1 2 3 4 5
Parker 1 2 3 4 5
Sony 1 2 3 4 5
2. Indicate your degree of agreement with the following statements by circling the appropriate
number
Disagree                              Agree
a. I love to cook 1 2 3 4 5
D. Ratio-scaled questions
________ years.
2. Approximately how many times in the last week have you purchased anything over Birr 5 in
value at Tana Super market?
0 1 2 3 4 5 More (specify)
3. How much do you think a typical purchaser of a Birr 100,000 term life insurance policy pays
per year for that policy? Birr ________
4. What is the probability that you will use a lawyer’s service when you are ready to make a
will?_______ percent
Description: Statement with which respondent shows the amount of agreement / disagreement to a
specific measurement question. It is the characteristics of an ordinal scale.
Description: Scale is inscribed between two bipolar words and respondent selects the point that most
represents the direction and intensity of his / her feelings
5. Rank order
Description: Respondent is asked to rate or rank each option with reference to a specific research
variable. This allows the researcher to obtain information on relative preferences, importance, etc. Long
lists should be avoided (respondents generally find it difficult to rank more than 5 items).
Example: Please indicate, in rank order, your preferred chewing gum brands, putting 1 next to your
favorite through to 5 for your least favorite.
Poppotine
Strawberry
Special mint
Wow
Banana
The level of measurement determines what information you will have about the object of
study; it determines what you can say and what you can not say about the object. For
example, nominal scales convey the lowest information level, and therefore they are
sometimes considered the crudest scales. Nominal scales allow us to do nothing more
than identify our object of study on some property. Ratio scales, however, contain the
greatest amount of information; they allow us to say many things about our object. Yet, it
is not always possible to have a true zero point.
The level of measurement dictates what type of statistical analysis you may or may not
perform. Higher-level scales permit much more sophisticated analysis. In other words, the
amount of information contained in the scale dictates the limits of statistical analysis.
Activity-5-
Explain the implication of level of measurement scale in answering your research question by providing
a specific example from the examples you are given above for illustration.