HMEF5093 Quantitative Research Methodology
HMEF5093 Quantitative Research Methodology
QUANTITATIVE
RESEARCH
METHODOLOGY
Prof Dr John Arul Phillips
References 178
Appendix 1 183
Appendix 2 184
WELCOME TO HMEF5093
Welcome to HMEF5093 Quantitative Research Methodology, which is one of the
required courses for the Master of Education (MEd) programme, Option 2. The
course assumes no previous knowledge and experience in educational research
but you are encouraged to tap into your experiences as a teacher, instructor,
lecturer or trainer, and relate them to the concepts and principles discussed. This
is a three-credit course conducted over a semester of 14 weeks.
Course Topics
To enable you to achieve the NINE objectives of the course, HMEF 5093 is
divided into 10 topics. Specific learning outcomes are stated at the start of each
topic indicating what you should be able to achieve after completing the topic.
Topic 7: Instrumentation
The topic is devoted towards explaining the design and development
of instruments such as the survey questionnaire, attitude scales and
use of commercially available instruments. The issue of reliability and
validity of instruments is discussed.
To help you read and understand the individual topics, numerous realistic
examples support all definitions, concepts and theories. Diagrams and text are
combined into a visually appealing, easy-to-read module. Throughout the course
content, diagrams, illustrations, tables and charts are used to reinforce important
points and simplify the more complex concepts. The module has adopted the
following features in each topic:
X INTRODUCTION
Lists the headings and subheadings of each topic to provide an overview of the
contents of the topic and prepare you for the major concepts to be studied and
learned.
LEARNING OUTCOMES
This is a listing of what you should be able to do after successful
completion of a topic. In other words, whether you are be able to explain,
compare, evaluate, distinguish, list, describe, relate and so forth. You
should use these indicators to guide your study. When you have finished
a topic, you must go back and check whether you have achieved the
learning outcomes or be able to do what is required of you. If you make a
habit of doing this, you will improve your chances of understanding the
contents of the course.
SELF-CHECK
ACTIVITY
The main ideas of each topic are listed in brief sentences to provide a review of
the content. You should ensure that you understand every statement listed. If
you do not, go back to the topic and find out what you do not know.
Key Terms discussed in the topic are placed at end of each topic to make you
aware of the main ideas. If you are unable to explain these terms, you should go
back to the topic to clarify.
DISCUSSION QUESTIONS:
At the end of each topic a list of questions is presented that are best solved
through group interaction and discussion. You can answer the questions
yourself. But, you are encouraged to work with your course-mates and discuss
online and during the seminar sessions.
At the end of each topic a list of articles and topics of books is provided that is
directly related to the contents of the topic. As far as possible the articles and
books suggested for further reading will be available in OUMÊs Digital Library
which you can access and OUMÊs Library. Also, relevant internet resources are
made available to enhance your understanding of selected curriculum concepts
and principles as applied in real-world situations.
Facilitator
Your facilitator will mark your assignments and provide assistance to you during
the course. Do not hesitate to discuss during the seminar sessions or online if:
Ć You do not understand any part of the course content or the assigned
readings
Ć You have difficulty with the self-tests and activities
Ć You have a question or problem with the assignments.
(e) When have completed the topic, review the learning outcomes to confirm
that you have achieved them and able to do what is required.
(f) If you are confident, you can proceed to the next topic. Proceed topic by
topic through the course and try to pace your study so that you keep
yourself on schedule.
(g) After completing all topics, review the course and prepare yourself for
the final examination. Check that you have achieved all topic leaning
outcomes and the course objectives (listed in this Course Guide)
FINAL REMARKS
One again, welcome to the course. To maximise your gain from this course you
should try at all times relate what you are studying with the real-world. Look at
the environment in your institution and ask yourself whether they provide
opportunities for research. Most of the ideas, concepts and principles you learn in
this course have practical applications. It is important to realise that much we do
in education and training has to be based on sound theoretical foundations. The
contents of this course provide the principles for doing research in education
whether it be in a school, college, university or training organisation.
We wish you success with the course and hope that you will find it interesting,
useful and relevant towards your development as a professional. We hope you
enjoy your experience with OUM and we would like to end with this saying,
„Education is a lifetime of learning, relearning and unlearning‰ by Alvin Toffler
(an adaptation).
INTRODUCTION
This guide explains the basis on which you will be assessed in this course during
the semester. It contains details of the facilitator-marked assignments, final
examination and participation required for the course.
One element in the assessment strategy of the course is that, all students should
have the same information as facilitators about the answers to be assessed.
Therefore, this guide also contains the marking criteria that facilitators will use in
assessing your work.
Please read through the whole guide at the beginning of the course.
ACADEMIC WRITING
(a) Plagiarism
(c) Referencing
All sources that you cite in your paper should be listed in the Reference
section at the end of your paper. HereÊs how you should do your Reference.
ASSESSMENT
Please refer to myVLE.
LEARNING OUTCOMES
By the end of this topic, you should be able to:
1. Define what research is;
2. Explain the characteristics of research;
3. Define what educational research is;
4. Describe the steps involved in the research process; and
5. Differentiate between quantitative and qualitative research.
X INTRODUCTION
The word „research‰ has been used in many different ways, sometimes rather
loosely, giving rise to confusion, and at times with the intention to deceive. This
is illustrated in the following circumstances:
(i) „Research has shown that four out of five dentists interviewed used
Plantoid Herbal Toothpaste.‰ ă The claim that the majority of dentists used
the particular brand of toothpaste is misleading as there is no mention of
the total number of dentists interviewed and whether they were
representative of all dentists in the country.
(ii) „Years of research have shown that SlimTex Capsules reduce oneÊs weight
in a matter of weeks.‰ ă Research on the use of various types of medicine
and supplements by humans tends to be inconclusive. In relation to the
advertisement on slimming pills, it would be difficult to make a claim that
the pills reduce weight as there are many other contributory factors such as
gender, level of health and diet.
ACTIVITY 1.1
Identify the different ways in which the word „research‰ has been used
in the statements above.
• Scientific • Controlled
• Systematic • Theory and hypotheses
• Empirical and critical investigation • Presumed relations
• Discover facts • Natural phenomena
(b) Test the hypothesis (design a study to establish whether the relationship
between the constructs is as hypothesised);
(d) Decide to accept or reject the hypothesis (e.g. correlation between attitude
towards mathematics and mathematics performance).
The purpose of using the scientific method is to enable the researcher to describe
(the relations between factors); to predict (given what is known, we can predict
what might happen); to control (when certain variables are manipulated, does it
lead to a particular condition), and to explain (can a theory be formulated to
explain the phenomena being investigated).
SELF-CHECK 1.1
1. Define research in your own words.
2. Identify the steps in gaining knowledge through the use of scientific
method.
3. What are some characteristics of research?
ACTIVITY 1.2
Where does one find research problems in education? They are all around you!
There are a host of research problems or unresolved issues everywhere. Look at
the 14-year-old who says, „I hate history‰. Do you know why? Do you want to
know why? In fact, whatever that arouses your interest for which there are as yet
no answers or are inconclusive have the potential of being a research problem.
Often, one starts with a rather general, diffuse and even confused notion of the
problem. Do not worry as this is the nature and complexity of the research
process. It is the first step that helps you towards becoming a mature researcher.
The following steps are to help you obtain a research problem:
(a) Step 1: Identify a Broad Problem in Your Area that is of interest to you and
related to your professional goals. You should build sizeable knowledge in
your area of interest for the long term. For example, you may be interested
in how young children learn mathematics, how to get teenagers interested
in science, how to enhance the teaching of moral education, how to improve
training in the corporate sector using e-learning, and the relationship
between headmastersÊ leadership style and teacher morale.
(b) Step 2: Systematic Programme of Reading within your broad area of study.
For example, if you are interested in why children have difficulty learning
mathematics, you can start with textbooks in the area. These textbooks
explain the basic concepts and facts related to the issue and may cite
research in the „references‰ section which you can further explore. Journals
such as the Review of Research in Education and the Review of Educational
Research provide valuable information about a particular field as they
review related literature. Next, read articles in the relevant journals in the
field. For example, if you are interested in reading about research, you may
want to read the Journal of Reading and the Reading Research Quarterly.
You can also look through the catalogue both in the library and online, and
identify the journals in your field.
(c) Students often do not adequately refer to journals in the field but, instead,
tend to cite from popular sources such as newspapers, magazines and
speeches. Journals report empirical evidence on the field you are interested
in, reflect the current thinking about research and indicate the trend of
research efforts in the field. This will give you a grasp of leading edge
research in the United States, Britain, Europe and Australia and how you
can do the same in Malaysia.
(d) Step 3: Relate Your Research Problem to a Theory in the field. A theory is
an explanation of events, phenomena or behaviour. For example, if you are
interested in finding out whether providing children with multimedia
presentations on science concepts will enhance understanding, you may
want to explore the underlying theories of visual learning. Many
phenomena in education are explained by drawing upon theories from
cognitive psychology, sociology, psycholinguistics, management, computer
science and so forth. It should be remembered that theory provides the
direction of the research. We will discuss in more detail the role of theory in
Topic 2.
SELF-CHECK 1.2
relation exists between the variables? The answer to this question will be sought
by conducting the research.
3. The problem should imply possibilities of empirical testing. A problem that does
not contain implications for testing its relationship or relations is not a scientific
problem. So, if you can measure the constructs of self-esteem and academic
performance, then the problem is considered a good problem.
One critical issue to consider when you look for a problem is problem
importance i.e. how important the problem is and why the problem is worth
your attention. As the problem limits the scope of your study by focusing on
some variables and not others, it also provides an opportunity for you to
demonstrate why these variables are important. You can use the following
questions as a guide to assess the importance of your problem:
• Is the problem of current interest?
• Will the problem continue in the future?
• Will gathering more information about the problem have any practical
application?
• Will gathering more information about the problem have any theoretical
importance?
• How big and influential is the population affected by the problem?
• Would research into the problem substantially revise or extend existing
knowledge?
SELF-CHECK 1.3
1. What is a good problem statement?
2. Why do graduate students have difficulty in stating research
problems?
Next, you hypothesise that „the car does not start because the spark plugs are
dirty‰. You check the spark plugs to determine whether they are dirty and accept
or reject the hypothesis accordingly.
Hence, the decision on which methodology to use will depend on the research
problem and the research questions or hypotheses. It is not good practice to decide
on a methodology and then work on the research questions. You have to be able to
state clearly what you intend to study and then decide on an appropriate
methodology. If you are clear about your research problem and research questions,
you will find it easy to get assistance from your supervisor and other students in
designing a study to obtain answers to your research questions.
ACTIVITY 1.3
The following research problems are taken from research literature. Study
them carefully and construct one or two hypotheses or research questions
based on them.
(a) The influence of self-esteem and level of aspiration on academic
achievement.
(b) The effectiveness of graphic organisers in enhancing the
understanding of science text material learners.
(c) The effects of organisational climate in schools on teacher
satisfaction and morale.
ACTIVITY 1.4
You stopped at a junction because the lights had just turned red. But,
on your left, a motorcyclist and a car beat the red light. You were
most annoyed. Being a good social scientist, you asked yourself why
people did that. If you were to convert the problem into a hypothesis,
which of the following would be the best hypothesis? Explain your
answer.
(a) Some people beat the red light more frequently than others.
(b) People who beat the red light are a danger to themselves and
other road users.
(c) Beating the red light is common in crowded cities.
(d) Road users are more likely to beat the red light when traffic is
light than when traffic is heavy.
ACTIVITY 1.5
1. Identify a research problem that you are interested in
investigating.
2. Formulate two research questions or hypotheses based on the
research problem.
(b) Quantitative research adopts the deductive approach. The researcher begins
by formulating research questions and/or hypotheses based on an available
theory or theories. He/she then designs an empirical study to test the
hypotheses and/or answer the research questions. Next, he/she conducts
the study to gather relevant data. Based on the hypotheses tested and/or
answers to the research questions, the researcher confirms or revises the
theory or theories. On the contrary, qualitative research adopts the inductive
approach. Normally, such research is conducted because there is a lack of
theory or existing theories are unable to explain a phenomenon
convincingly. Due to this, no hypotheses are put forward to guide the
research. The qualitative researcher begins by observing the phenomenon
and continues to find patterns in the form of themes, categories, concepts
and typologies that emerge. Tentative hypotheses are introduced and
additional information is collected to explain the phenomenon. Thus,
quantitative research is theory-driven while qualitative research is data-
driven. Figure 1.2 summarises the general research procedures in the two
approaches.
ACTIVITY 1.6
Examine the two research studies on „reasons for discipline problems
in schools‰. Then, state which is quantitative research and which is
qualitative research. Justify your answers.
ACTIVITY 1.7
1. Write down your definition of research.
2. Suggest how you will go about finding a research problem that
you propose to investigate.
3. Discuss some educational research you have read. (Go to OUMÊs
Digital Library and click on „ProQuest‰, which has a good
collection of journals in education.)
4. List the current thinking on research in your area of interest.
• The word „research‰ has been used in many different ways, sometimes rather
loosely, giving rise to confusion, and at times with the intention to deceive.
• The seven steps of the educational research process are useful guides for
beginning researchers.
X INTRODUCTION
Borg and Borg (1988) mentioned that „without some viable theory to serve as a
guide, many studies address trivial questions or contribute nothing to the slow
accumulation of knowledge needed for the advancement of the science of
education.‰ Do you agree with this statement made by Borg and Borg?
Theory has not been properly understood by some graduate students who are of
the opinion that theory is not relevant to practice. Oftentimes, we hear them
remark that a particular course is „too theoretical‰ or that they prefer courses that
are „practical oriented and not too theoretical‰. Some go to the extent of
denouncing theory as useless! This reflects a lack of understanding of what
theory is, and what practice or practical is. According to Borg and Borg (1988),
theory serves as a guide for research to avoid investigating phenomena that are
irrelevant and do not contribute to our understanding.
Note that each of these theories explains learning and the variables or factors that
determine learning. Assuming that each of these theories is true, we can predict
that learning will take place when a student connects new information with old
information, is reinforced through a system of rewards, and reproduces a
modelled behaviour if it brings pleasure. However, many areas of education
have virtually no theoretical foundation and have to rely on other behavioural
sciences, such as psychology, sociology, anthropology and social psychology.
Children are asked whether the amount of water in container A is the same as
that in container C (Is A = C?). Based on these observations, the researcher
concludes whether to confirm or disconfirm the theory. If the theory is
confirmed, it means that children are unable to perform conservation tasks. In
other words, the children would have replied that container C had more water. If
the theory is disconfirmed, then the majority of children would have answered
that the amount of water in container A and C is the same.
SELF-CHECK 2.1
1. What is a theory?
2. Why are there few laws in the field of education?
3. What do you mean by confirming or disconfirming a theory?
4. Identify some theories in your field of interest.
5. What are the implications of PiagetÊs theory of childrenÊs
cognition in the classroom?
Let us examine another well-known theory in psychology which has been used
widely in education. David Clarence McClelland developed a theory of
motivation in 1988, which states that a person is motivated to do something
because of a desire or need for achievement, authority or affiliation, or a
combination of the three characteristics, as provided in Figure 2.3.
ACTIVITY 2.1
1. To what extent does McClellandÊs Motivation Theory describe
your motivation to do something in your daily life?
2. Briefly explain how you would attempt to confirm or disconfirm
McClellandÊs Theory.
[You can find more information about this theory at this site:
http://www.businessballs.com/davidmcclelland.htm
Chapman, A. (1995). David McClellandÊs motivational needs
theory (Electronic version).]
3. Identify some major theories in your area of interest.
The aim of a literature review is to show what has been done in the field and how
your study relates to earlier research. It also indicates the approaches, the
samples used, the variables examined, the statistical procedure used and, most
important of all, the findings obtained. The review gives an overview of the
findings of various previous studies. The review traces the general patterns of the
findings and the conclusions that can be made based on the findings.
It also provides an insight into how your study is similar to or different from
previous studies. For example, is your study an extension of what others have
done? Are you examining variables that have not been attended to in earlier
studies? Are you attempting to replicate earlier studies in a different cultural
context? Are you applying statistical procedures that have not been attempted by
others?
The review of literature requires you to locate, read and evaluate reports of
research as well as reports of opinions and proposals. The review must be
extensive and thorough because you are aiming to obtain a detailed account of
the topic being studied.
(a) The review of literature is an important part of the research process because
it forms the basis of any research and puts your work into perspective. The
literature may inform you about how your research may add to the current
knowledge base. It gives an understanding of previous work (seminal
works in the field), familiarises you with the personalities doing research in
the field and demonstrates that you can access such works.
(b) The review of literature helps the student in delimiting the research
problem by setting parameters. By setting the limits of your study, you
avoid being questioned, "Why didn't you do this or do that?" You can
confidently reply that your study is confined to studying what you had set
out to study. Delimiting the research problem can be achieved if you read
extensively and intensively the problem you plan to investigate, and from
the literature specify clearly what it is you want to study.
(c) The literature also provides insights into the approaches and methodologies
adopted by different researchers. A common mistake made by students is
to pay attention to only the findings of studies. Besides findings, students
should also examine the methodologies used to study the phenomenon
they are interested in. There could be unique approaches adopted which
you might want to replicate in the Malaysian context. For example, in most
studies reviewed, the sample tended to be university students. Hence, if
your study is an attempt to use secondary school students as the sample, it
could be a unique contribution to the field because your approach is
different.
(d) Some research studies include a section on „Recommendations‰ for further
research, which indicates whether you are on the right track in studying
something that has not been explored before. These suggestions are
significant because they express the insights of the researcher after having
studied the phenomenon. You can learn from the mistakes made by the
researcher in his/her research.
SELF-CHECK 2.2
1. What is literature review?
2. What is the purpose of the literature review?
Title: …………………………………………………………………….
Author/s: ………………………………………………………………..
Source: …………………………………………………………………..
Methodology:…………………………………………………………….
Findings…………………………………………………………………
Conclusion:……………………………………………………………….
Comments:……………………………………………………………….
(e) Most important of all, avoid plagiarism. Give due recognition to the works
of other people. It does not cost anything to acknowledge sources. In fact, it
shows the breadth and depth of your review, and the thoroughness of your
work.
SELF-CHECK 2.3
1. „Read, Read, Read!‰. Comment.
2. What are some weaknesses of graduate students when writing a
literature review?
2.7 SOURCES
A good literature review requires knowledge of the use of indexes and abstracts
and the ability to conduct exhaustive bibliographic searches. You should be able
to organise the materials meaningfully, describe, critique and relate each source
to the subject of the inquiry, present the organised review logically and, most
importantly, correctly cite all sources mentioned. Generally, there are two main
sources of materials:
(a) Secondary Sources: These include materials written by an author who was
not a direct observer or participant in the events described. If you read a
textbook on educational psychology, it would be a compilation of the views
and empirical works of other authors rearranged into a textbook. The
textbook is a review of research done by others and is as interpreted by the
author. This interpretation by the author of the textbook would be classified
as a secondary source (note that the interpretation may be biased).
Secondary sources are useful because they provide a quick and relatively
easy method for obtaining an overview of current thinking in the field.
(b) Primary Sources: These include materials that are a direct description of
events by researchers who actually conducted the investigation. Most
primary sources are found in research journals. However, there are also
many reports of research conducted by individuals, groups of individuals
and organisations.
How do you search for research articles, research reports, etc? You can start by
referring to preliminary sources or references such as indexes and abstracts,
which are intended to help you identify and locate research articles and other
primary sources of information. The following are well-known indexes and
abstracts in education:
(e) Current Index to Journals in Education (CIE) ă Indexes over 800 education
journals and includes more than 1,000 articles each month.
Specialised Areas
• Child Development Abstracts and Bibliography
• Exceptional Child Education Resources (ECER)
• Education Administration Abstracts
• Physical Education Index
A further source of information are theses and dissertations that have never been
published. The following are important sources which provide abstracts of
masterÊs theses and doctoral dissertations in education:
(a) Dissertation Abstracts International ă A compilation of abstracts of doctoral
dissertations. Under the education section are sub-topics such as adult
education, art education, preschool and teacher training.
(b) MasterÊs Theses in Education ă This is a listing of masterÊs theses in about
40 major educational topics. It includes name of author, title and institution.
ACTIVITY 2.2
What do you think are some problems graduate students face when
doing literature review for their theses or research projects?
Critique:
(a) Was the reason for answering the research questions or testing the
hypotheses convincing or was it only attempting to appeal to the readerÊs
emotions and merely seeking endorsement from well-known authorities in
the field?
(b) Did the research questions or hypotheses follow logically from theory?
(c) Was there a tendency to oversimplify the theories or studies reviewed?
Critique:
(a) How clear was the selection of subjects?
(b) Was the instrument or treatment administered adequately explained?
(c) Were issues of validity and reliability discussed?
(d) Was the design of the study appropriate? How was it designed to reduce
different types of biases?
(e) What were the independent and dependent variables?
Critique:
(a) Were the results clearly reported (e.g. the use of tables and graphs)?
(b) Did the statistics test the predictions made in the Introduction?
Critique:
(a) Was the author's way the only way to interpret the predicted results?
(b) Can you explain any of the findings the author did not explain or was
unable to explain?
(c) What were the weaknesses or limitations identified by the author or which
you found but were not mentioned?
SELF-CHECK 2.4
1. What major aspects of a study will you examine when
describing a research article?
2. When you critique the Methods section and the Results section,
what are you looking for?
ACTIVITY 2.3
• Literature review shows what has been done in the field and how the
intended study relates to earlier research.
• Literature review delimits a study, relates the methods used by others and
the recommendations of earlier works, and provides the basis for the
intended study.
X INTRODUCTION
Once you have identified your research topic, reviewed the literature and refined
your research questions, you are ready to plan for the data collection stage of the
research process. At this stage, you need to decide what data to collect and how to
collect it. This involves identifying the constructs and variables to be measured.
Without these measurement targets, you may end up collecting irrelevant data and
wasting your time, money and other resources. It is therefore important to
understand the concepts of constructs and variables in educational research.
To be noted is that constructs are not measured directly. Rather, they are inferred
from, and measured by, their empirical referents. The word „empirical‰ refers to
observation. Thus, the empirical referents of a construct are those observable
objects, events and behaviours including beliefs and feelings from which the
construct is inferred. For example, the referents of anxiety are behaviours and
events such as crying, certain facial expressions, certain mannerisms and other
non-verbal behaviours. Note that no single empirical referent ă no one event, object
or behaviour ă fully represents a construct. In other words, no one empirical
referent mentioned above by itself fully represents the construct of anxiety. All
these referents need to be taken together as a whole as the indicator of anxiety.
Figure 3.1 illustrates the process of inference involved in defining a construct.
ACTIVITY 3.1
Given below is a set of items that are supposedly reflective of the
empirical referents associated with the construct of reading attitude. The
referents can be divided into three categories, which make up the three
dimensions of the reading attitude construct defined by MathewsonÊs
well-known tripartite model. The three dimensions are:
• A personÊs prevailing feelings about reading (affective dimension);
• A personÊs belief or knowledge about reading (cognitive dimension);
and
• A personÊs action readiness towards reading (behavioural
dimension).
Dimension Item No
Affective
Cognitive
Behavioural
14 I would spend more time for leisure reading during school days. SD D A SA
20 Reading is useful. SD D A SA
(high, middle, low). Other names for the independent variable are
treatment, factor and predictor variable.
(ii) The dependent variable in this study is academic performance which
cannot be manipulated by the researcher. Academic performance is a
score and other examples of dependent variables are IQ (score from
IQ tests), attitude (score from an attitude scale), and self-esteem (score
from a self-esteem test). Other names for the dependent variable are
outcome variable, results variable and criterion variable.
Put it another way, the dependent variable is the variable predicted to,
whereas the independent variable is predicted from. The dependent
variable is the presumed effect, which varies from changes or variation in
the independent variable.
In research that examines correlation between two variables (e.g. the relationship
between studentsÊ English proficiency and their mathematics performance) as
stipulated in Figure 3.3, both variables are dependent variables.
ACTIVITY 3.2
Can you identify the independent and dependent variables in the
following research study?
You are interested to find out if students who are exposed to project-
based method (PBM) would perform better in the science subject than
those who are taught using the traditional lecture method (TLM).
Besides, you also want to know if the PBM would change their attitude
towards learning science. Lastly, if the PBM has positive effects, you
want to know if it would benefit:
(a) Male students more or female students more.
(b) High achievers more or low achievers more.
Write your answers in the following table.
1 5 Poor
2 7 Poor
3 6 Poor
4 8 Poor
5 13 Good
6 16 Good
7 19 Good
8 20 Good
However, it should be borne in mind that in education not all variables are
directly observable. For example, we cannot really observe learning, memory,
reasoning and so forth. Though these cannot be observed, they can be measured.
With enough indirect evidence, researchers can make a convincing case for the
existence of these invisible variables. For example, though we cannot observe
learning directly, we can see its effect on performance i.e. we can operationally
define learning as an increase in performance. Thus, if we see students improve
their performance after carrying out a task, we can conclude that learning has
occurred.
They are so called in line with the four levels of measurement: nominal, ordinal,
interval and ratio as suggested by Stevens (1951) and cited in Ary, Jacobs and
Razavieh (2002).
Table 3.2: Number and Percentages of Students who Passed or Failed their Science Test
PASSED FAILED
Number of Students 15 5
(75 %) (25 %)
In the above example, passed and failed are categorical variables but the
quantitative data is nominal because it is in the form of frequency count.
Sometimes, you may want to collect more information about the students who
passed or failed the Science test. For instance, you wish to find out how many
who passed or failed were male and female students. You have thus included an
additional categorical variable, which is gender, in the data collection process.
The information may be cross-tabulated as shown in Table 3.3 below:
PASSED FAILED
9 1
Male (60%) (20%)
6 4
Female (40%) (80%)
In this case, the numbers in the rank are ordinal data. Usually, the highest
score is assigned the smallest number while the lowest score is assigned the
biggest number. It must be emphasised that the numbers 1 to 20 themselves
have no particular meaning other than to establish rank. The intervals
between the numbers have no meaning and the numbers themselves have no
absolute quantities.
ACTIVITY 3.3
Refer to the scale in Activity 3.1 and explain the type of data you can
obtain from it.
Good understanding of the four data types is important because it has far-
reaching implications for quantitative data analysis. The appropriateness of the
statistical techniques to be used in the analysis is partly dependent on the types
of data you have collected in your study. You will learn about this in Topics 8
and 9 of this module and in HMEF5113 Statistics for Educational Research.
SELF-CHECK 3.1
1. What is a variable?
2. What is the difference between an independent variable and a
dependent variable? Give specific examples.
3. Why do you need to define variables operationally?
4. Explain the differences and similarities of the four types of data.
ACTIVITY 3.4
• Variables that comprise only two text values or labels are called
dichotomous variables.
• Nominal data takes the form of frequency count. This type of data can also be
expressed in percentages.
• Ordinal data is in the form of rank, either from the lowest to the highest rank, or
vice versa.
• Interval data can have a value on a continuous scale but it does not have an
absolute zero.
• Ratio data is measured on a continuous scale just like interval data but unlike
interval data, it has an absolute zero, which is meaningful.
Categorical variable
Constructs Interval data
Continuous variable Moderator variable
Data types Nominal data
Dependent variable Operational definition of variables
Extraneous variable Ordinal data
Independent variable Ratio data
INTRODUCTION
You may have come across these statements or somewhat similar statements.
Note that each statement is making a claim that their proposed method, product,
technique or procedure is effective in enhancing human performance. Obviously,
you would like to know how they went about proving „effectiveness‰. How does
one prove effectiveness? Of all available research methods, the experimental
method is the best. You may have conducted science experiments in the
laboratory or in the field. The experimental method was originally used in the
field of agriculture where experiments were conducted to test the effectiveness of
various kinds of treatments such as fertilisers, water and sunlight on plant
growth. The method is used in medical sciences especially in testing the
effectiveness of various kinds of drugs, procedures and therapies on patients.
The experimental method is widely used in education whereby researchers
observe the occurrence of a phenomenon as a consequence of a particular action
or intervention.
Hence, it is important that you use the word "effectiveness" carefully, as it only
applies if you are using the experimental method.
SELF-CHECK 4.1
1. What is unique about the experimental method compared to other
methods of research?
2. What is „treatment‰?
3. What is the difference between an experimental group and a
control group? Why do you need these two groups?
Campbell and Stanley state that experiments are internally valid when the
obtained effect can be attributed to the manipulation of the independent variable.
In other words, if the effects (e.g. improved scores in mathematics) obtained in
the experiment are due only to the experimental conditions manipulated by the
researcher and not to any other variables (factors), the experiment has internal
validity. In any experiment, there are always some other factors other than the
independent variable (treatment) that can influence the observed effects
(dependent variable). These variables must be identified and dealt with, or held
constant. Cook and Campbell list a number of factors that can threaten the
validity of experiments. It is important that you know these threats so that you
can take the necessary steps to control the influence of these outside factors to
enhance internal validity.
Figure 4.2: Time interval between pre-test and post-test and threats to internal validity
(a) The first is History, which includes events that have occurred in the
subjectsÊ environment between the pre-test and the post-test that might
affect the scores. For example, the subjects may have experienced events
during the time lapse that affected their attitude, and this is reflected in the
scores of the dependent measure. Generally, the longer the duration
between the pre-test and the post-test, the greater the possibility of history
threatening internal validity. Short time lapses can generate the history
effect too.
(b) The second is Maturation, whereby subjects may change between the pre-
test and post-test, becoming more mature. The change could be both
biological and psychological such as age, learning, fatigue, boredom and
hunger which are not related to specific external events but reside within
the individual.
your pre-test, you may use a multiple-choice test to measure the effect of
the treatment, while in the post-test, an essay test is used.
(d) The fourth is Testing, whereby subjects remember the questions in the pre-
test and if the same test is given as a post-test, the chances are they may
score higher in the post-test i.e. they have become "test-wise". The period
between the pre-test and the post-test should not be too short to the extent
that subjects can recall the questions.
(a) Mortality
Which is sometimes referred to as "attrition" when subjects drop out from the
experiment which can affect the experiment. This is especially serious when
subjects of a particular characteristic (e.g. high ability) systematically drop out.
(b) Selection Bias
When the subjects selected for the experimental group and the control group
are not equivalent before the treatment, leading to a misleading conclusion. For
example, if the experimental group comprises 50% high ability subjects while
the control group comprises only 25% high ability subjects, higher
performance on the post-test may not be attributed to the treatment but to
non-equivalent subjects in terms of ability.
(c) Regression to the Mean
When subjects with extreme scores on a test are selected, there is a likelihood
that when they are retested later on a measure that is correlated with the first
test, their scores will move towards the mean. For example, if students who
performed poorly are selected for training, their average post-test scores will
be higher than their pre-test scores because of statistical regression even if no
training was given.
experiment in different ways which may affect the experiment. This is termed as
„subject-experimenter effects‰.
(a) Subject Effect ă The perception of subjects when they enter the experiment
can affect how they respond to the tasks required of them. Their perception
of the purpose of the experiment, the task required and the rumours they
hear about the experiment may cause them to behave differently. For
example, subjects who realise that an experiment is about speed of learning
and intelligence might learn the material presented as rapidly as possible to
appear intelligent. Similarly, if a task suggests something about emotional
stability, the subject may respond in such a way as to appear most
emotionally stable.
(b) Experimenter Effect ă The experimenter has a motive for conducting the
experiment. He or she is attempting to uncover the laws of human
behaviour through experimentation. Towards this goal, the experimenter
expects subjects to be perfect respondents who will cooperate and follow
instructions carefully. The experimenter may be too keen to obtain findings
that confirm the hypotheses and this desire is communicated unconsciously
to subjects. The subtle cues presented by the experimenter are picked up by
subjects and influence their performance in the direction desired by the
experimenter. Certain attributes of the experimenter have shown to
influence subjects. For example, in some experiments, young children
respond more readily to women experimenters compared to their male
counterparts.
SELF-CHECK 4.2
1. What is meant by internal validity and why is it an important
ingredient in experimental research?
2. Identify the major extraneous variables that need to be controlled
in an experiment.
3. How do these extraneous variables affect the internal validity of
experiments?
4. Explain how subjects and the experimenter can bias the results of
an experiment.
23 34 26 91 73 93 83 59 50 51
76 79 54 45 65 13 11 56 91 27
68 57 37 38 45 45 04 85 66 12
45 25 98 63 52 23 03 36 06 08
89 3 39 34 91 94 12 39 13 31
90 26 83 26 21 34 82 07 34 67
23 61 64 65 37 06 54 26 29 75
87 82 51 02 95 64 62 35 96 49
90 71 25 86 62 39 53 49 48 52
12 38 67 09 67 31 45 40 28 31
SELF-CHECK 4.3
1. Why is it important that subjects are assigned randomly in an
experiment?
2. How do you use the Table of Random Numbers to assign subjects
randomly?
4.4.1 Matching
Determine a particular factor such as academic performance which is measurable
and can be categorised as High and Low. From the sample, select two High
Ability subjects and randomly assign them to the control group and the
experimental group. Next, select two Low Ability subjects and assign them
randomly to the control group and the experimental. Continue doing this until
all subjects have been assigned and your two groups are matched in terms of
academic performance.
Another technique of matching is to give the pre-test and, based on the scores
obtained, assign subjects to the control group and the experimental group.
However, you should ensure that the average score or mean score of the pre-test
is the same for the two groups. E.g. two subjects with mean 23; two subjects with
mean 30; two subjects with mean 34 and so forth.
"You should keep in mind that ANCOVA is an imperfect statistical technique for
equating experimental groups prior to the treatment period. Only the variables
that are measured can be used as covariates. The groups may differ on other
variables, but if these variables have not been measured, they cannot be entered
into the ANCOVA."
SELF-CHECK 4.4
1. Besides randomisation, what are the three techniques of
increasing the probability that subjects in two or more groups are
equivalent?
2. Explain the differences between these three techniques.
H0: M1 = M2 (Equation A)
H0: M1 – M2 = 0 (Equation B)
[H0 = notation for the null hypothesis. You can have H1, H2, H3 and so on;
which simply means Hypothesis 1, Hypothesis 2, Hypothesis 3 and so forth]
So for Equation A, the mean score for Group 1 is more or less EQUAL to the
mean score of Group 2, and if there is no statistical significance difference, then
the null hypothesis is accepted.
Equation B is similar. The mean of Group 1 minus the mean of Group 2 is equal
to 0, indicating no significant differences and so, the null hypothesis is accepted.
If the means of the two groups are different and there is a statistical difference,
then you can reject the null hypothesis (Equation C).
Mean Standard
Deviation
Experimental Group 30.4 3.7
Control Group 28.3 4.1
To the naive person, he or she might conclude that the experimental group
performed better than the control group because the mean score is higher by 2.1
and so the treatment is effective. This is misleading because it is likely that the
differences in the mean between the experimental group and control group could
have occurred by chance. In order for you to accept or reject the null hypothesis,
it is necessary that you analyse the data statistically because you want to be sure
that the treatment administered produced a real effect. How do you determine
that the difference between the two groups is caused by the treatment and not
some other extraneous variables? You could repeat the experiment and see if you
get the same results which will provide evidence on the reliability of the obtained
findings.
However, this is not an economical approach and for this reason statistical tests
are preferred. The test of significance enables one to determine whether the
amount of difference between the two groups is due to chance or treatment. Does
a large difference between the mean score of the experimental and control group
indicate that the difference is real? Even large differences can occur by chance,
although the probability of this happening would be very low. The most
common practice is to state a significance level that must be reached; which is a
statement of the probability that an observed difference is chance difference. The
most common significance levels are .05 and .01; regardless of whether you are
using the t-test, F-test or chi-square.
If you decide from the onset of the experiment that the .05 significance level is to
be used, it means that you will accept as a real difference only one that is so large
that it could have occurred by chance only 5 times in 100 (i.e. 95% not due to
chance). If the .01 significance level is selected, then the difference can be
expected to occur only 1 time in 100 by chance (i.e. 99% not due to chance).
SELF-CHECK 4.5
ACTIVITY 4.1
ACTIVITY 4.2
Random assignment means that each sampling unit has an equal chance of
being selected in the experiment.
X INTRODUCTION
What is meant by research design? According to Christensen (1988), „research
design refers to the outline, plan or strategy specifying the procedure to be used
in seeking an answer to the research question. It specifies such things as how to
collect and analyse the data‰. The design of an experiment will show how
extraneous variables are controlled or included in the study (refer to the control
techniques discussed in Topic 4). The design will determine the types of analysis
that can be done to answer your research questions and the conclusions that can
be drawn. To what extent your design is good or bad will depend on whether
you are able to get the answers to your research questions. If your design is
faulty, the results of the experiment will also be faulty. How do you go about
getting a good research design that will provide answers to the questions asked?
It is not easy and there is no fixed way of telling others how to do it. The best that
can be done is to examine different research designs and to point out their
strengths and weaknesses, and leave it to you to make the decision.
You conclude that praising children increases their mathematics score. This
design is weak for the following reasons:
(a) Selection Bias: It is possible that the pupils you selected as subjects were
already good in mathematics.
(b) History: The school had organised a motivation course on mathematics for
Year 4 pupils. So, it is possible that it might influence their performance.
ACTIVITY 5.1
The three designs described are „weak‰ research designs because they do not
allow for controlling of extraneous factors that might creep into the experiment.
These factors may affect the results of the dependent measure. For example, if the
attitude towards mathematics and outside tuition in mathematics is not
SELF-CHECK 5.1
1. Identify the major differences between one-shot design, one-
group pre-test post-test design and non-equivalent post-test
only design.
ACTIVITY 5.2
In the above example, the experimental and control groups comprise two
different sets of students. This procedure is called a between-subjects design (also
sometimes known as an independent or unrelated design). One advantage of this
design is that the students are less likely to get bored, tired or frustrated with the
study because each set of students is exposed to only one condition. In a similar
Copyright © Open University Malaysia (OUM)
TOPIC 5 EXPERIMENTAL RESEARCH DESIGNS W 73
vein, the research is less susceptible to practice and order effects. On the negative
side, you will need more students to participate in your research. There is also a
need to ensure that both groups of students are homogeneous in any
confounding variables that might affect the outcome of the study. This is because
different students bring different characteristics to the experimental setting. Even
though we randomly assign students to experimental and control conditions, we
might by chance allocate students with one characteristic to one condition, and
this might confuse our results.
So there are four possible combinations of the two independent variables. Each of
these treatment combinations is referred to as cells (i.e. A1B1, A2B1, A2B1 and
A2B2). Subjects are randomly assigned to these four cells within the design. For
the experiment using this factorial design, you are looking for three different
kinds of effect: the main effect of method, the main effect for ability and the
interaction between method and ability.
Figure 5.7: Factorial design showing means for ability and method
Results of the hypothetical experiment are shown in Figure 5.7. The main effects
for methods (variable A) showed that there was a significant difference in history
performance between students taught the inductive (Mean = 50.0) and the
deductive approach (M = 40.0). This means that method had an „effect‰ on
history performance. There was also a main effect for ability (variable B) where a
significant difference was observed between high (M = 55.0) and low ability (M =
35.0) students on performance in history. Similarly, it means that ability had an
„effect‰ on history performance.
(a) Main Effect for Method and Main Effect for Ability and No Interaction (No: 3)
Let us examine what this means with our hypothetical example. The data in
Figure 5.7 indicates that you have main effects for both method and ability.
Look at the first row. You can see that high ability learners treated with the
inductive method (M = 60.0) scored higher than high ability learners treated
with the deductive method (M = 50.0). Looking at the next row, you see
that low ability learners treated with the inductive method (M = 40.0)
scored higher than low ability learners treated with the deductive method
(30.0). You can see this in Figure 5.8.
Looking at the columns tells you about the effect of ability. You see that
high ability learners treated with the inductive method scored 60.0
compared to their low ability counterparts who scored 40.0 (i.e. 20 more) in
the same treatment group. Looking at the second column, you learn that
high ability learners (M = 50.0) treated with the deductive method scored
higher than low ability learners (M = 30.0) treated with the same method.
Thus, it appears that in addition to the method main effect, you have an
ability main effect.
Finally, you also know that there is no interaction because the effect of
method is unaffected by the ability level of students. As Figure 5.7
demonstrates, the effect of method is independent of the ability level and
the effect of ability level is independent of method of instruction. If you
graph the means, your graph should look something like Figure 5.9. The
graph confirms what you saw in Figure 5.7. The high ability line is above
the low ability line. Similarly, ability increases as shown by the fact that
both lines slope upwards as they go from the deductive to the inductive
method. Finally, the graph tells you that there is no interaction between
method and ability on performance in history because the lines are parallel.
(b) No Main Effect for Method and No Main Effect for Ability but an
Interaction (No: 7)
Let us examine what this means with our hypothetical experiment.
According to Figure 5.10, the means obtained on history performance
according to method reveals no significant difference between the inductive
method (M=55.0) and the deductive method (M=55.0). Similarly, for ability,
there was no significant difference between high ability students and low
ability students. However, there was an interaction and the interaction was
significant (see Figure 5.11). In this figure, you noticed that the lines are not
parallel (as in Figure 5.9). Therefore, you have an interaction. What is the
meaning of this interaction since there was no effect for either method or
ability? You would say that method has an effect, but its effect depends on
ability level. Alternatively, you could say that ability has an effect but that
effect depends on the type of method students had been treated with.
Figure 5.10: Factorial design showing means for methods and ability
ACTIVITY 5.3
A lecturer doing an experiment finds that students who are given lecture
notes but do not attend the lecture perform better than those who attend
the lecture. Refine the study by using a 2 X 2 factorial design.
Thus, we can conclude that a true experimental research has three distinct
characteristics (sometimes referred to as three basic principles of
experimentation).
(a) First, it involves the intervention or treatment of independent variables. The
independent variables are manipulated systematically to examine the
effectiveness of the treatment.
(b) Secondly, there is a control of variables. The purpose of this control is to rule
out extraneous variables that might confound the experiment.
(c) Thirdly, a true experiment requires an appropriate comparison. For instance,
comparison is made between two or more groups that are treated
differently.
SELF-CHECK 5.2
1. What is the main strength of „true‰ experiments?
2. What is the major difference between the two types of true
experiments; after-only research design and before-after research
design?
SELF-CHECK 5.3
1. What is the main advantage of using the factorial design?
For example, sometimes it is not possible to assign students to groups which are
a requirement of strong experimental research. Due to logistical reasons, it is
difficult to randomly assign subjects to groups and, so, intact groups such as a
class may have to be used. Is it still possible to do an experiment despite these
limitations? The answer is yes, you can use a quasi-experimental design.
In non-equivalent control-group design, both groups are given first a pre-test and
then a post-test (after the treatment is given to the experimental group). The pre-
test score and the post-test score are compared to determine whether there are
significant differences.
When you cannot randomly assign subjects, you can be sure that extraneous
variables or factors will creep into the experiment and threaten its internal
validity. Do you leave it alone or do something about it?
Cook and Campbell proposed the following steps to enhance the internal validity
of non-equivalent control-group design or quasi-experiments in general:
(a) Selection: Ensure that subjects in experimental and control groups are
matched in terms of important variables that may affect the results of the
experiment. For example, match subjects in terms of academic ability, IQ,
attitudes, interests, gender and socio-economic background.
(b) Testing: Ensure that the time period between the pre-test and post-test is
not too short such that subjects are able to remember the questions given to
them earlier.
(c) History: Ensure that events outside the experiment do not affect the
experiment. The problem is most serious when only subjects from one of
the groups are exposed to such events (e.g. motivation talks and private
tuition).
(d) Instrumentation: Ensure that the pre-test and the post-test are similar. If a
different test is used, you should make sure that the two tests are
equivalent in terms of what they measure (i.e. high reliability and validity).
and then posted a number of times after the treatment. „Baseline‰ refers to the
testing done before the treatment designed to alter behaviour.
A hypothetical example may illustrate how the interrupted time series design is
used. For example, you want to determine whether positive reinforcement
encourages inattentive low ability learners to be more attentive. You identify a
group of 11-year-olds who are low ability learners and get them to attend an
experimental classroom for at least one period each school day as in Figure 5.14.
In this classroom, subjects were taught reading skills in a positive environment
where they were praised and rewarded for attentive behaviour to the given task
activities. Before the students were sent to the positive treatment classroom, their
behaviour was observed over three sessions in their regular classroom with
regards to their attentiveness. This was to obtain baseline data where their
behaviour was recorded in its freely occurring state. The treatment lasted three
weeks and after the treatment, subjects were observed for their attentiveness and
focused behaviour.
SELF-CHECK 5.4
1. What is the meaning of non-equivalent in non-equivalent control
group design?
2. How can you enhance the internal validity of quasi-experimental
research designs?
3. When would you use interrupted time-series design?
Any researcher conducting an experiment must ensure that the dignity and
welfare of the subjects are maintained. The American Psychological Association
published the Ethical Principles in the Conduct of Research with Human
Participants in 1982. The document listed the following principles:
(a) In planning a study, the researcher must take responsibility to ensure that
the study respects human values and protects the rights of human subjects.
(b) The researcher should determine the degree of risk imposed on subjects by
the study (e.g. stress on subjects, subjects required to take drugs).
(c) The principal researcher is responsible for the ethical conduct of the study
and be responsible for assistants or other researchers involved.
(d) The researcher should make it clear to the subjects before they participate in
the study regarding their obligations and responsibilities. The researcher
should inform subjects of all aspects of the research that might influence
their decision to participate.
(e) If the researcher cannot tell everything about the experiment because it is
too technical or it will affect the study, then the researcher must inform
subjects after the experiment.
(f) The researcher should respect the individualÊs freedom to decline to
participate in or withdraw from the experiment at any time.
(g) The researcher should protect subjects from physical and mental
discomfort, and harm and danger that may arise from the experiment. If
there are risks involved, the researcher must inform the subjects of that fact.
(h) Information obtained from the subjects in the experiment is confidential
unless otherwise agreed upon. Data should be reported as group
performance and not individual performance.
SELF-CHECK 5.5
ACTIVITY 5.4
Music
Classical Rock
Lighting Dim 45 11
Level
Bright 12 44
• Examples of weak designs are one shot design, one-group pre-test post-test
design and non-equivalent post-test only design.
• Examples of true designs are after-only research design, factorial design and
before-after research design.
Copyright © Open University Malaysia (OUM)
86 X TOPIC 5 EXPERIMENTAL RESEARCH DESIGNS
• A quasi-experimental design is a design that does not provide for the full
control of potential confounding variables.
LEARNING OUTCOMES
By the end of this topic, you should be able to:
1. Define what a survey is;
2. Compare the different types of survey methods;
3. Explain the process of selecting a sample using different techniques;
4. List seven major steps in conducting survey research;
5. Discuss the advantages and disadvantages of different survey data
collection methods; and
6. Elaborate on the ethics involved in conducting surveys.
X INTRODUCTION
Do you want to know what people are thinking, feeling or doing? If you do, then
a survey would be the best research method. You can survey them orally through
an interview or you can give them a questionnaire. By asking questions, you are
tapping into peopleÊs attitudes, beliefs, perceptions and behaviour. The survey
has proven to be a powerful tool in gathering information about the opinions,
attitudes and intentions of people concerning different social, cultural, economic
and political issues.
It is quite common for newspapers, radio and television to report about the
opinions and perceptions of people obtained through surveys (at times using
survey data to sensationalise issues). Surveys are used by various government
agencies, non-governmental agencies, business and scientific organisations to
probe into peopleÊs reactions to different issues and trends. For example:
• TV stations rely on surveys to profile the people who watch programmes
over their stations.
The survey was designed to allow children to reflect on and rank their
parentÊs ability to show respect and love, have integrity and patience and
be open-minded, among a myriad of other skills and characteristics.
Although the Malaysian dad ranks lower than the Malaysian mum, a
general analysis of the survey results reveals fascinating details of
Malaysian fathers as seen through the eyes of young adults. For instance:
• Dad got a B for telling jokes and C for being fashionable.
• Dad does not really allow them much independence (C-) but
respects their privacy (B) and trusts them (B).
• Dad takes the trouble to teach them right from wrong (B+).
• Dad accepts their own taste in music (B).
• Dad talks about sex (D).
ACTIVITY 6.1
Read the newspaper report on „Malaysian teens grade dad a C-‰.
(a) What are your views about the findings of the survey?
(b) What other information would you need about the survey to
accept the findings?
SELF-CHECK 6.1
6.3 SAMPLING
Unlike a census, where all members of the population are studied, surveys gather
information from only a portion of the population of interest ă the size of the
sample will depend on the purpose of the study. In a good survey, the sample is
not selected haphazardly or only from persons who volunteer to participate. It is
scientifically chosen so that each person in the population will have a measurable
chance of being selected. This procedure is called sampling. This way, the results
can be reliably projected from the sample to the larger population. Two key
words involved in sampling are population and sample. According to Borg and
Borg (1988), the word „population‰ is defined as all people, objects or events
Copyright © Open University Malaysia (OUM)
92 X TOPIC 6 SURVEY RESEARCH METHODOLOGY
If the population is small, e.g. school principals in Kuala Lumpur, the researcher
may decide to study all subjects. When the population is large, the researcher is
not able to study all individuals. It would be too expensive and time-consuming
even for large research organisations. Only in a census is the whole population
studied which involves enormous expenditure, time and many research
assistants. Just imagine having to administer a questionnaire to 100,000 sixteen-
year-olds in Malaysian schools!
Hence, the researcher has to select a certain number of subjects or a sample from
the population to study. Regardless of the method used in selecting the sample,
the most important thing is to ensure that the sample is representative of the
population. Figure 6.2 shows a population of 10,000 individuals who possess
certain characteristics in terms of gender, socio-economic status and ethnicity. If
you intend to select 5% of individuals from the population to form a sample, you
should ensure that the sample has similar characteristics to the population. This
is called population validity whereby the researcher is trying to show that the
sample is representative of the population according to the variables specified.
Using a sample is more economical and if the sample is selected appropriately,
the researcher can make conclusions about the population based on the results
from the sample. This is called generalisation.
ACTIVITY 6.2
the teaching profession. The sample you selected will have to represent the
attitudes of the target population as illustrated in Figure 6.3. Due to
financial and time constraints, you are unable to survey the attitudes of all
graduate teachers across the whole of Malaysia (N = 100,000) and so you
decide to confine your study to graduate teachers in Perak (n = 15,000)
which is called the accessible population. From the accessible population, a
sample of 100 teachers is drawn.
Even though the sample is selected from the accessible population, you
may want to know the degree to which the results can be generalised to the
target population which requires two steps. In the first step, you generalise
from the results of the sample to the accessible population (teachers in
Perak). Second, you generalise from the accessible population to the target
population (graduate teachers in Malaysia). The leap from sample to the
accessible population presents no problem if a random sample of the
accessible population is obtained.
However, in order to make the leap from the accessible population to the
target population, you must gather data to determine the degree of
similarity between graduate teachers in Perak and graduate teachers in the
whole of Malaysia. If you can demonstrate, based on a number of variables
(such as gender, age, experience and ethnicity) that the accessible
population is closely comparable to the target population, you have
established population validity i.e. the accessible population is
representative of the target population.
How do you select a sample randomly? The usual definition of a random sample
is that it is a procedure in which all individuals in the defined population have an
equal and independent chance of being selected as a member of the sample.
Independent means the selection of an individual is not affected by the selection
of another individual. In other words, each individual, event or object has the
same probability of being selected. For example, the number of graduate teachers
in Perak is 10,000 and you intend to draw a sample of 100 teachers as in Figure
6.3. When you select the first teacher, he has a 1:10,000 chance of being selected.
Once this teacher is selected, however, there are only 9,999 cases remaining so
that each teacher has a 1:9999 chance of being selected as the second case. Thus,
as each case is selected, the probability of being selected next changes slightly
because the population from which you are selecting has become one case
smaller.
The Table of Random Numbers is often used in the selection of a random sample
as in Table 6.1. You need to obtain a list of the 10,000 graduate teachers in Perak
and assign numbers 1 to 10,000. Using the table of random numbers, randomly
select a row or column as the starting point. For example, you select Column 5.
Select all the numbers that follow in that column. So, you will select a teacher
with the assigned number 7,332, followed by a teacher with the assigned number
6,516, and then a teacher assigned with the number 4,553 and so forth. If you
need more numbers, you can proceed to the next column until you have enough
numbers to make up the desired sample (i.e. n = 100).
1 2 3 4 5 6 7 8 9 10
1 2345 3445 2678 9156 7332 9332 8345 5950 5023 5189
2 7612 7989 5456 4523 6516 1345 1123 5636 9189 27452
3 6823 5732 3702 3808 4553 4589 0467 8506 6612 12136
4 4598 2564 9860 6360 5245 2347 0391 3623 0620 0850
5 8956 357 3934 3495 9112 9472 1254 3998 1390 3194
6 9059 2691 8395 2634 2189 3465 8223 0745 3487 6709
7 2312 6120 6425 6556 3720 0639 5490 2614 2950 7556
8 8787 8236 5153 0202 9530 6490 6220 3523 9691 4917
9 9063 7190 2590 8696 6267 3923 5360 4937 4854 5223
10 1298 3820 6737 0932 6719 3154 4532 4046 2860 3191
After the required sample size has been calculated, every Nth record is
selected from a list of population members. As long as the list does not
contain any hidden order, this sampling method is as good as the random
sampling method. Its only advantage over the random sampling technique
is simplicity. Systematic sampling is frequently used to select a specified
number of records from a computer file.
Males Females
ACTIVITY 6.3
You have been appointed to lead a research team assigned with the
task of finding the reasons teenagers smoke. The team has decided to
conduct a nationwide survey involving students between 14 and 16
years of age in secondary schools.
(a) Suggest three research questions that will guide data collection.
(b) Explain how you plan to draw the sample of students using
stratified sampling. What sub-groups would you include?
(c) What further information do you need to draw a representative
sample?
SELF-CHECK 6.2
ACTIVITY 6.4
N=118
N=134 N=105 N=113
N=154
N=111 N=129 N=98
N=109
SELF-CHECK 6.3
1. What is the major difference between probability and non-
probability sampling techniques?
2. Why are non-probability sampling techniques used despite their
many weaknesses?
The following are some guidelines you can use in deciding how large your
sample should be:
(a) When the sample selected has to be broken down into smaller groups
involving comparisons of groups, a larger sample is advisable. For
example, you want to compare the self-esteem of males from low socio-
economic backgrounds from single-parent families against males from high
socio-economic backgrounds from single-parent families.
(b) Attrition is expected especially in longitudinal studies. The longer the
duration of a study, the higher the number of subjects who drop out will be.
To reduce attrition, you should keep demands on subjects to the minimum,
fully inform them about the study, obtain a strong commitment from them
before the study and make frequent contact with them to maintain interest
and rapport.
(c) When the population is highly heterogeneous on the variables being
studied, you need to have a larger sample so that the different
characteristics of individuals are satisfactorily represented. If members in
the population are more or less similar, then you may only need a small
sample as most of the characteristics can be captured.
SELF-CHECK 6.4
What are the factors that you will consider when making decisions
about sample size in a survey?
ACTIVITY 6.5
This initial planning requires some idea of the final analysis and it may
be useful at the outset to outline the key tables for the final report
(better still if you are able to create dummy tables), and to consider the
number of cases expected in each major cells or sub-groups. Conduct a
review of literature to define terms and justify theory underlying
research questions.
SELF-CHECK 6.5
1. List some of the qualities of a good questionnaire.
2. Why is pilot-testing necessary?
is low response rate. Also, since the researcher is not present, there is no
way for respondents to seek clarification if questions are unclear.
Respondents should be informed about the purpose of the survey and have the
option not to participate or not to divulge information that he or she does not feel
comfortable with. For example, respondents may be reluctant to disclose income.
To overcome this, you may want to use categories (e.g. RM1,500 ă RM2,000 per
month) which may be less intrusive. You should determine in the pilot-test
which items respondents are uncomfortable with, so that you do not have too
many unanswered questions to the point that some research questions cannot be
answered. The questions asked should not in any way attempt to deceive
respondents. The integrity of a survey is enhanced if respondents are clear about
the purpose of the study.
ACTIVITY 6.6
1. You wish to study moral reasoning among 15-year-olds in a large
district. A total of 4,100 15-year-olds are enrolled in 105
classrooms in the district. You plan to obtain a total of 250
students using the cluster sampling technique. Describe the steps
you would take in selecting the sample.
2. Discuss some of the problems with telephone interviews that may
affect the results of a survey.
3. One of the drawbacks of mail survey is the low response rate.
Suggest how you would increase response rates.
4. When would you use a longitudinal survey rather than a cross-
sectional survey?
5. What are some problems with surveys conducted over the
Internet? How can you overcome these problems?
• Surveys are useful for gathering data about what people are thinking, feeling
or doing.
X INTRODUCTION
The most important and sometimes difficult aspect of educational research is
determining the instruments to be used in data collection. An example of an
instrument is the Self-Esteem Test shown in Figure 7.1. There are numerous tools
that may be used to obtain information. In this topic, we will discuss in detail the
process of instrumentation and look at some examples of widely used types of
instruments. We will also discuss two very important concepts related to
instrumentation: reliability and validity. Fraenkel and Wallen make a distinction
between instrument and instrumentation. An instrument is a device or procedure
for systematically collecting information while instrumentation includes both the
instrument and „the conditions under which it is used, when it is used, and by
whom it is used‰. Examples of instruments are tests, questionnaire, rating scales,
inventories and checklists.
This test is designed to test your level of self-esteem. Read each item and state
whether you strongly agree, agree, disagree or strongly disagree with each
statement by referring to yourself.
4 3 2 1
1. I am an attractive person.
4. I like my hair.
Source: Reprinted with permission: © CoPS Project Malaysia (2005). John Arul Phillips.
ACTIVITY 7.1
Questionnaires are widely used because they are cost effective and can be
administered to large groups of people. There is greater standardisation in
questionnaires as each respondent receives the same set of questions which
allows interpreting from a large sample. The questionnaire, if properly designed,
can address many issues in a relatively efficient way. Also, there is assurance of
anonymity which increases the likelihood that the responses are genuine and
reflective of the opinions, perceptions and beliefs of respondents. However, the
quality of data obtained from a questionnaire will depend on how well it is
designed. Fowler suggests the following important criteria when designing a
questionnaire:
• Is this a question that can be asked exactly the way it is written?
• Is this a question that will mean the same thing to everyone?
• Is this a question that people can answer?
• Is this a question that people will be willing to answer?
Designing a questionnaire is both an art and science; and takes time and
precision. Before setting out to design a questionnaire from scratch, it is a good
idea to find out whether such questionnaires already exist that could be used or
modified to gather the information that you seek. A little bit of research could
save you a lot of work in creating an entirely new questionnaire.
In designing a questionnaire, you have to determine the questions that you want
to ask people; select the question response format for each question, decide on
question sequence and overall layout, pilot-test and implement the instrument.
Deciding what questions to ask will depend on the aims of your survey. For
example, if the aim of your study is to survey the reading habits and interests of
teenagers, one of the questions you will ask is: „What kind of books do you like
to read?‰ You should constantly check to see that there is a close link between the
research aims and the individual questions that you ask. You do not want a
situation where the data collected from your questionnaire does not answer the
research questions.
(b) Check Only One Answer from a List: The respondent is required to check
or tick only ONE answer.
Primary school
Secondary school
Form 6/Matriculation
Degree
(c) Checking More than One Answer from a List: The respondent is required to
place a check or tick next to one or more answer.
Violence on TV
Lack of discipline at home
Insecure with themselves
Peer pressure
(d) Ranked Items: Respondents can also be asked their preferences by ranking
the items from 1 to 2 and so forth. We want the respondent to put a 1, 2, 3, 4
or 5 next to the brand, where 1 is the respondentÊs first choice, 2 the second
choice and so forth.
Rank the following brands of cars in terms of prestige:
(1 ă most prestigious, 2 ă second most prestigious and so forth)
Proton _____
Toyota _____
Honda _____
Nissan _____
Ford _____
(e) Rated Responses: The Likert Scale has become a popular tool in getting
people to respond to questions. The Likert scale is a rating scale which
measures the strength of agreement with a statement.
AGREEMENT
Strongly Disagree Undecided Agree Strongly
Disagree Agree
1 2 3 4 5
FREQUENCY
Very Rarely Rarely Occasionally Frequently Very
Frequent
1 2 3 4 5
IMPORTANCE
Unimportant Of little Moderately Important Very
importance important important
1 2 3 4 5
(f) Filter Questions: This format is used when you want to obtain information
from people who „do‰ or „do not‰ engage in something or believe in
something or have an opinion about something (i.e. you are filtering).
SELF-CHECK 7.1
(c) Instructions:
(i) Instructions on how to answer the questions should be clear and
concise. They should be easy to understand and use words that are
not difficult to understand.
(ii) State the purpose of the survey and who is administering or
sponsoring the survey.
(iii) Indicate how confidentiality is protected. This assurance may increase
the likelihood of honest responses.
(iv) Indicate who the respondents can call or write if they have questions
or concerns, and also if they want a copy of the survey results.
(ii) Group questions that are similar together in terms of format and the
specific covered (e.g. group questions asking respondents about their
background such as occupation, income, age, etc.).
(iii) The transition from one question to another should not be too abrupt.
Avoid questions that jump from one unrelated topic to another.
(e) Scales:
(i) When you want subjects to respond to a scale from lower to higher, it
is usually better to place the lower end of the scale on the left and the
higher end of the scale to the right.
1. Never 2. Seldom 3. Occasionally 4. Always
(ii) Avoid having too many divisions in your scale which strains the
respondent (see example below). Usually a 4-point or 5-point scale
would be adequate.
1. Never 2. Seldom 3. Occasionally 4. Fairly often
5. Often 6. Very Often 7. Always
SELF-CHECK 7.2
answer. Also, take note of the time taken to complete the questionnaire. The aim
of the pilot-test is to detect any mistakes in your questioning and to correct them
before the main survey. The pilot-testing may also allow you to convert an
unstructured or open-ended question to a structured or closed question by
determining the range of possible answers or responses. Based on the pilot-test,
you will be able to make changes that will help maximise response rate and
minimise confusion among respondents.
ACTIVITY 7.2
Design a 15-item Teacher Workload Questionnaire focusing on the following:
(a) Planning for teaching;
(b) Preparing for assessment and marking papers;
(c) Clerical and record keeping;
(d) Attending meetings; and
(e) Administrative duties.
(a) Define the Construct: Define the attitude (or construct) that you wish to
measure. You should review the literature in the field to see how the
attitude has been defined. Examples of attitudes: attitudes towards racial
integration, attitudes towards women bosses, attitudes towards smoking
and attitudes towards water conservation.
(b) Break down the Construct: Usually an attitude (or construct) needs to be
broken down into a number of categories. In other words, you are
providing an operational definition of the attitude. For example, racial
attitudes can be broken down into: marriage, social interaction, cultural
heritage and workplace.
(c) Brainstorm: Discuss and come up with a list of about 8 to 10 statements for
each category of the attitude. Develop an equal number of positive and
negative statements about each category of the attitude object.
(d) Rating the Items: Next is to have a group of judges or panel of experts rate
each statement on a five-point rating scale to ensure content validity (or
face validity). Do the statements cover the breadth of the attitude being
assessed? Measure what a statement is supposed to measure. For example,
is the statement, „I would be or have been in a romantic relationship with a
person of another race‰, describing a racial attitude?
(e) Scale: Decide on an appropriate scale such as the following which has five
possible responses: 1 = strongly disagree, 2 = somewhat disagree, 3 =
undecided, 4 = somewhat agree, and 5 = strongly agree. If you want to
avoid an „undecided‰ category, you can choose to use an even number of
choices i.e. 4-point Likert scale or a 6-point Likert scale.
(f) Pilot-Testing: Administer the attitude scale to a group of subjects and score
the instrument. Monitor to see if respondents had problems in
understanding the items. The final score for the respondent on the scale is
the sum of their ratings for all of the items (this is sometimes called a
"summated" scale). On some scales, you will have items that are reversed in
meaning from the overall direction of the scale. These are called reversal
items. You will need to reverse the response value for each of these items
before summing up the total. That is, if the respondent gave a 1, you make
it a 5; if they gave a 2, you make it a 4; 3 = 3; 4 = 2; and, 5 = 1.
Copyright © Open University Malaysia (OUM)
TOPIC 7 INSTRUMENTATION W 123
(g) Reliability: You have to examine the test for reliability which means
whatever the instrument measures, it measures consistently. CronbachÊs
alpha measures how well a set of items measures a single construct (or
attitude). It measures consistency within the instrument where all items are
compared with each other. Alpha coefficient ranges in value from 0 to 1 and
the higher the score, the more reliable is the attitude scale. Nunnaly has
indicated 0.7 to be an acceptable reliability coefficient but lower thresholds
are sometimes used in the literature.
(h) Reliability Analysis: See Figure 7.2 which is a printout of the reliability
analysis done for a sample of items. The Cronbach alpha for the instrument
is 0.77 (rounded from 0.77102). The second column shows the correlation
between each item with the total or the rest of the items. The third column
shows the correlation coefficient obtained if a particular item is deleted.
Note that if you were to delete Item 4, the Cronbach alpha will increase to
0.82 (rounded from 0.816080).
(i) Final Items: Based on the reliability analysis, the final items to be included
in the instrument can be decided. Generally, the reliability of the scales
tends to increase with the number of items. However, as the number of
items in a scale increases, so the time taken to complete the attitude test will
also increase, and this may demotivate respondents. There is no hard and
fast rule to determine the final number of items in a scale and this will
reflect the nature and complexity of the attitude being assessed. Generally,
fewer than 20 items may reduce reliability acceptably, but more than 30 will
begin to demotivate the respondent. Another point to note is that attitude
scales are summated scales. The scores for individual items are added
together to form the overall score for each respondent. This score is
normally divided by the number of items in the attitude scale to obtain the
Copyright © Open University Malaysia (OUM)
124 X TOPIC 7 INSTRUMENTATION
ACTIVITY 7.3
1. Identify positive and negative statements.
2. Complete the attitude scale by adding three positive statements
and three negative statements.
3. Try out the complete attitude scale with some of your colleagues
and friends.
ACTIVITY 7.4
• Questionnaires are widely used because they are cost effective and can be
administered to large groups of people. If properly designed, questionnaires
can address many issues in a relatively efficient way.
• Attitude scales may be designed to measure attitudes using the Likert scale.
INTRODUCTION
Descriptive statistics is concerned with summarising data in order to describe the
characteristics of a sample. Examples of descriptive statistics are statistics of
students enrolled in an education programme, studentsÊ score in an achievement
test and survey results on certain behaviour. The commonly used methods in
describing data are presentation using charts and graphs, measure of central
tendency and measure of dispersion. These forms of descriptive analysis are also
known as exploratory data analysis (EDA).
This topic will not delve into descriptive statistics in detail. You will learn more
about it in HMEF5113 Statistics for Educational Research.
Variable Number of
Variables Data presented as Type of Chart
Type
Two Raw Form Scatter Diagram
Raw Form Stem and Leaf
Continuous One Frequency Distribution Histogram
Polygon
Summary Table Bar Chart
One
Pie Chart
Categorical Cross Tabulation
Two Side-by-Side Bar Chart
(Contingency Table)
Example:
The data below shows the number of tuition classes attended by 16
secondary school students in a week:
1, 3, 5, 2, 4, 3, 1, 4, 3, 2, 3, 3, 4, 2, 2, 3
Table 8.1 is the frequency distribution table for the data, while Figure 8.1
displays the pie chart.
Example:
The data below describes the average number of hours in a day spent by 35
students doing mathematics revision.
Range
Width of interval ≈ .
Number of desired class groupings
4.7 − 1.2
Width of interval ≈ = 0.5. It is appropriate to start with the value
7
1.0 and end with 4.9. Table 8.2 is the frequency distribution table for the
data while Figure 8.2 displays the bar chart.
Example:
In studying the association between job satisfaction and job performance, a
researcher carried out a retrospective survey based on the annual
performance appraisal of employees of a large organisation. The following
data was obtained as shown in Table 8.3. Figure 8.3 is a side-by-side bar
chart.
Job Performance
Low High
Low 104 20
Job Satisfaction
High 40 25
Example:
The data below shows the average number of hours spent in a day by a
group of teachers preparing teaching aids.
0.7, 2.7, 0.5, 1.2, 0.5, 2.1, 0.9, 1.4, 1.3, 1.6, 3.3, 1.8, 1.9, 1.9, 2.3.
The following is the stem and leaf plot for this data.
Stem Leaf
0 5 5 7 9
1 2 3 4 6 8 9 9
2 1 3 7
3 3
[Note: In plotting the stem and leaf for this data set, the unit digit becomes
the stem and the following digit becomes the leaf.]
• Histograms
Histograms are used to describe numerical data that have been grouped
into frequency, relative frequency or percentage distributions.
Example:
Suppose the average time spent on preparing teaching aids were collected
from larger samples and tabulated in a frequency distribution table, a
histogram will be an appropriate chart to represent the data. Table 8.4
shows the frequency distribution of the average number of hours spent in a
day on preparing teaching aids by 140 teachers, while Figures 8.4 and 8.5
present the histogram and polygon illustrating the data respectively.
Figure 8.4: Histogram showing the average time spent on preparing teaching aids
Figure 8.5: Polygon showing the average time spent on preparing teaching aids
• Scatter Diagram
Scatter diagrams are used to describe the linear relationship between two
quantitative variables. The shape of the scatter points on the two-
dimensional graph will give us an idea on how these two variables are
related.
Example:
Suppose that a group of respondents were asked to rate their satisfaction
with the teaching and learning environment at OUM and their motivation
to learn, their scores on these two variables can be plotted as a scatter
diagram to see whether they are linearly related. Figure 8.6 shows the
scatter diagram relating studentsÊ satisfaction with the teaching (T&L)
environment at OUM and their motivation to learn.
ACTIVITY 8.1
The measure we use depends on the type of measurement and its distribution.
Figure 8.7 summarises the commonly used measure of central tendency.
[Note: Since most of the post-graduate research will use primary data and the SPSS will be used for
analysis, this module does not discuss the methods for determining values for the central tendency
measures for grouped data. Students interested in learning this may refer to any standard statistics
books.]
(a) Mean
To calculate the mean, we add up all the observed values and divide them
by the total number.
x1 + x 2 + ... + x n
Mean is defined as, X=
n
Example:
The following is the data on the number of hours in a day spent by 10
students doing mathematics revision.
0.7, 2.7, 0.2, 1.2, 0.5, 2.1, 0.9, 1.4, 1.3, 1.6
The mean time in hours spent doing mathematics revision per day is:
0.7 + 2.7 + 0.2 + 1.2 + 0.5 + 2.1 + 0.9 + 1.4 + 1.3 + 1.6
X=
10
The mean, X = 1.26 hours.
Since mean uses all the information in the data set, it is a more efficient
summary measure. However, the disadvantage of using mean is that it is
affected by extreme values. This is illustrated as follows.
If the value „1.6‰ in the data set is substituted with „10.3‰ hours, the mean
will become:
0.7 + 2.7 + 0.2 + 1.2 + 0.5 + 2.1 + 0.9 + 1.4 + 1.3 + 10.6
X =
10
X = 2.16 hours
(b) Median
Median is also known as a measure of location; that is, it points to the
central location as the summary of the data set. Since median does not use
all the information in the data set, it is less efficient than the mean. One
advantage of using median over mean is that it is not influenced by extreme
values.
Example:
In the example on time spent (in hours) for doing mathematics revision, the
n +1 15 + 1
median is the values at location , which is
2 2
0.7, 0.9, 1.2, 1.3, 1.4, 1.6, 1.8, 1.9, 1.9, 2.1, 2.3
Median Location
If the value 2.3 is replaced with 10.3, the median value of the data set will be:
0.7, 0.9, 1.2, 1.3, 1.4, 1.6, 1.8, 1.9, 1.9, 2.1, 10.3
Median Location
The median value for the data set is still 1.6 hours.
0.7, 0.9, 1.2, 1.3, 1.4, 1.6, 1.8, 1.9, 1.9, 2.1, 2.3, 2.7 , 3.3
n +1 12 + 1
Then, the median location will become , which is = 6.5:
2 2
0.7, 0.9, 1.2, 1.3, 1.4, 1.6, 1.8, 1.9, 1.9, 2.1, 2.3, 2.7 , 3.3
Median Location
1.6 + 1.8
Thus, the median is , which is 1.7 hours.
2
(c) Mode
Mode of distribution is the value that occurs most frequently. It is not affected
by extreme values and normally used for either nominal or ordinal data. There
could be situation where no mode or several modes are in the data set. Such
distribution of scores would be termed as bimodal, trimodal and so on.
Example:
The following is the data on the number of hours in a day spent by 10
students doing mathematics revision.
0.7, 0.9, 1.2, 1.3, 1.4, 1.6, 1.8, 1.9, 1.9, 2.1, 2.3
ACTIVITY 8.2
For the data given in Activity 8.1, determine the appropriate Measure
of Central Tendency and interpret them in the context of the study.
Justify your selections.
(a) Range
It is the difference between the largest and the smallest observations
in the data set.
Example:
The following is the data on the number of hours in a day spent by 10
students doing mathematics revision.
0.7, 0.9, 1.2, 1.3, 1.4, 1.6, 1.8, 1.9, 1.9, 2.1, 2.3
Data set A, B and C have the same mean but different standard
deviation. The standard for data set B is the lowest followed by data
set A and C. The smaller the standard deviation, the higher the
consistency between the data in the data set. This is illustrated clearly
in Figure 8.9, where data in set B are distributed more closely
compared to data in set A and C.
n (X − X )
Standard deviation is defined by the formula: SD = ∑ ni −1 where
i =1
X i is the ith observation, X is the mean value of the data set and n is the
number of observations in the data set. If the data set comprises the entire
population (census data), the population standard deviation is defined as,
σ=
N (X − X )
∑ i .
i =1 N
0.6, 2.2, 0.3, 1.3, 0.6, 2.2, 1.1, 1.5, 1.6, 1.9
The following is the computation of standard deviation for the above data.
Mean Score
X = 1.33
∑ (X i − X ) 2 = 4.12
n (X − X)
4.12
Standard Deviation = ∑ i
n−1
=
9
= 0.677
i =1
In this example, the mean hour for watching TV is 1.33 and the standard
deviation is:
⎛ SD ⎞
CV = ⎜ ⎟ *100%
⎝ X ⎠
Example:
The table below shows the mean weight and standard deviation of two
groups of children.
The weights of children in group 2 are more dispersed than those in group 1.
ACTIVITY 8.3
For the data given in Activity 8.1, determine the appropriate Measure of
Dispersion and interpret it in the context of the study. Then, justify your
selection.
ACTIVITY 8.4
Use the data provided in the appendix, perform the following tasks:
1. Use appropriate graphs to summarise the data;
2. Determine the appropriate measure of central tendency and measure
of dispersion to describe the relevant variables; and
3. Write a simple report describing what the researcher can conclude
about the sample.
• Data gathered from samples are usually presented using some form of
graphics for easy interpretation.
• Some commonly used graphs are: Bar Chart, Pie Chart, Line Graph,
Histogram, Stem and Leaf Diagram, and Scatter Diagram.
• The types of graphs and charts depend on how the variables are measured.
• The two types of numerical summaries are measure of central tendency and
dispersion.
• The commonly used measures of central tendency are mean, median and mode.
X INTRODUCTION
Inferential statistics are statistical tests used in making inference from sample
observation to the relevant population. Unlike descriptive statistics presented in
the earlier topic, the emphasis of inferential statistics is not the samples but
making generalisation from sample data to the population. Generally, there are
two ways to infer from sample to population, either estimating the population
parameter from sample statistics or testing the hypothesis that what is observed
in the sample is a reflection of the actual phenomena in the population.
95%
p s (1 − p s ) p s (1 − p s )
p s − 1.96 p s + 1.96
n n
Example:
A researcher intends to determine the „mathematics problem-solving ability‰ of
Form Five science stream students in his school. He gave a mathematics
problem-solving test to 65 randomly selected Form Five science stream students
and computed the mean score and standard deviation. He then computed the
95% confidence interval for the test scores and reports it as the „mathematics
problem-solving ability‰ of the Form Five science stream students in his school.
SD
The standard error of mean is , which is 0.57. The following is the
n
computation of the confidence interval.
Lower Limit
SD
x −1.96
n
89 - (1.96 x 0.57) = 87.88
Upper Limit
SD
x +1.96
n
89 + (1.96 x 0.57) = 90.12 ă lower limit
Example:
Table 9.1 presents the mean „mathematics problem-solving score and standard
deviation‰ of two groups of randomly selected Form Five students from the
science and arts streams in the same school. There, researchers would like to see
if there is a significant difference in the ‰mathematics problem-solving ability‰
between Form Five students from the science and arts streams in his school.
Solution:
The confidence interval for science stream students:
89 + (1.96 x 0.57) = 90.12 ă upper limit
89 - (1.96 x 0.57) = 87.88 ă lower limit
95% Confidence Interval is between 87.88 ă 90.12.
The confidence interval for arts stream students:
Arts Stream Form Five students Science Stream Form Five students
Interpretation:
Since the confidence intervals for the science and arts stream Form Five students
do not overlap, it can be concluded that there is a significant difference in the
mathematics problem-solving ability between these two groups of students.
From the mean scores, we can further conclude that science stream students have
higher mathematics problem-solving ability than the arts stream students.
Copyright © Open University Malaysia (OUM)
TOPIC 9 QUANTITATIVE DATA ANALYSIS II: INTRODUCTION TO INFERENTIAL W 151
STATISTICS
ACTIVITY 9.1
A researcher wanted to study the Emotional Intelligence of university
students in the Klang Valley. He defined Emotional Intelligence as a
three-dimensional construct comprising Stress Tolerance, Optimism
and Emotional Self-Awareness. The following table shows the
descriptive statistics of the variables for male and female students.
Compute the confidence interval and interpret your results.
One Sample:
The mean mathematics problem-solving score of Form Five science stream
students in the school is 85.
Two Samples:
There is no difference in the mean mathematics problem-solving score
between the science and arts stream Form Five students in the school.
One Sample:
The mean mathematics problem-solving score of Form Five science stream
students in the school is not 85.
Two Samples:
There is a difference in the mean mathematics problem-solving score
between the science and arts stream Form Five students in the school.
Types of Error
Regardless of how carefully samples are selected for a study, there will always be
some variation in the results when compared with results derived from the entire
population. This variation is due to sampling error. Thus, the inference from
samples to populations are probabilistic; we can be close to 100% certain by
employing strict research design but can never be 100% certain that our inference
is correct. Drawing the wrong conclusion is referred to as error of inference.
There are two types of error: type I and type II errors ă both relate to the rejection
or acceptance of the null hypothesis. Table 9.2 illustrates type I and type II errors
in relation to the decision on the null hypothesis.
Table 9.2: Related Decisions about the Null Hypothesis and Error Types
Null Hypothesis
Decision
True False
Type I error is committed when the researcher rejects the null when the null is
indeed true; in other words, incorrectly rejecting the null. The probability level
where the null is incorrectly rejected is called the significance level, denoted by
the symbol α a value set a priori (before even conducting the research) by the
researcher.
On the other hand, type II error is committed when the researcher fails to reject
the null when the null is indeed false; in other words, wrongly accepting the null.
The type II error is often denoted as β . In any research, the intention of the
researcher is to correctly reject the null. If the design is carefully selected and the
samples represent the population, the chances of achieving this objective are
high. Thus, the power of the study is defined as 1 - β .
Both the α and β have some implications on the sample size; holding the
sample size constant, α and β are inversely related. This is to say that if the
researcher would like to minimise the probability of committing type I error, the
probability of committing type II will increase.
Example:
It is given that the mathematics problem-solving score of 50 randomly selected
Form Five science stream students in the school is 89 with a standard deviation of
10.8. If it is evident from a previous similar study that the mean score for
mathematics problem-solving of Form Five science stream students in the district
is 85, what can be concluded from the results obtained from the sample?
Solution:
The alternate hypothesis shows that the researcher is being optimistic with his
studentsÊ ability.
Step 4: Assume that the data was collected appropriately. The sample mean
( X ) and standard deviations (SD) are 89 and 10.8 respectively while
population mean ( μ ) is 85 (obtained from the null hypothesis).
X−μ 89 − 85
The t value is defined as t = = = 2.61
SD / n 10.8 / 50
Step 6: Conclusion
The mean mathematics problem-solving score of Form Five science
stream students in the school is not equal to 85.
Points to Ponder
The hypothesis can be directional (one tailed) or non-directional (two tailed). The
difference between one-tailed and two-tailed tests is how the hypothesis is
formulated. For the one-tailed test, the Alternate Hypothesis will indicate
direction while no direction is indicated for the two-tailed test. The decision on
whether to use a one-tailed or two-tailed test depends on the amount of
information one has based on literature. Hypothesis for the one-tailed test
(directional hypothesis) should be strongly guided by literature. On the
implication on the statistical significance, it is far more difficult to reject the non-
directional hypothesis since the observed probability needs to be compared to
α
the significance level divided by two ( ). If the researcher is unsure of the
2
direction, the non-directional hypothesis is recommended. The following is an
example of how to write the null and alternate hypotheses for one-tailed and
two-tailed tests.
Two-Tailed Test
Null (Ho) : There is no difference in the mean mathematics problem-
solving score between the science and arts stream Form Five
students in the school.
One-Tailed Test
(a) Right Tail
Null (Ho): The mean mathematics problem-solving score of science
stream Form Five students in the school is less than or
equal to the scores of Form Five students from the arts
stream.
ACTIVITY 9.2
Apart from the three variables related to Emotional Intelligence (Stress
Tolerance, Optimism, and Emotional Self-Awareness) the researcher also
collected some demographic information. Among the demographic
variables are gender, ethnicity and types of institution (either
government of private).
Write the appropriate hypotheses that the researcher can test in this study.
You are required to write the hypotheses in null and alternate form.
Choosing the right test will contribute to the validity of the research
findings. Improper use of statistical tests will not only cause the validity of
the test results to be questioned and do little justification to the research but
at times it can be a serious error especially if the results have major
implications (for example, if these are used in policy formulation and so
on).
Sample size plays a crucial role in deciding the family of statistical test,
parametric or non-parametric. In large sample, the central limit theorem
ensures that parametric tests work well even if the population is not
normal. Parametric tests are robust to deviations from normal distributions,
when sample size is large. The issue here is how large is large enough. A
rule of thumb suggests that a sample size of about 30 or more for each
category of observation is sufficient for one to use the parametric test.
Non-parametric tests also work well with large samples.
Non-parametric tests are only slightly less powerful than parametric tests
with large samples.
On the other hand, if the sample size is small, we cannot rely on the central
limit theorem. Thus, the p value may be inaccurate if parametric tests were
to be used.
Figures 9.2, 9.3 and 9.4 summarise the common statistical tests used in
education research.
Figure 9.3: Commonly used test in Education Research for comparing two populations
ACTIVITY 9.3
For the hypotheses listed for activity 9.2, determine the appropriate
statistical tests. The sample comprises 78 males and 96 females. In terms
of ethnicity, 75 are Malays, while Chinese and Indians are 53 and 46
respectively. As for the types of institution, 108 are from the government
universities while 66 are from private institutions.
SELF-CHECK 9.1
• There are two ways to infer from sample to population, either by estimating
the population parameter from sample statistics (confidence interval) or
testing the hypothesis.
X INTRODUCTION
From Topic 1 to Topic 9, we have discussed the methods and techniques of
conducting research from the quantitative perspective. This topic will focus on
the writing process and will address one important task that most graduate
students will have to do, that is, write the research report. The guidelines
discussed in this topic are more appropriate for quantitative research rather than
qualitative research. Despite this, the format is still relevant to qualitative
research with some adaptations. You may refer to HMEF5103 Qualitative
Research Methodology for specific details.
ACTIVITY 10.1
Take a look at a research report and identify the chapters and major
sections in each chapter.
Read the chapters and the major sections carefully and take note of what
is focused on each chapter and major section.
The title should be concise and should indicate clearly the purpose of the
study. Keep in mind its possible usefulness to another researcher who may
search a database where your dissertation may be listed. The title should not
claim more than the study actually delivers. It should not to be stated broadly
and make it difficult for the reader to determine what the study is about. For
example, the title „Mathematics Achievement of Urban Poor Children‰ is too
general and a more precise title would be „Mathematics Achievement of Poor
Children in an Urban School in Sarawak‰. These items should be written in
Times Roman font/16pt. The title should be in capital letters, single-spaced
and centred between the right and left margins of the page. If the title goes
beyond one line, the words in the title should be divided into lines so that each
successive line is shorter than the one above it and is centred below it in an
inverted pyramid style as shown in Figure 10.2.
(b) Abstract
The abstract is a short, one-paragraph summary of the most important
components of the research study. It normally includes a brief statement of
the problem, the objectives of the study, the main research questions, the
sample involved, the research design adopted, the major findings and their
implications. The abstract should not exceed 200 words written in single
spacing. Open University Malaysia requires that you have the translation of
your abstract in Bahasa Malaysia if you have written your report in English.
Likewise, you need to provide the English translation of the abstract if the
Copyright © Open University Malaysia (OUM)
168 X TOPIC 10 WRITING THE RESEARCH REPORT
report is in Bahasa Malaysia. The translated abstract should include the title
as well.
(c) Acknowledgements
An acknowledgement page is included if you have received unusual
assistance in the conduct of the study. The acknowledgements should be
simple and restrained. Do not indulge in flattery and excessive recognition
for routine participation of family members, lecturers, supervisors,
librarians and clerical helpers.
(d) Declaration
The declaration is meant for the candidate to declare that the research
report is the result of your own work. You should use the format provided
by Open University Malaysia as provided in Figure 10.3.
DECLARATION
Name:
Matric Number:
I hereby declare that this dissertation is the result of my own work, except for
quotations and summaries which have been duly acknowledged.
Signature: Date:
In your problem statement, you not only explain what you are
studying i.e. the independent and dependent variables involved but
also provide research and theoretical support for your study.
(iii) In one or two statements, write the objectives of the study. Ask
yourself what you hope to uncover through your research and then
say it in a way that your colleagues will understand. Use simple and
jargon-free language. In this section, you might want to elaborate on
the conceptual framework to further clarify your objectives.
Otherwise, you can have an additional section on Research Design in
Chapter 3 to explain the conceptual framework of your study.
(v) Briefly tell the reader the significance of the study (justify doing the
study). You can argue the significance of your study based on the
following criteria:
• The problem or gap demands attention because the findings could
influence practice and policy;
• The methodology you are using is unusual;
• You are studying certain variables which have not been given
attention in previous studies;
• Your study will contribute to the body of knowledge in the field;
• The outcome could be the extension of a theoretical model.
(vi) The limitations of the study refer to elements which are beyond the
control of the researcher. Outline the limitations in terms of the
sample, data collection and/or data analysis so that the reader is
aware of the parameters of your study. Do not be intimidated by the
limitations of the study because it is not possible to expect every
research conducted to be perfect. They merely indicate to the reader
that you are aware of the limitations and that the findings should be
interpreted in light of these limitations.
SELF-CHECK 10.1
SELF-CHECK 10.2
(i) Sample
It is here that you explain the size of the sample and how you select
them. Indicate the extent to which the sample is representative of the
population. For example, what is the composition of your sample?
Are you going to use random sampling or purposive sampling and
why?
(ii) Instrumentation
You should include a clear description of the data collection
techniques or instruments you will use. For example, if your study is a
survey, you have to explain how you design and develop the
questionnaire or interview checklist. Explain the number and types of
items included in the questionnaire. If you are going to use attitude
scales, achievement tests and other psychological tests, you have to
give evidence regarding the reliability and validity of the instruments.
You should also describe the scoring procedures adopted for the
instruments used.
SELF-CHECK 10.3
and the American Educational Research Journal to see how tables and
graphs are presented and explained.
If you do a qualitative study, there will be less numerical data. Instead your
data would consist of concepts, categories or themes which may be
presented in table form. You would also be presenting data in the form of
anecdotes or excerpts of interviews, observations and documents to
support your arguments. For qualitative studies, Qualitative Report is a
useful journal which presents reports of qualitative studies in education,
nursing and medicine. As the focus here is on quantitative research
methodology, the format of data analysis and data presentation is beyond
the scope of this module.
For clarity, you may want to present your analyses and findings under each
research question. However, demographic data of the sample that are not
findings to your research questions should not be presented in this chapter.
This information should ideally be presented in Chapter 3 under the
heading „Sample‰.
SELF-CHECK 10.4
You should keep in mind that this chapter is the most used part of the
research report by other readers. Readers who scan research literature to
find significant studies examine this chapter before deciding whether or not
further examination of the report is worthwhile.
SELF-CHECK 10.5
What could be the sections in Chapter 5?
(b) Appendices
Include in this section all the relevant support materials especially the
research instruments that are used in the study. This will indicate to
whoever assesses your research report that it is a complete and quality
product.
NOTE:
For details about the scope and depth of the research report, please refer to
Appendix 2. The appendix presents the scope and depth of the research
report for the PhD (Edu) Thesis , the MasterÊs Dissertation (Option 1) and
the MasterÊs Project (Option 2).
ACTIVITY 10.1
Compare what you have read about the research report in this topic with
what you have noted in Activity 10.1.
How are they similar to each other?
How are they different from each other?
Borg, W., & Borg, M. (1988). Educational research: An introduction. New York:
Longman.
Guba, E., & Lincoln, Y. (1989). Fourth generation evaluation. Newbury Park,
California: Sage Publications.
John Arul Phillips (2007). HMEF 5014 Educational research methodology. Kuala
Lumpur: OUM.
Mitchell, M., & Jolley, J. (1988). Research design explained. New York: Holt,
Rinehart and Winston.
APPENDIX 1
Overview A dissertation is expected to cover a A dissertation is expected to cover a A research report is expected to cover a
particular area of study in depth and to particular area of study in depth and particular area of study and to contribute
contribute to research in the discipline. to contribute to research in the field. to research in the field. A study
Mere description or a study of undertaken for Option 2 may be a small-
perceptions of people in the field is not scale study of a phenomenon or it may
A dissertation should demonstrate encouraged unless it is supported by be a study of perceptions.
evidence of reading seminal work and some qualitative data.
recent contributions to body of research.
The intended study should explore and The emphasis here is more on a
address the existing “gap” in the area of A dissertation should examine student’s understanding of the research
research. contemporary issues in the field by process as well as the awareness of the
presenting evidence of reading seminal limitations of his/her study rather than
The emphasis here is on a student’s
work as well as recent contributions to the in-depth understanding of the
detailed and in-depth understanding of
research in an area. phenomenon investigated or the
the phenomenon investigated.
generalisability of his/her findings.
Quantitative If the study employs a quantitative If the study employs a quantitative If the study employs a quantitative
Research approach, it should rest on a sound approach, it should rest on a sound approach, it should be supported by past
theoretical framework and supported by theoretical framework and supported research.
past research. by past research.
For a research report using experiments
For a dissertation using experiments or For a Dissertation using experiments or surveys, the analysis may just focus
surveys, the analysis should include use or surveys, the analysis should include on detailed explanations of descriptive
of inferential statistics, e.g. ANOVA, t- use of inferential statistics, e.g. statistics (means, standard deviation,
tests or chi-square statistics at the very ANOVA, t-tests or chi-square statistics frequency, percentages, etc).
least. Students must explain the at the very least. Students should
However, the use of inferential statistics
appropriateness of the statistical tests explain the appropriateness of the
such as t-test and chi-square analysis is
used and they must only use these tests statistical tests used and they are
highly encouraged. Students should
after providing sufficient evidence for the encouraged to provide evidence for the
show some awareness of their
assumptions underlying each statistical assumptions underlying each statistical
understanding of the statistical
procedure. procedure.
assumptions when inferential statistics is
In situations where these assumptions used.
are not met, appropriate non-parametric
tests should be used and the limitations
of these tests must be discussed.
Apart from reporting the significance of
Qualitative studies should include “rich Qualitative studies should include “rich
186 X APPENDICES
OR
Thank you.