M&e Edu
M&e Edu
SUBJECT TITLE:
Types of test
.Standardized test
.Survey Test
.Diagnostic Test
.Prognostic Test
Types of Validity:
ü Content Validity
ü Construct Validity
ü Concurrent Validity
ü Predictive Validity
Statistical Measures
Continuous Assessment
• Nominal scale
• Ordinal scale
• Interval scale
• Ratio scale
-Definition of Evaluation
-Types of Evaluation
a. Formative Evaluation
b. Summative Evaluation
c.
INTRODUCTION
In this unit, we shall discuss the basic ideas in testing. We shall discuss testing as an essential
part of teaching and why teachers use test. We shall look at the meaning and qualities or
characteristics of a good test; reliability and validity of a test, Principles involve in the
construction of a test, the steps in constructing a classroom test or teacher made test, discuss the
advantage and disadvantages of both essay type test and objective type test.
LEARNING OUT COME
TEST:
WHAT IS A TEST?
To the lay man in Educational Measurement, a TEST would mean a School Examination,
College entrance Examination or employment test, involving writing or marking answers. This
constitutes a rather narrow view of the world of testing.
“a systematic procedure for observing and describing one or more characteristics or attributes of
a person with the aid of either a numerical scale or a category system” (CRONBACH 1970) OR
A test is merely a series of tasks which is used to measure a sample of a person’s behavior at a
given time. (Gronlund 1965) OR
A test is a device designed to elicit from examinees the behavior the teacher wishes to evaluate
Since a test is a form of measurement, it is therefore accompany with the assigning of numerical
values. The number assigned will enable the teacher to make comparison among different level
or performance of students in class.
The procedures that teachers normally used to find out the characteristics of the learners are:
Questionnaire, Interview, Class product, Assignment, Paper and Pencil test, Examination.
TESTING
Testing is an essential part of teaching, is an important tool used by the teachers to assess the
performance of students in the classroom teaching and learning situation.
That is why testing is important at all stages of Education; Primary, Secondary, tertiary and
especially at the conclusion of every stage.
ACTIVITY;
Every procedure used in an evaluation program should possess certain common characteristics.
The most essential of these characteristics can be classified under the following heading;
1. Validity
2. Reliability
3. Usability/Practicality
4. Objectivity
5. Comprehensiveness
6. Simplicity
7. Scorability
This includes;
a) Planning stage
b) Preparing stage
c) Reproducing stage
d) Administering stage
e) Scoring stage
f) Interpreting stage
g) Evaluation stag
GENERAL PRINCIPLES INVOLVE IN THE CONSTRUCTION OF A TEST.
For a test to be valid and reliable it must follow certain principles in order to achieve its
objectives. Below are some of the principles involving the construction of a good test;
1. Consider the level of the learners for whom you are constructing the test.
Example; A test meant for a JSS1 Pupil is inappropriate for a JSS2 pupil.
2. Determine the purpose of the test, whether it is an aptitude test, an achievement test or a
diagnostic test.
3. Determine the syllabus content from which the test is to be drawn and inform the pupils
4. Construct a table of specification or a test blue – print which is normally called Blooms
Taxonomy.
5. Determine the type of test items to be used whether Essay or Objective type.
TYPES OF TEST
1. Standardized Test
2. Teacher made Test
1. STANDARDIZED TEST
A standardized test is one which has fix set of test items, with specific directions for
administration and has given to represtative groups of individuals for the purpose of establishing
norms.
I. Survey test
II. Diagnostic test
III. Prognostic test
CHARACTERISTIC OF STANDARDIZED TEST
v Direction for administration and scoring are so precisely stated that procedures are
standard for different users of the test.
v A test Manuel and other accessories materials are provided to guide the administrational
scoring of the test and the interpretation and use of the result.
v The test is designed not only for use in measuring individual pupils growth but also to
compare classes, schools, districts, regions and nations
v The sampling of the syllabi programmers and text books is done systematically.
The teacher made test, is the test that the classroom teacher prepares and administered to his/her
pupils to find out the effectiveness of his effort in the arts of teaching. They are therefore called
the classroom test.
are employed to measure the amount and the extent of learning, within a particular course given
by a specific instruction.
ACTIVITY;
2. Standardized test encompasses large Teacher made test may be constructed from
coverage of content and skills. limited topics.
Essay type test are divided into two main types namely;
1. Restricted type
2. Free-Response type.
In this type of items, the student is not allowed to write freely but he is guided in the extent or
length and scope.
Example;
a. In not more than 250 words, write why measurement and Evaluation is included in your
programme of studies.
b. Describe the uses of test result in not more than 60 words.
In this type of test the student organizes and writes the answers depending on the scope and
nature of his knowledge and exposition.
Example;
In objective type test, the questions are asked in a way that the answer will follow a well define
form. The marker in this case, does not have to pass an opinion on whether the answer is right or
wrong.
Marking in objective test items is much easier and can be done by anybody or by a computer if a
‘magnetic’ pencil is used to answer.
OBJECTIVE TYPE TEST IS DIVIDED INTO FOUR (4) MAIN TYPES, THUS;
Example;
This consists of an introductory question or an incomplete statement called the stem and a series
of responses from which the student has to choose the most correct answer.
Example;
This is another variety of multiple choice items but one which conserves space; Students are
instructed to make correct links between items in list A and items in list B.
Example;
Ghana Freetown
Here the examinee is required to complete a statement by writing the answers in the spaces
provided.
Example;
a. Essay test is divided into two main types namely: ------------------ and -----------.
b. One type of standardized test is called --------------------.
c. Give one type of essay type test question.
ADVANTAGE ESSAY TYPE TEST ITEMS
Ø Requires a long time to answer few question; therefore a limited field of knowledge is
tested.
Ø Does not measure purely factual information very well.
Ø Subjective scoring, which is, slow, difficult and incorrsistent.
Ø Freedom to respond in own words enables bluffing and writing skill to influence the
score.
1. Extends over a wide range of knowledge in one test, because they can be answered
quickly; one test can have a variety of questions.
2. Encourages pupils to develop a wide background of facts and abilities.
3. It is simple to score – in fact can be machined scored, permitting greatest accuracy.
4. Can measure a wide range of learning outcomes.
5.
ACTIVITY;
The validity of a test concerns what the test measure and how well it does so. It is the extent to
which the result of the evaluation procedure serves the specific use.
OR.
It also refers to the soundness with which test scores can be interrupted in a particular way. The
use fullness of a test scores for a particular purpose.
Generally a test may become invalid for a test valid, if does not cater for content or outcome
under consideration. In considering the factors that influence validity, we recognize two
categories of factors
The under mentioned factors can prevent the test items from functioning as intended
consequently reducing the validity of the test result.
a. Unclear direction
b. Reading Vocabulary and sentence structure to difficult.
c. Inappropriate level of difficulty of the test items.
d. Poor constructed test items
e. Test items which are inappropriate for the outcome being measured.
f. Too short tests/short tests
g. Identifiable pattern of answers.
In certain cases invalid test are due to personal factors influencing the pupil’s responses to the
test situation rather than to any shortcoming in the test instrument. Some of the factors attributed
to the pupil’s include;
a. Emotional disturbances
b. Some pupils are frighten by the test situation and thereby unable to respond normally.
c. Lack of motivation
d. The relation of teaching to test
e. The nature of the group tested and in the composition of the criterion measures used
TYPES OF VALIDITY
i. Content validity
ii. Construct validity
iii. Concurrent validity
iv. Predictive validity
i. Content validity
is define as the extent to which a test measure a representative sample of the subject matter
content and the behavior changes under consideration.
ii.Construct validity
May be defined as the extent to which test performance can be interpreted in terms of certain
psychological construct.
EXAMPLE:
A test design to measure depression must only measure that particular construct, not
closely related ideas such as anxiety or stress.
iii.Concurrent validity:
This may also be defined as the extent to which test performance is related to some other current
performance. Concurrent validity is a statistical type of validity, determined by correlating the
test result with some external criteria of success.
iv.Predictive validity:
May be defined as the extent to which test performance is accurate in predicting some future
performance. Predictive validity can be most clearly illustrated in the area of aptitude testing
which predict success in some future activities.
ACTIVITY;
Refers to the consistency of test scores. Consistency of scores obtained by the same individual
when re-examined with the same test on different occasion and still obtain the same on similar
score.
OR
When different markers independently rate the same pupil on the same test and obtain the same
or similar score.
Although teachers seldom fine it profitable to calculate reliability coefficient for the test they
construct, they can and should take cognizance of the factors influencing reliability to maximize
the reliability of their own classroom test. The under-mentioned factors have been known to
affect the conventional measure of reliability:
ANSWER;
Reliability of a test refers to “how consistent test scores and other evaluating result
are from one measure to another. It provides the consistency that makes validity
possible and indicates how much confidence can be placed in evaluation result”.
(Gronlund-1931).
ACTIVITY;
1. List two factors each that affect the validity and reliability of a test.
2. When is a test said to be reliable?
3. Share with your colleagues some of the practical experiences you have had
during a given test in class.
STATISTCAL MEASURES
The Measure of Central Tendency are one kind of statistics which are frequently used in
educational measurement. The three (3) most common measures of central tendency are the
Mean, Mode and Median. Others include: Denciles and Percentiles. They are so named
because they are located at the middle of a distribution of scores and are useful in the describing
what is average about a Sample.
A. THE MEAN:
The mean could be defined as the average of a set of values, calculated by summing
them together and dividing by the number of terms or observations.
EXAMPLE:
Find the mean of the following: 2 3 4 5 6 2 1
SOLUTION
Mean = (X) = The sum of a given quantity
Number of items
X= 2+3+4+5+6+2+1
X= 23 / 7
X= 3.2857
1. It is easy to calculate
2. It has determinate exact value
3. It is easy to compute
4. It is best known average
DISADVANTGE OF THE MEAN
B.THE MEADIAN:
The median of a set of numbers arranged in order of magnitude (i.e. in an array) is the middle
value or the arithmetic mean of the two middle values (Spiegel 1972)
1. It is easy to compute
2. It can be obtained graphically
3. The median can be easily determine without calculation
4. It is easy to understand
EXAMLPE: 2 3 4 5 8 9 1 2
Answer = 2 3 4 [5] 8 9 1 2 median =5
C.THE MODE:
EXAMPLE: 8 5 3 7 5 2 3 4 3 5 6 3 9 and 4
Solution: arrange in order of magnitude 2 3 3 3 3 4 4 5 5 6 7 8 9
The mode is 3 because is the most common value.
ADVANTAGE OF THE MODE
NOTE:
b. When again a mode occurs three times in a distribution or row, we called that sets as
polymodal
EXAMPLE: 15, 18, 17, 15, 18, 16, 17, 15, 19, 18, 16, 17
15, 15, 15, 16, 16, 17, 17, 17, 18, 18, 18, 19.
These sets have three (3) modes 15, 17 and 18. We called this set as polymodal
EXAMPLE:
Represent the marks scored by 22 business studies pupils at Baptist junior secondary school.
8 12 12 8 13 15 27 15 13 12 8
13 8 12 12 8 12 12 13 13 15 13
Prepare a frequency distribution for the above scores/data.
a. mean
b. median and
c. model score
SOLUTION
12 IIII II 7 84
13 IIII I 6 78
15 III 3 45
27 I 1 27
∑f
X = raw score
N =
= 274/22
ACTIVITY.
60,80,47,37,58,78,61,59,51,31,24,49,09
60,52,81,77,38,54,67,59,68,48,45,and 92,
§ Range
§ Mean deviation or Average deviation
§ Semi-Interquartile range or Quartile deviation
§ The 10-90 Percentile range
§ The standard deviation
§ The variance
§ Standardized Variable
§ Standard scores
B. Correlation and regression
CONTINUOUS ASSESSMENT
Continuous Assessment has been defined as “A formative evaluation procedure which finds out
in a systematic manner the level of mastery that a pupil has attained in knowledge, skills and
attitude after a given set of learning experiences’’ (Haworth R.E. 1991). It is thus:
Formative- it is carried out as the student learns, and is used to address weakness in the student
and perhaps adjust content
Guidance Oriented- it provides a record that can be used to guide the student and provide
counseling
i. It provides feedback on the learning outcomes that learners have achieved and those that
have not been achieved
ii. It assists with identifying the strengths and weakness of learners
iii. It encourage communication between teachers and learners
iv. It works hand in hand with evaluation and therefore provides important information about
curriculum issues like teaching methods and the relevance of learning outcomes and
resources
v. Result of assessment will motivate students
vi. Help in making wise educational decision, which include selection, placement,
classification, diagnostic, guidance & counseling decisions
vii. Assessment enable pupils to get further objectives to improve their learning
viii. Assessment serve as a process of improving the teacher’s style and methods of teaching
ACTIVITY
• Why is continuous assessment important for the classroom teacher and the student?
• List four characteristics of continuous assessment in school.
1. Cognitive Domain: This aspect deals with the mental, ability and capability of the child
which is based on specific concept or ideas. Mental function of the individual
The component includes:
Ø Knowledge
Ø Comprehension
Ø Application
Ø Analysis
Ø Synthesis
Ø Evaluation
2. Affective Domain: This deals with the attitude changes that are stated in the course
objective. The emphasis is based on feeling and emotion. It also concerns with the
willingness to learn and what value wen give to the learning:
The component includes:
Ø Class contraption
Ø Sociability
Ø Helpfulness
Ø Honest
Ø Effort
Ø Initiative
Ø Self-discipline
Ø Punctuality
3. Psychomotor Domain: This aspect deals with basically manipulative skills and body
movement or control
The component includes:
Ø Painting
Ø Hand writing
Ø Stamina
Ø Dancing
Ø Playing football
Ø Jumping
Ø Running
Ø Physical dexterity
ACTIVITY
List and explain three (3) main taxonomy of education as put forward by Bloom’s et, al.
INTRODUCTION
When we measure something, we find out how “big or small” that thing is, in terms of a certain
unit which we have chosen. The measurement of things or individual helps us to be able to
compare things or individuals. Things or individuals can only be compared after they have been
measured
(STEPHEN 1946)
1. Nominal Scale
2. Ordinal Scale
3. Interval Scale
4. Ratio Scale
NOMINAL SCALE
Here the objects are classified and classes are denoted by numbers. The number for one class
whether they are greater or less than the number of the other class reflects northing about the
properties of the object order than they are different, e.g. sex-male and female: male denoted by
1, female 2, this does not mean that female are better than male, also racial origin e.g. Limba 1,
Themne 2, Krio 3
ORDINAL SCALE
Numbers indicate rank order and nothing more. They do not indicate absolute quantity nor do
they indicate that the intervals between numbers are equal. E.g. military rank, athletic ranking,
letter grading system: A, B, C, D and F. if these are the grades in a history course, we know that
an individual who has received an ‘A’ had a higher level of achievement in history than one who
has received a ‘B’
INTERVAL SCALE
In this scale objectives are not only ordered but may also be assign numbers so that equal
differences between the numbers assign to the object reflect equal difference in the amount of
property possessed. Zero is not absolute; it does not reflect the complete absence of the property.
E.g. achievement test scores, the difference between temperature of 85 and 88 degrees is
regarded as the same as the difference between temperature of 23 and 26 degrees.
RATION SCALE
Possess all the properties possessed by the interval scale but in addition zero is absolute. The
zero implies the complete absence of the property e.g. height, weight
EVALUATION
This is the process of appraising the extent to which the objectives of a programmed or project
are being achieved.
It is the process of making sound, fair and unbiased judgment about educational measurement
DIFFERENCES BETWEEN MEASUREMENT AND EVALUATION
1. To motivate learners
2. To diagnose
3. For selection purpose
4. For the promotion of students or learners
5. To report about the performance of students to parents, agencies etc.
6. To compare student’s performance
7. For placement decision
8. For certification purpose
TYPES OF EVALUATION
There are two main types of evaluation and these are; formative and summative evaluation
Ø Formative Evaluation:
this is the on-going evaluation of programmers under development so that it can be
redesigned or improved
Example.
Questions asked during an explanation, test, quizzes etc.
Ø Summative Evaluation:
This is the evaluation made at the end of a programmed. It concerns about the outcomes
of an already developed programmed and judges it worth. It gives a summary of the
overall strength and weakness of an already completed programmed and describes the
extents to which the programmed has achieved its effective objectives.
ACTIVITY
§ Validity of a test
§ Reliability of a test
§ Usability of a test
i. The Pupils
References;
§ Texts Books.
FROM. ;