0% found this document useful (0 votes)
27 views17 pages

1validity 160310040001

The document discusses different types of validity that are important for tests including content validity, criterion-related validity, construct validity, validity in scoring, and face validity. It emphasizes the importance of carefully designing tests based on language specifications and constructs being measured, and using multiple methods like concurrent validity, predictive validity, and think aloud protocols to establish validity. Tests need to accurately measure the intended content and have valid scoring to ensure they are truly assessing what they are intended to assess.

Uploaded by

Garvin Goei
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views17 pages

1validity 160310040001

The document discusses different types of validity that are important for tests including content validity, criterion-related validity, construct validity, validity in scoring, and face validity. It emphasizes the importance of carefully designing tests based on language specifications and constructs being measured, and using multiple methods like concurrent validity, predictive validity, and think aloud protocols to establish validity. Tests need to accurately measure the intended content and have valid scoring to ensure they are truly assessing what they are intended to assess.

Uploaded by

Garvin Goei
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

VALIDITY

BY NATASHA VELÁSQUEZ MANRIQUE


 1. Content validity
 2. Criterion-related validity
 3. Other forms of evidence for construct validity
INTRODUCTION  4. Validity in scoring
 5. Face validity
 6. How to make more valid tests
 It refers to how accurately an assessment or measurement tool
taps into various aspects of the specific construct in question. In
other words, do the questions really assess the construct in
question?
 A test needs to be related to the content of the class (relevant
content).
 HOW TO JUDGE IT? We need a specification of the skills or
Content structures that the test is meant to cover.
validity  Not all the course content needs to appear in the test.
 Language specifications provide the test constructor with a basis
for making a principled selection of elements to include in the test.
 A comparison between test specification and test content is the
basis for judgments related to content validity.

The  The greater a test’s content validity, the more likely is to be an


accurate measure of what it is supposed to measure.
importance of
content
validity
1. According to the CEFR, which specification of the language
skills would you need to take into account to test an A1 student?

PLEASE USE YOUR GADGETS TO HAVE ACCES TO THE CEFR


ACTIVITY 1 LANGUAGE SPECIFICATION.

2. Do you think teachers care about those specifications while


testing a student?
 It is the degree to which test results agree with those provided by
some independent and highly dependable assessment of the
candidate’s ability.
 The independent assessment is the criterion measure against
Criterion- which the test is validated
related validity  TWO TYPES:

CONCURRENT VALIDITY PREDICTICE VALIDITY


 It refers to the extent to which the results of a particular test, or
measurement correspond to those of a previously established
measurement for the same construct.
CONCURRENT
VALIDITY  Is it possible to test everything you need to test in a short time?
 This will always depend on how many functions are tested in the
component, and how representative they are among the complete
set of functions including in the objectives.
 Using the “correlation coefficient”. This is a mathematical measure
of similarity.
How the level  Perfect agreement= 1
of agreement  Total lack of agreement= 0

is measured?
The level of agreement is regarded as satisfactory, depending on
the purpose of the test and the decisions that are made based on it.
 This topic concerns the degree to which a test can predict a
candidate’s future performance.

PREDICITVE  How helpful is it to use final outcomes as the criterion measure


when so many factors other than ability in English (such as
VALIDITY subject knowledge, intelligence, motivation, health and
happiness) will have contributed to every outcome?

 Example: placement tests


 We cannot be sure that the items of the test are measuring what
we expect them to measure.

Other forms  Construct validity: “construct” refers to any underlying ability


that is hypothesized in a theory of language ability.
of evidence  It is important to establish if distinct abilities exist, if they can be
for construct measured and if they are measured in a test.
 Research is needed for evidence.
validity
Another way of obtaining evidence about the construct validity of a
test is to investigate what test takers actually do when they respond
to an answer.
TWO PRINCIPAL METHODS:

THINK ALOUD RETROSPECTION

Test taker voice They try to recollect


their thoughts as what their thinking
they respond to was, as to they
the item. responded.

Problem: The very


voicing of thoughts may Problem: Thoughts may
interfere with what would be forgotten
be the natural response of
the item
 It is worth pointing out that if a test is to have validity, not only the
item must be valid. Also the way in which the responses are scored
must be valid.

VALIDITY IN  If the test is meant to test one specific skill and while grading
another one interferes with the scoring process, then the test is
SCORING not valid.
To discuss:
 Do the PUCE language tests lack scoring validity? Yes or no?

ACTIVITY 2  So, why do teachers take away points when they are grading the
reading and comprehension part?
 What is the reason to do so? Do you think this is fair for the
student?
 A test is said to have face validity if it looks as if it measures what
FACE it is supposed to measure.

VALIDITY  A test which does not have face validity may not be accepted by
candidates, teachers, education authorities or employers.
 Write explicit specifications for the test which take into account all
that is known about the constructs that are to be measured.
 Make sure that you include a representative sample of the content
How to make of these in the test.
tests more  Use direct testing.

valid  Make sure that the scoring responses relates directly to what is
being tested.
 Do everything possible to make the test reliable. If the test is not
reliable, it cannot be valid.
 In pairs answer the following test. Then, check if it has validity
enough to say that the student knows all the content of the A1
level.
 Would you say that the test is measuring the most important
ACTIVITY 3 content of the A1 level?
 Which contents would you test instead of those and why?
 It is important to take into account the language specifications.
Conclusion  It is necessary to do research to say that a test has content validity.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy