Lecture 3 - CH 4

The document discusses the basics and assumptions of psychological testing. It covers 7 key assumptions: 1) Psychological traits and states exist and can vary in degree. 2) Traits and states can be quantified and measured through defining constructs and developing test items. 3) Test performance can predict future non-test behavior. 4) All tests have limitations and imperfections. 5) Various sources of error are part of the assessment process. 6) Unfair assessment procedures can be identified and reformed. 7) Testing provides benefits to society by informing important decisions. The document also discusses what makes a good test, focusing on reliability, validity, norms, and practical considerations.

Uploaded by

247rxg9qr8
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views41 pages

Lecture 3 - CH 4

The document discusses the basics and assumptions of psychological testing. It covers 7 key assumptions: 1) Psychological traits and states exist and can vary in degree. 2) Traits and states can be quantified and measured through defining constructs and developing test items. 3) Test performance can predict future non-test behavior. 4) All tests have limitations and imperfections. 5) Various sources of error are part of the assessment process. 6) Unfair assessment procedures can be identified and reformed. 7) Testing provides benefits to society by informing important decisions. The document also discusses what makes a good test, focusing on reliability, validity, norms, and practical

Uploaded by

247rxg9qr8
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 41

PSYC 303: MEASUREMENT AND EVALUATION IN PSYCHOLOGY
LECTURE 3: Of Tests and Testing: Basics and Assumptions
MEYMUNE N. TOPÇU, PhD
Recap from Lecture 2
 Measures of central tendency
 Variability
 Measures of Skewness - Kurtosis
 Standard Scores
 Correlation
 Levels of Measurement
 Meta-Analysis
Challenging Concepts from Pre-Quiz 2.2
Lecture Plan
 Assumptions about psychological testing and assessment
 What is a good test?
 Reliability
 Validity
 Norms
 Sampling to develop norms
 Types of norms
 Fixed reference group scoring systems
 Norm- vs. criterion-referenced evaluation
 Culture and Inference
What is a “good test”?
Assumption 1: Psychological traits and states exist
 Components of stability and change in our behavior
 Trait: A long-term characteristic of an individual that shows through their behavior, actions, and feelings
 Based on a sample of behavior
 Intelligence, cognitive style, interests, personality
 State: A temporary condition that an individual is experiencing for a short period of time
 Any examples?
 https://oxford-review.com/oxford-review-encyclopaedia-terms/the-difference-between-an-state-and-a-trait/
Why is it important to distinguish between traits and states in psychological testing & assessment?
 How do traits exist? Do they have a physical existence?
 A psychological trait exists as a construct
 Construct: An informed, scientific concept developed or constructed to describe or explain behavior
 The construct's existence can be inferred from overt behavior
 Observable action or the product of an observable action
 A trait is not expected to be manifested 100% of the time
 What determines whether a trait will be manifested or not?
 The strength of the trait & the nature of the situation
 Situation-dependent, e.g., American football vs. playground
 https://www.mindgarden.com/145-state-trait-anxiety-inventory-for-adults
 Attributions of a trait or state term are relative
 E.g., "Özge is very shy" – an unstated comparison with the degree of shyness in the average person
 The reference group can greatly influence one's conclusions or judgments
 Measuring sensation seeking (the need for varied, novel, complex sensations)
 Sensation Seeking Scale vs. performance-based measures
Assumption 2: Psychological traits and states can be quantified and measured
 If psychological traits and states vary by degree, they are quantifiable
 Defining the trait/state
 The same phenomenon can be defined in different ways
 E.g., "Aggressive salesperson", "Aggressive killer", "Aggressive waiter"
 How aggressiveness is defined by the test developer:
 "The number of self-reported acts of harming others"
 "The number of observed acts of aggression"
 The test developer should provide a clear "operational definition"
 After defining a trait/state, a test developer considers the types of item content
 Components of intelligence in US adults
 If knowledge of American history: "Who was the second president of the US?"
 If social judgment: "Why should guns in the home always be inaccessible to children?"
 Should all items have equal weight?
 The social judgment item could be given more weight
 Developing appropriate ways to score and interpret the test
 Cumulative scoring: A trait is measured by a series of test items; the final score is typically the sum of the item scores
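The following is a minimal sketch of cumulative scoring, not taken from the lecture: the item names, responses, and weights are all hypothetical, assuming a five-item scale where higher responses indicate more of the trait.

```python
# Hypothetical 5-item self-report scale scored cumulatively:
# the trait score is the sum of the item scores, so endorsing more
# trait-relevant content yields a higher total.
responses = {"item_1": 4, "item_2": 3, "item_3": 5, "item_4": 2, "item_5": 4}

# Optional unequal weighting, e.g., a social-judgment item (item_3)
# counted twice as heavily as factual items (weights are illustrative).
weights = {"item_1": 1, "item_2": 1, "item_3": 2, "item_4": 1, "item_5": 1}

raw_score = sum(responses.values())
weighted_score = sum(weights[item] * score for item, score in responses.items())

print(f"Cumulative raw score: {raw_score}")            # 18
print(f"Weighted cumulative score: {weighted_score}")  # 23
```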
Assumption 3: Test-related behavior predicts non-test-related behavior
 The obtained sample of behavior is typically used to make predictions about future behavior
 E.g., Predicting success in life from intelligence scores obtained in childhood
 To postdict behavior: Understanding of behavior that has already taken place. E.g., Criminal's state of mind
Assumption 4: All tests have limits and imperfections
 Why?
 Test users should understand the limitations of tests and how those limitations can be compensated for by data from other sources
Assumption 5: Various sources of error are part of the assessment process
 Error: Factors other than what a test attempts to measure will influence performance on the test
 Does an intelligence test score truly reflect intelligence, or factors other than intelligence?
 Error variance: The component of a test score attributable to sources other than the trait or ability measured
 Assessees, assessors, and instruments can all be sources of error variance
 Random errors: Errors that happen as a matter of chance
 E.g., the weather on the day of testing
 Error is an element in the process of measurement
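The error-variance idea above is usually formalized with the classical test theory decomposition; the equations below are a standard convention from the measurement literature rather than something stated on the slide:

$$X = T + E, \qquad \sigma_X^2 = \sigma_T^2 + \sigma_E^2,$$

where $X$ is the observed score, $T$ the true score, and $E$ random error assumed uncorrelated with $T$; $\sigma_E^2$ is the error variance, and reliability can be read as the share of observed-score variance that is true-score variance, $\sigma_T^2 / \sigma_X^2$.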


Assumption 6: Unfair and biased assessment procedures can be identified and reformed
 There are sophisticated procedures to identify and correct test bias, and lists of ethical guidelines to ensure test fairness
 Fairness-related questions and problems can still arise
 E.g., The test is used with a person whose background/experience is different from the group the test was intended for
 Tests are tools, and they can be used properly or improperly
Assumption 7: Testing and assessment offer powerful benefits to society
 Imagine a world without psychological tests/assessments. How would it be?
 In a world without tests…
 People could easily trick others into believing they are surgeons
 Personnel might be hired on the basis of nepotism rather than documented merit
 It would be very difficult to offer treatments for educational difficulties
 The military/business sector would not have a tool to screen applicants
 We need good tests…
What is a "good test"?
Reliability
 Psychometric soundness of a test
 A good test/measuring tool is reliable
 In theory, the perfectly reliable measuring tool consistently measures in the same way
 Illustration: a 1 kg object weighed three times on three different scales – the first scale reads 1 kg, 1 kg, 1 kg; the second reads 1.3 kg, 1.3 kg, 1.3 kg; the third reads 1.2 kg, 0.9 kg, 1 kg. The first two scales measure consistently (reliably); the third does not.
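One common way to quantify reliability for a trait measure is the test-retest method: administer the same test to the same people twice and correlate the two sets of scores. A minimal sketch with hypothetical scores (an illustration of the general idea, not a procedure from the lecture):

```python
# Test-retest reliability estimated as the Pearson correlation between
# two administrations of the same test (scores are hypothetical).
from statistics import correlation  # available in Python 3.10+

time_1 = [12, 18, 25, 30, 22, 15, 28, 20]
time_2 = [14, 17, 24, 31, 21, 16, 27, 22]

r_tt = correlation(time_1, time_2)
print(f"Test-retest reliability estimate: r = {r_tt:.2f}")
```

A perfectly reliable instrument would give r = 1; state measures are expected to show lower test-retest correlations than trait measures because the attribute itself changes between administrations.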
Why is it more difficult to achieve perfect reliability for psychological tests?
How does calculating reliability differ when you are measuring a trait vs. a state?
What is a "good test"?
Validity
 Psychometric soundness of a test
 A valid test measures what it claims to measure
 E.g., Intelligence
 Items that make up a test adequately sample the range of areas that must be sampled to adequately measure the construct
 How are the scores interpreted? How do scores on this test relate to other scores measuring the same/opposite construct?
 E.g., A valid test of introversion should be negatively correlated with a valid test of extraversion
What is a "good test"?
Other Considerations
 A good test is one that trained examiners can administer, score, and interpret with a minimum of difficulty
 A good test is a useful test, one that yields actionable results that will ultimately benefit individual test takers or society at large
 If the purpose of a test is to compare the performance of the test taker with the performance of other test takers, then a "good test" is one that contains adequate norms
Why choose one test over another?
 What is the objective of using a test? How does the test meet that objective?
 How is the construct defined?
 Who is the test designed for use with? (age, gender, reading level, etc.)
 How appropriate is it for the targeted test takers?
 What type of data will be generated from using this test?
 Will there be a need for other assessment tools?
 Does the test require an expert test user?
Norms
 Norm-referenced testing and assessment: A method of evaluation and a way of deriving meaning from test scores by evaluating an individual test taker's score and comparing it to scores of a group of test takers
 Aim: To understand where the test taker stands among other test takers
 Norm (singular): Behavior that is usual/average/normal
 Norms (plural): The test performance data of a particular group of test takers that are designed for use as a reference
 Normative sample: The group of people whose performance on a particular test is analyzed
 The data may be in raw or converted scores
 To norm (verb): Refers to the process of deriving norms
 Norming a test is expensive
 User norms: Descriptive statistics based on a group of test takers in a given period of time
Sampling to develop norms
 Standardization: The process of administering a test to a representative sample
 Population: The complete set of individuals with at least one common observable characteristic
 Sample of the population: A portion of the universe of people deemed to be representative of the whole population
 Sampling: The process of selecting a representative group of people
 Subgroups in a population may differ in terms of certain characteristics. It can be essential to have these differences proportionately represented
 Stratified sampling helps prevent sampling bias and aids in the interpretation of results
 Stratified-random sampling: When every member of the population has the same chance of being included in the sample (a minimal sketch appears after this list)
 Purposive sampling: Arbitrarily selecting a sample because we believe it to be representative of the population
 The problem: The sample may no longer be representative
 Sampling decisions involve comparing what is ideal with what is practical
 Incidental/convenience sampling: Employing a sample that is not necessarily the most appropriate but is simply the most convenient
 Budgetary or other limitations
 E.g., PSYC 101 samples
 Exclusionary criteria:
 People with uncorrected vision impairment
 People taking medicine that can affect performance
 People who are not fluent in English, etc.
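A minimal sketch of stratified-random sampling, assuming a hypothetical population in which each person is tagged with a stratum (here "urban"/"rural"); the sizes, strata, and proportions are made up for illustration:

```python
# Stratified-random sampling sketch: draw from each stratum in proportion
# to its share of the population, randomly within each stratum, so that
# subgroup differences are proportionately represented in the norm sample.
import random
from collections import defaultdict

random.seed(0)

# Hypothetical population: (person_id, stratum)
population = [(i, "urban") for i in range(700)] + \
             [(i, "rural") for i in range(700, 1000)]

def stratified_sample(pop, n):
    strata = defaultdict(list)
    for person, stratum in pop:
        strata[stratum].append(person)
    sample = []
    for members in strata.values():
        k = round(n * len(members) / len(pop))    # proportional allocation
        sample.extend(random.sample(members, k))  # random draw within stratum
    return sample

norm_sample = stratified_sample(population, n=100)
print(len(norm_sample))  # 100 in total: about 70 urban, 30 rural
```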
Recall your own experience as a research subject. How appropriate was it for the researcher to use students as a convenience sample?
 Developing norms for a standardized test
 Administering the test to the sample
 Standard set of instructions
 Recommended settings
 Summarizing the data using descriptive statistics (?)
 Test developers provide information to support recommended interpretations of the results: the nature of the content, norms/comparison groups, other technical evidence
Types of Norms
Percentile
 Percentile norms: The raw data from a test's standardization sample converted to percentile form
 Dividing the distribution into 100 equal parts
 Percentile: An expression of the percentage of people whose score on a test or measure falls below a particular raw score
 Percentage correct: What proportion of the items the test taker got correct
 GRE example
 What might be a problem with using percentiles?
 With normally distributed scores, real differences between raw scores may be minimized near the ends of the distribution and exaggerated in the middle of the distribution
 The highest frequency of raw scores is in the middle – even the smallest differences will appear large in percentiles
 For the tails – differences between raw scores may be great with very small percentile differences
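A minimal sketch of percentile norms, assuming a small hypothetical standardization sample; the percentile rank of a raw score is taken here as the percentage of normative-sample scores falling below it:

```python
# Percentile norms sketch: convert a raw score to a percentile rank
# using a (hypothetical) standardization sample.
from bisect import bisect_left

norm_sample = sorted([35, 41, 44, 47, 50, 50, 52, 55, 55, 56,
                      58, 60, 61, 63, 64, 66, 68, 71, 75, 82])

def percentile_rank(raw_score):
    below = bisect_left(norm_sample, raw_score)  # count of scores strictly below
    return 100 * below / len(norm_sample)

print(percentile_rank(55))  # 35.0 -> 35th percentile (hypothetical data)
print(percentile_rank(70))  # 85.0
```

With real data this makes the problem noted above visible: where scores bunch up near the middle of the distribution, a one- or two-point raw-score difference translates into a much larger percentile difference than the same raw-score difference in the tails.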
Types of Norms
Age Norms
 Age-equivalent norms: Average performance of different samples of test takers who were at various ages at the time of the test
 Carefully constructed age norms of physical characteristics are OK
 For psychological characteristics it is tricky
 Identifying the mental age according to an intelligence test
 Problem with it: E.g., Young Sheldon
 Technical grounds: The SD can be different for different ages
Types of Norms
Grade Norms
 Grade norms: Designed to indicate the average test performance of test takers in a given school grade
 The test is administered to representative samples of children over a range of consecutive grade levels (1st to 6th)
 The school year is 10 months: a 6th-grade student performing at the average level for the 4th month of the school year receives a grade-equivalent score of 6.4
Types of Norms
National Anchor Norms
 Using nationally representative samples to compare tests that measure the same construct
 Reading tests: BRT & RAT
 The 96th percentile = a raw score of 67 on the BRT and 14 on the RAT
 National anchor norms must be obtained by administering the two tests to the same sample
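The anchoring step can be sketched with the equipercentile idea: find the percentile of a raw score on one test within the shared sample, then read off the raw score at that same percentile on the other test. The arrays below are hypothetical scores of the same (tiny) sample on the two tests named above; nothing else here comes from the lecture:

```python
# National anchor norms sketch (equipercentile linking): a raw score on
# test A is anchored to the raw score holding the same percentile on
# test B, using scores from the SAME sample. All values are hypothetical.
import numpy as np

brt_scores = np.array([30, 38, 45, 50, 55, 58, 61, 63, 65, 67])
rat_scores = np.array([3, 5, 6, 7, 8, 9, 11, 12, 13, 14])

def anchor(raw_a, scores_a, scores_b):
    pct = (scores_a < raw_a).mean() * 100   # percentile rank of raw_a on test A
    return np.percentile(scores_b, pct)     # score at the same percentile on test B

print(anchor(67, brt_scores, rat_scores))  # a high BRT score maps to a high RAT score
```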
Types of Norms
Subgroup and Local Norms
 Subgroup norms: Segmenting a normative sample by criteria used in the initial selection of subjects (age, educational level, ethnicity, handedness, etc.)
 The manual can provide normative info for each subgroup
 Local norms: Typically developed by test users to provide normative info on the local population's performance
Norm- vs. Criterion-Referenced Evaluation
 What is the difference?
 Norm-referenced: Evaluating the test score in relation to other scores on the same test
 Criterion-referenced: Evaluating a test score based on whether some criterion is met
 Criterion: A standard on which a judgment or decision may be based
 E.g., Diploma, driver's license, etc.
 Criticism: Criterion-referenced testing may assess mastery of basic knowledge, skills, or both, but has little or no meaningful application at the upper end of the knowledge/skill continuum
 The two are not mutually exclusive; a test can be both norm- and criterion-referenced
 In a sense, all testing is normative
 http://www.edpsycinteractive.org/topics/measeval/crnmref.html
Can you think of norm- vs. criterion-referenced tests you have taken before?
Culture and Inference
 Test users should not lose sight of culture as a factor in test administration, scoring, and interpretation
 Is the test appropriate for the targeted test taker population?
 The do's and don'ts regarding culture and psychological tests
