1 Assessment Review - JANICE
1 Assessment Review - JANICE
Scoring of responses
Interest tests
Types of Tests
Test Audience
used both with tests of a single used only with tests of more than
domain and tests covering several one domain; characteristics are
different domains assessed relative to each other
Test Content
Knowledge/ Behavior Construct
Skills Domains Domains Domains
Correct Answer:
20 Or 21
STATISTICAL FOUNDATION
Question:
Correct Answer:
6
STATISTICAL FOUNDATION
Question:
Correct Answer:
Correct Answer:
68%
STATISTICAL FOUNDATION
Types of Derived Scores:
A. Based on Mean and Standard Deviation
Linear Standard Scores – preserve the absolute
position of a raw score relative to its mean
• z-score – a standard score derived by calculating the
difference between a particular raw score and the
mean, divided by the standard deviation
Correct Answer:
z-score = 2
T-score = 70
STATISTICAL FOUNDATION
B. Based on Rank within Groups
Percentile - an expression of the percentage of
people whose score on a test falls below a particular
raw score
Normalized Standard Scores – create a score
distribution which approximates the standard
normal curve
• Stanine – score based on dividing the normal
distribution into nine parts, with a mean of 5 and
a standard deviation of approximately 2
• Normal Curve Equivalent (NCE) – has a mean =
50 and SD = 21.06 with a range from 1 to 99
• Sten – has a mean of 5.5 and SD of 2
STATISTICAL FOUNDATION
Question:
Correct Answer:
97
STATISTICAL FOUNDATION
Question:
Correct Answer:
71.06
STATISTICAL FOUNDATION
Types of Score Interpretation:
SEm
- is used to estimate or infer the extent to which an
observed score deviates from a true score.
- also known as the standard error of score, an index of
the extent to which one individual’s scores vary over tests
presumed to be parallel.
STATISTICAL FOUNDATION
Correct Answer:
68% of the time his observed scores will fall
between 89 and 79
STATISTICAL FOUNDATION
RELIABILITY
Theoretical Models
TRUE SCORE GENERALIZABILITY ITEM RESPONSE
THEORY THEORY THEORY
- Also referred to as - Also referred to as domain - Also referred to as
classical test theory, a sampling theory, a system latent-trait theory or the
system of assumptions of assumptions about latent-trait model, a
about measurement that measurement that include system of assumptions
include the notion that a the notion that a test score about measurement
test score (and even a (and even a response to an (including the assumption
response to an individual individual item) consists of that a trait being
item) is composed of a a relatively stable measured by a test is
relatively stable component that actually is unidimensional) and the
component that actually is what the test or individual extent to which each test
what the test or individual item is designed to measure item measures the trait
item is designed to as well as relatively unstable
measure as well as a components that
random component that is collectively can be
error accounted for as error
STATISTICAL FOUNDATION
Validity
- How well a test what it purports to measure
VALIDITY
Utility Analysis
- A family of techniques that entail a cost-benefit analysis designed to yield
information relevant to a decision about the usefulness and/or practical value
of a tool of assessment.
TEST DEVELOPMENT
1 Test conceptualization
2 Test construction
3 Test tryout
4 Item analysis
5 Revision
1 Test conceptualization
p = no of people correct
total no. of people
Note:
• The optimal level for an acceptable p value
depends on the number of options per item.
e.g. 4 alternatives = .63
• Norm-referenced tests: optimal p value = .5
• Criterion-referenced tests: optimal p value =
proportion of test takers expected to pass the test
STATISTICAL FOUNDATION
Item Difficulty Analysis:
d = U p – Lp
U
Where:
Up and Lp indicate the number of test takers in the
upper and lower groups who pass the item, and
U is the total number of test takers in the upper group
STATISTICAL FOUNDATION
Item Discrimination Analysis:
d = PF - PU
Where:
Item-Total Correlation:
- Determine how high or low a D value is good enough
Note:
• Pearson correlation formula = use when item types that
generated a variety of scores (e.g. free response, projective,
Likert)
• Point-biserial correlation = use when item performance is
dichotomous
• The closer the correlation is to 1.0, the more likely it is the
item and the overall test measure the same characteristic.
STATISTICAL FOUNDATION
Question:
Correct Answer:
p = .62
d = .40
4 Item analysis
Qualitative Item Analysis:
Personality Assessment
Career Assessment
Neuropsychological Assessment
COGNITIVE ASSESSMENT
• Cognitive Tests
Cognitive Theories and Models
Binet’s Theory of
Intelligence
Cognitive ability
developed with age
like physical abilities.
Intelligence is
expressed in the
performance of
complex mental acts.
Cognitive Theories and Models
Thorndike’s Multi-factored Approach
• Personality Tests
Personality Theories and Models
Larsen & Buss, 2010
Personality theories are organized into the following domains:
Methods:
• Objective
• Projective
• Career Tests
Career Theories and Models
Trait-Factor Theories
•Self-Directed Search
-Developed by John L. Holland, this is a
self-administered, self-scored, and self-
interpreted interest inventory for
individuals 12 years of age and older.
Career Tests
•The Career Competency Scale for
College Students (CCS–CS)
-Developed by Vicentita M. Cervera
-Consists of 80 positively- and negatively-
worded statements distributed among
the following areas of competencies:
•Knowledge and Skills (KS)
•Attitude (A)
•Behavior (B)
•Concept of Success (CS)
•Attitude towards Change (ATC)
•Overall Competency (OC)
Career Tests
• Filipino Work Values Scale ©1988, 2013
-By Vicentita M. Cervera, Ed.D.,RGC
-The Filipino Work Values Scale (FWVS), Student Edition, is an
instrument designed to determine a student's work values. The
scale consists of eighty (80) items and it takes approximately 15
to 20 minutes to accomplish it.
•Environmental
•Familial
•Intellectual-Achievement Oriented
•Interpersonal
•Managerial
•Material
•Occupational
•Organizational
•Religious
•Variety
NEUROPSYCHOLOGICAL
ASSESSMENT
• Neuropsychology
• Neuropsychological Tests
Neuropsychology
Neuropsychology – is the branch of psychology that focuses on
the relationship between brain functioning and behavior.
Tools:
• Case history and case studies
• Interviews
• Physical examination
Neuropsychological Tests
• Tests of General Intellectual Ability
- Wechsler Tests
• Tests of Abstract Thinking
- Wechsler Similarities subtest
- Object Sorting Test
- Color-Form Sorting Test (Weigl’s Test)
- Wisconsin Card Sorting Test-64 Card Version – sorting of 64
cards that contain different geometric figures printed in
different colors.
• Tests of Executive Function
- Tower of Hanoi – it is set up by stacking the rings on one of the
pegs, beginning with the largest diameter ring, with no
succeeding ring resting on a smaller one.
- Porteus Maze Test
- Clock-Drawing Test
Neuropsychological Tests
• Tests of Perceptual, Motor and Perceptual-Motor Function
-Wepman Auditory Discrimination Test – consists of 40 pairs of
monosyllabic meaning full words
-Bruininks-Oseretsky Test of Motor Proficiency – used with
children aged 4 ½ to 14 1/2 , this includes subtests that assess
running speed and agility, balance, strength, response speed and
dexterity.
- Purdue Pegboard Test – measure manual dexterity
- Bender Visual-Motor Gestalt Test
• Tests of Memory
-California Verbal Learning Test II – the task is to repeat a list of
words that are read by the examiner.
- Wechsler Memory Scale (WMS-IV) – for test takers from ages
16-90
Practitioners…
• should select standardized psychological tests that
meet the purpose for which they are to be used and
that are appropriate for their intended clients.
Practitioners…
• should interpret test scores based on appropriate
norms.
Practitioners…
• should use tests that are culturally-fair in terms of
gender, ethnic background, religion, race and
handicapped conditions.
Practitioners…
• should comply with the test administration, test
scoring and test interpretation procedures as
prescribed in the Test Manual.
Practitioners…
• should use assessment data by taking into account
various factors and characteristics of the client being
assessed that might affect the practitioner’s
judgment or reduce the accuracy of interpretation.
UST_jtocray 08/16/16