DR Luning Sun Psychometrics Module, Lecture 3 Social Sciences Research Methods Centre - SSRMC

1. Computerized adaptive tests (CATs) individually tailor tests to examinees' abilities by selecting subsequent test items based on responses to previous items. This increases accuracy and efficiency compared to standard tests. 2. A CAT requires an item bank, item parameters, an item selection method, a scoring algorithm, and termination rules. It iteratively selects the optimal next item based on the current ability estimate, scores the response, updates the ability estimate, and repeats until a stopping criterion is met. 3. Concerto is an online platform that can be used to create CATs. It allows users to build questionnaires, CATs, and provide feedback using pre-programmed nodes. CATs created in Concer

Dr Luning Sun

Psychometrics Module, Lecture 3

Social Sciences Research Methods Centre | SSRMC
• Introduction to CAT
• CAT in R
• CAT in Concerto
Some materials and examples come from previous workshops run by:
Michal Kosinski (Stanford University)
David Stillwell (University of Cambridge)
Chris Gibbons (Harvard University)
• A standard test is likely to contain questions that are too easy or too difficult for a given participant
◦ Classical Test Theory
◦ Item Response Theory
• Adaptively adjusting the level of the test to the individual participant:
◦ Increases the accuracy
◦ Saves time / money
◦ Prevents boredom / frustration
• Item bank and calibration (IRT model)
• Starting point
• Item selection algorithm (CAT algorithm)
• On-the-fly scoring method
• Termination rules
And also:
• Item bank protection / overexposure control
• Content balancing
Start the test:
1. Ask first question, e.g. of medium difficulty
2. Correct!
3. Score it
4. Select next item with a difficulty around the most likely score (or with the max information)
5. And so on…. Until the stopping rule is reached

[Figure: probability of an incorrect and a correct response (y-axis: Probability, 0.0 to 1.0) plotted against Theta (x-axis, -3.0 to 3.0), with a normal distribution over Difficulty and the most likely score marked]
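The "score it" step above can be sketched in base R as a Bayesian update on a grid of theta values. This is only an illustration under stated assumptions: a standard normal prior, a 2PL response curve, and a first item with discrimination 1 and difficulty 0 (all hypothetical values, not taken from the slides).

```r
# 2PL probability of a correct response; a = discrimination, b = difficulty
p_correct <- function(theta, a = 1, b = 0) {
  1 / (1 + exp(-a * (theta - b)))
}

theta_grid <- seq(-3, 3, by = 0.01)  # same range as the Theta axis on the slide
prior <- dnorm(theta_grid)           # assumed standard normal prior

# Posterior after a correct answer to a medium-difficulty item (b = 0, assumed)
posterior <- prior * p_correct(theta_grid, a = 1, b = 0)
posterior <- posterior / sum(posterior)

# "Most likely score": the grid point where the posterior peaks
most_likely <- theta_grid[which.max(posterior)]
most_likely
```

After one correct answer the peak moves above zero, so the next item would be chosen with a difficulty somewhat above average.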

Standard test to assess Kumamon

[Figure: each marker is a question from our test, placed along a maths ability scale running from 2+2 to 1134 x 16]

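• Reliability of theta estimate (standard error)

• Other, clever stuff

$\text{reliability} = 1 - SE^2$ (with theta on a standardised scale, i.e. variance 1)

Alpha = 0.90 corresponds to SE = 0.32
Alpha = 0.80 corresponds to SE = 0.45
Alpha = 0.70 corresponds to SE = 0.55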
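The correspondence between reliability and standard error can be checked with a few lines of base R. The helper names below are made up for this sketch, and the relation assumes theta is on a scale with variance 1.

```r
# reliability = 1 - SE^2, rearranged both ways
se_from_reliability <- function(reliability) sqrt(1 - reliability)
reliability_from_se <- function(se) 1 - se^2

targets <- c(0.90, 0.80, 0.70)
round(se_from_reliability(targets), 2)
```

A stopping rule based on reliability can therefore be written as a standard error threshold, for example stopping once SE falls to about 0.32 for a target reliability of 0.90.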
1. The pool of available items is searched for
the optimal item, based on the current
estimate of the examinee's ability
2. The chosen item is presented to the
examinee, who then answers it correctly or
incorrectly
3. The ability estimate is updated, based upon
all prior answers
4. Steps 1–3 are repeated until a termination
criterion is met
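The four steps above can be sketched as a small simulation in base R. Everything here is an assumption made for illustration: the item bank is randomly generated, items follow a 2PL model, the ability estimate is an expected a posteriori (EAP) mean on a grid rather than any specific package's estimator, and the test stops after 10 items or once the standard error drops to 0.45.

```r
set.seed(1)

# Hypothetical 2PL item bank: a = discrimination, b = difficulty
bank <- data.frame(a = runif(30, 0.8, 2), b = rnorm(30))

p_correct <- function(theta, a, b) 1 / (1 + exp(-a * (theta - b)))
item_info <- function(theta, a, b) {
  p <- p_correct(theta, a, b)
  a^2 * p * (1 - p)
}

grid <- seq(-4, 4, by = 0.01)

# EAP estimate and posterior SD on a grid, with a standard normal prior
estimate_theta <- function(items, responses) {
  post <- dnorm(grid)
  for (k in seq_along(items)) {
    p <- p_correct(grid, bank$a[items[k]], bank$b[items[k]])
    post <- post * (if (responses[k] == 1) p else 1 - p)
  }
  post <- post / sum(post)
  est <- sum(grid * post)
  c(theta = est, se = sqrt(sum((grid - est)^2 * post)))
}

run_cat <- function(true_theta, max_items = 10, se_target = 0.45) {
  administered <- integer(0)
  responses <- integer(0)
  est <- c(theta = 0, se = 1)
  # Step 4: repeat until a termination criterion is met
  while (length(administered) < max_items && est[["se"]] > se_target) {
    # Step 1: pick the unused item with the most information at the current estimate
    info <- item_info(est[["theta"]], bank$a, bank$b)
    info[administered] <- -Inf
    nxt <- which.max(info)
    # Step 2: present the item (here the answer is simulated from true_theta)
    resp <- rbinom(1, 1, p_correct(true_theta, bank$a[nxt], bank$b[nxt]))
    administered <- c(administered, nxt)
    responses <- c(responses, resp)
    # Step 3: update the ability estimate from all answers so far
    est <- estimate_theta(administered, responses)
  }
  list(items = administered, responses = responses,
       theta = est[["theta"]], se = est[["se"]])
}

result <- run_cat(true_theta = 1)
result$items
```

Marking administered items with `-Inf` information is a simple way to make sure no item is shown twice.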
• Efficiency: how many items do I need to ask before I get to a certain level of precision?

• Precision: how precise can my measurement be?

• What do we need for CAT?
◦ Item information (questions, scoring keys)
◦ Item parameters
◦ Item selection method
◦ Scoring algorithm
◦ Stopping rule
◦ Others ……
catR package
• Women's Mobility
◦ Item 1: Go to any part of the village/town/city.
◦ Item 2: Go outside the village/town/city.
◦ Item 3: Talk to a man you do not know.
◦ Item 4: Go to a cinema/cultural show.
◦ Item 5: Go shopping.
◦ Item 6: Go to a cooperative/mothers' club/other club.
◦ Item 7: Attend a political meeting.
◦ Item 8: Go to a health centre/hospital.
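library(ltm)   # provides ltm() and the Mobility data set
library(catR)  # provides thetaEst(), semTheta(), nextItem()
# Fit a two-parameter logistic (2PL) model to the eight Mobility items
my2pl <- ltm(Mobility ~ z1)
plot(my2pl, type = "IIC")  # item information curves
# coef() returns difficulty in column 1 and discrimination in column 2
pars <- coef(my2pl)
# catR item bank columns: discrimination, difficulty, lower asymptote, upper asymptote
itemBank <- cbind(pars[, 2], pars[, 1], 0, 1)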
Choose the item to start with:
• Max info around average?
plot(my2pl, type = "IIC")
plot(my2pl, type = "IIC", items = 4)
• Random one?
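Both starting rules can be sketched without fitting a model. The item bank below is hypothetical (four made-up items in the same column layout as `itemBank`), and `info_2pl` is a helper written for this sketch that uses the 2PL item information a² · P · (1 − P).

```r
# Hypothetical item bank: discrimination, difficulty, lower and upper asymptote
item_bank <- cbind(a = c(1.2, 0.8, 1.5, 1.0),
                   b = c(-1.0, 0.2, 0.1, 1.5),
                   c = 0, d = 1)

# 2PL item information at a given theta
info_2pl <- function(theta, a, b) {
  p <- 1 / (1 + exp(-a * (theta - b)))
  a^2 * p * (1 - p)
}

# Option 1: the item with the most information at the average ability (theta = 0)
info_at_average <- info_2pl(0, item_bank[, "a"], item_bank[, "b"])
start_max_info <- which.max(info_at_average)

# Option 2: a random starting item
set.seed(42)
start_random <- sample(nrow(item_bank), 1)

start_max_info
```

With these made-up parameters the third item wins, because it combines the highest discrimination with a difficulty close to zero. Always starting with the same item increases its exposure, which is one reason to consider a random start.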
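# Item 4 has been administered and answered with response 1
items_administered <- c(4)
responses <- c(1)

# Parameters of the administered items; drop = FALSE keeps a matrix even for one row
it <- itemBank[items_administered, 1:4, drop = FALSE]
theta <- thetaEst(it, responses)   # ability estimate (catR default: Bayes modal)
sem <- semTheta(theta, it)         # standard error of that estimate

# Most informative remaining item at the current estimate (catR default: MFI)
q <- nextItem(itemBank, theta = theta, out = items_administered)
q$item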
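To see roughly what the scoring step computes, here is a base R sketch of a Bayes modal estimate and its standard error after a single response. It does not call catR; the item parameters are hypothetical, the prior is assumed to be standard normal, and the standard error uses the common approximation 1 / sqrt(test information + 1 / prior variance).

```r
# Hypothetical parameters for the one administered item
a <- 1.3
b <- 0.4
response <- 1

# Log posterior: log prior plus log likelihood of the observed response
log_posterior <- function(theta) {
  p <- 1 / (1 + exp(-a * (theta - b)))
  dnorm(theta, log = TRUE) + dbinom(response, 1, p, log = TRUE)
}

# Bayes modal estimate: the theta that maximises the log posterior
theta_bm <- optimize(log_posterior, interval = c(-4, 4), maximum = TRUE)$maximum

# Approximate standard error from the item information plus the prior precision (1)
p_hat <- 1 / (1 + exp(-a * (theta_bm - b)))
test_info <- a^2 * p_hat * (1 - p_hat)
sem_bm <- 1 / sqrt(test_info + 1)

c(theta = theta_bm, sem = sem_bm)
```

With only one response the standard error stays close to the prior standard deviation of 1, which is why several items are needed before a precision-based stopping rule is met.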
[Diagram: boxes labelled Introduction, Items, Feedback, Bank, Parameters, Responses, Logic, Theta and SEM]
• Concerto hosting website
◦ https://hosting.concertoplatform.com/user/registration
• Sign up and log in
• Create your own server
• Start your Concerto experience
• Name
• URL
• Node:
◦ info
◦ questionnaire
◦ CAT
◦ form (save_data)
◦ feedback
• Basic questionnaire

• CES-D scale (The Center for Epidemiologic Studies Depression Scale; Radloff, 1977)
◦ 20 items
◦ 4 response options
◦ Score above 16 indicates depression
• https://concertotest.com/luning/SSRMC/test/cesd

Radloff, L. S. (1977). The CES-D scale: A self-report depression scale for research in the general population. Applied Psychological Measurement, 1(3), 385-401.
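A questionnaire like this is scored by summing the item responses and comparing the total with the cutoff. The sketch below is an illustration, not the Concerto node's implementation: it assumes responses are coded 0 to 3 and that items 4, 8, 12 and 16 are reverse-keyed, neither of which is stated on the slide, and `score_cesd` is a made-up helper name.

```r
# Sum score for a 20-item scale with 4 response options coded 0-3 (assumed coding)
score_cesd <- function(responses, reverse_items = c(4, 8, 12, 16), cutoff = 16) {
  stopifnot(length(responses) == 20, all(responses %in% 0:3))
  # Reverse-keyed items (assumed): 0 <-> 3, 1 <-> 2
  responses[reverse_items] <- 3 - responses[reverse_items]
  total <- sum(responses)
  # "Score above 16", as on the slide
  list(total = total, above_cutoff = total > cutoff)
}

# Example: a respondent who answers 1 on every item
example <- score_cesd(rep(1, 20))
example$total
```

The cutoff is a parameter, so a different threshold convention can be applied without changing the function.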
• CAT – dichotomous

• Women's Mobility
◦ 8 items in the item bank
◦ Item selection: MFI (maximum Fisher information)
◦ Scoring: BM (Bayes modal estimation)
◦ Stopping: after 3 items
◦ Randomesque: 1
◦ Content balancing: no
◦ Feedback:
score$score <- round(score$theta * 15 + 100, 0)
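The feedback line rescales theta to a scale with mean 100 and standard deviation 15. A standalone version of the same transformation, with a made-up function name, might look like this:

```r
# Rescale theta (mean 0, SD 1) to a reporting scale with mean 100 and SD 15
theta_to_feedback <- function(theta) round(theta * 15 + 100, 0)

theta_to_feedback(c(-1, 0, 1.2))
```

A theta of 0 maps to 100, and each standard deviation above or below the mean adds or subtracts 15 points.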
• faceiq.icar-project.com
◦ Adaptive face detection test
◦ Adaptive emotion recognition test
◦ Adaptive abstract reasoning test
◦ And more ……
• Any questions?

 Large item bank

Kingsbury, G. G., and Zara, A. R. (1989). Procedures for

 Test time (5 minutes)
