Chapter Fourteen: Data Preparation
Chapter Fourteen: Data Preparation
Data Preparation
14-2
Chapter Outline
1) Overview
2) The Data Preparation Process
3) Questionnaire Checking
4) Editing
i. Treatment of Unsatisfactory Responses
5) Coding
i. Coding Questions
ii. Code-book
iii. Coding Questionnaires
14-3
Chapter Outline
6) Transcribing
7) Data Cleaning
i. Consistency Checks
ii. Treatment of Missing Responses Adjusting
the
8) Statistically Adjusting the Data Data
i. Weighting
ii. Variable Respecification
iii. Scale Transformation
9) Selecting a Data Analysis Strategy
14-4
Chapter Outline
10) A Classification of Statistical Techniques
11) Ethics in Marketing Research
12) Internet & Computer Applications
13) Focus on Burke
14) Summary
15) Key Terms and Concepts
14-5
Check Questionnaire
Edit
Code
Transcribe
Clean Data
Questionnaire Checking
A questionnaire returned from the field may be
unacceptable for several reasons.
Parts of the questionnaire may be incomplete.
Editing
Treatment of Unsatisfactory Results
Returning to the Field – The questionnaires
Coding
Coding means assigning a code, usually a number, to each
possible response to each question. The code includes an
indication of the column position (field) and data record it will
occupy.
Coding Questions
Fixed field codes, which mean that the number of records for
each respondent is the same and the same data appear in the
same column(s) for all respondents, are highly desirable.
If possible, standard codes should be used for missing data.
Coding of structured questions is relatively simple, since the
response options are predetermined.
In questions that permit a large number of responses, each
possible response option should be assigned a separate column.
14-10
Coding
Guidelines for coding unstructured questions:
Category codes should be mutually exclusive and
collectively exhaustive.
Only a few (10% or less) of the responses should fall
possible.
14-11
Coding
14-12
Coded Questionnaire
14-13
Codebook
A codebook contains coding instructions and the
necessary information about variables in the data set.
A codebook generally contains the following
information:
column number
record number
variable number
variable name
question number
instructions for coding
14-14
Codebook
14-15
Coding Questionnaires
The respondent code and the record number appear
on each record in the data.
The first record contains the additional codes: project
code, interviewer code, date and time codes, and
validation code.
It is a good practice to insert blanks between parts.
Data Cleaning 14-16
Consistency Checks
SPSS Data
Statistically Adjusting the Data 14-19
Weighting
In weighting, each case or respondent in
the database is assigned a weight to reflect
its importance relative to other cases or
respondents.
Weighting is most widely used to make the
sample data more representative of a target
population on specific characteristics.
Yet another use of weighting is to adjust the
sample so that greater importance is attached
to respondents with certain characteristics.
14-20
High School
1 to 3 years 6.39 8.65 1.35
4 years 25.39 29.24 1.15
College
1 to 3 years 22.33 29.42 1.32
4 years 15.02 12.01 0.80
5 to 6 years 14.94 7.36 0.49
7 years or more 12.18 6.90 0.57
Totals 100.00 100.00
14-21