Paper 2 Research Methods
Paper 2 Research Methods
RESEARCH METHODS
Psychology is a science
Peoples intuitions about human behaviour also known as folk psychology after
turn out to be wrong. This is one primary reason that psychology relies on science
rather than common sense.
2. The second feature is that it is concerned with empirical questions - these are
questions about the way the world actually is
3. The third feature of science is that it creates public knowledge, that is scientists
publish their work to be reviewed by their peers.
fi
fi
UNIT 1: RESEARCH METHODS
1.1 Experiments
1.2 Self-reports
1.4 Observations
1.5 Correlations
2.Interviews
• Structured
Objective Subjective
factual/not impacted by interpretation description/impacted by interpretation
Idiographic
Nomothetic
Provide explanations
Provide evidence
Observations: Observations:
Structured + semi structured Unstructured
Example of data
1. Physiological measures
MRI scans - changes in di erent regions of the brain in terms of volume and activity
2. Psychometric tests
IQ test
GAD-7
FFMQ: ’I perceive my feelings and emotions without having to react to them’ (rating scale:
never or very rarely true, rarely true, sometimes true, often true, very often or always true)
4. Observations
Structured observations
ff
1.1 EXPERIMENTS
PAPER 2 RESEARCH METHODS
EXPERIMENTs
IV DV
Independent Dependent
variable variable
CAUSE EFFECT
Manipulated Measured
Changed Result
Sleep Concentration
1.1 Experiments
Experiment:
An experimental research is used to identity causal links between variables. That is,
they test whether one variable causes a change in another variable.
Researchers manipulate one variable (IV) and observe/measure the e ect it has on
the other variable (DV) .
the factor in an experiment which is measured and is expected to change under the
in uence of the independent variable.
This concrete de nition allows researchers to measure and compare aggression consistently across
di erent studies, improving the scienti c rigour of psychological research.
ff
fi
fi
fi
fi
fi
fi
Practise questions:
A hypothesis in a study states ‘Recall will be better after a short delay than after a long
delay’.
..............................................................................................................................................[1]
Suggest one way to operationalise ‘short delay and long delay’ in this study.
..................................................................................................................................................
.............................................................................................................................................[1]
Elea is conducting an experiment to test whether people who sleep for more than eight
hours per night are happier than people who sleep less than this.
(a) (i) Elea’s dependent variable is ‘happiness’. Suggest how this could be operationalised.
..................................................................................................................................................................
…………………………………………………………………………………………………………
[1]
(ii) Suggest one disadvantage of operationalising happiness in this way.
..................................................................................................................................................................
…………………………………………………………………………………………………………
..................................................................................................................................................................
…………………………………………………………………………………………………………
[2]
Describe why psychologists operationally define the independent variable and the dependent variable in
experiments, using any examples.
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
.................................................................................................................................................................[6]
Practise question:
Albert is investigating autism. He plans to obtain a sample of people with autism and a control group.
He will compare how well these participants judge the feelings of dolls which are either dancing,
crying, waving or smiling.
(a) Identify the independent variable in this experiment.
..................................................................................................................................................
..............................................................................................................................................[1]
(b) Suggest how the dependent variable in this experiment could be measured.
..................................................................................................................................................................
…………………………………………………………………………………………..………………
..................................................................................................................................................................
………………………………………………………………………………………………………[2]
Zho is investigating how quickly fish learn to respond to different coloured lights. She trains
fish in two tanks to swim to the end of the tank for food in response to a light. She uses a
red light with one tank and a green light with the other tank. She times how quickly the fish
swim to the end of the tank.
(a) Identify the dependent variable in this experiment.
..................................................................................................................................................
..............................................................................................................................................[1]
(b) Identify the independent variable in this experiment.
..................................................................................................................................................
..............................................................................................................................................[1]
Elea is conducting an experiment to test whether people who sleep for more than eight
hours per night are happier than people who sleep less than this.
(a) (i) Elea’s dependent variable is ‘happiness’. Suggest how this could be operationalised.
..................................................................................................................................................................
…………………………………………………………………………………………………………
[1]
(ii) Suggest one disadvantage of operationalising happiness in this way.
..................................................................................................................................................................
.................................................................................................................................................................
.................................................................................................................................................................
.................................................................................................................................................................
[2]
Olivia is conducting an experiment to investigate whether students concentrate better in
class before or after eating. Her experimental design is a repeated measures design.
(a) Identify the independent variable (IV) in this experiment.
..................................................................................................................................................
..............................................................................................................................................[1]
(b) Identify the dependent variable (DV) in this experiment.
..................................................................................................................................................
..............................................................................................................................................[1]
Establishing a causal
relationship: controlling variables
In order to establish a causal relationship the researcher must be able
to assert with some degree of con dence that it was indeed the IV that
caused the change in the DV. In order to do this the researcher must
control all the other variables.
Instructions or instruments
Health conditions
used
Kinds of variables:
controlled variables: A controlled variable in an experiment, is a factor or condition
that is intentionally kept consistent and unchanged throughout the experiment.
This is done to ensure that any observed e ects or di erences in the dependent
variable(s) are the result of the independent variable(s) being manipulated and
not due to variations in other factors.
To remove its e ects a control variable may be held at a constant level during
the study or managed by statistical means
• These are any variables other than the independent variable (IV) that might a ect the dependent
variable (DV). They can in uence the outcome of the experiment if not controlled.
• Extraneous variables can exist in any experiment but do not necessarily correlate with the IV.
Confounding Variables
• These are a subset of extraneous variables that vary systematically with the IV and therefore
provide an alternative explanation for the results.
• They make it unclear whether the IV or the confounding variable caused changes in the DV.
• Confounding variables threaten the internal validity of an experiment because they introduce a
hidden relationship.
• Example: In the same experiment on sleep and test performance, if participants who got less
sleep also had to wake up earlier (leading to increased fatigue), fatigue would be a confounding
variable. This is because it is tied to both the amount of sleep (IV) and test performance (DV).
fl
ff
fl
ff
Extraneous variable:
room temperature
IV: DV:
Sleep Test performance
Confounding variable:
Fatigue
IV: DV:
Sleep Test performance
Practise question:
Explain why it is important to control extraneous variables in experimental studies.
..................................................................................................................................................................
..................................................................................................................................................................
..................................................................................................................................................................
................................................................................................................./...............................................
[2]
Why is it important to control extraneous variables?
(b) Suggest how Mikko could operationalise the dependent variable in his experiment.
………………………………………………………………………………………………………………………….
…………………………………………………………………………………………………………………………......
………………………………………………………………………………………………………………………….
………………………………………………………………………………………………………………………….[2]
Kinds of groups in an experiment (IV):
The experimental condition/ experimental group:
No Control
IV (3) - music group
IV absent DV
ff
ff
ff
Why do researchers
need a control group?
• To establish a baseline for comparison with the
experimental groups. In this way the control group allows
researchers to measure the e ects of the experimental
treatment by providing a reference point una ected by the
manipulated variable.
Independent
GENDER
variable
Independent Treatment
variable status
Independent
Sleep duration
variable
Manipulated into
0 hrs 4 hrs 7 hrs 9 hrs
four levels
recommend
no sleep short extended
ed
Dependent
Cognitive performance
variable
LEVELS OF THE IV
Five
0-9 10-18 19-29 30-39 +40
levels
reaction time
Dependent
(measured in ms using a computerised
variable task)
fi
Levels of the IV: Example1
Placebo vs. Medication
Hypothesis: Patients who take the true medication pill will experience a greater reduction in depression
levels compared to those who take the placebo pill or those in the control condition (no pill).
Variables:
• Dependent Variable (DV): Overall change in depression (measured by a depression scale such
as the BDI score)
ff
Levels of the IV: Example 2
For example, suppose a teacher wants to know how three di erent studying techniques
a ect exam scores. She randomly assigns 30 students each to use one of the three
studying techniques for a week, then each student takes the exact same exam.
In this example, the independent variable is Studying Technique and it has three
levels:
• Technique 1
• Technique 2
• Technique 3
In other words, there are the three experimental conditions that the students can
potentially be exposed to.
The dependent variable in this example is Exam Score, which is “dependent” on the
studying technique used by the student.
ff
ff
BANDURA ET AL
THE BOBO DOLL
EXPERIMENT
CAN AGGRESSION BE
LEARNT?
Bandura et al: the process
Subjects exposed
to either an Children are
Aggression
aggressive or observed for
arousal
nonaggressive imitative behaviour
model
Understanding the levels of the IV:
Bandura et al: Aggression
Experiments:
How do experimenters make sure extra variables do not impact the validity of their
research?
Two kinds of groups in an experiment: the experimental group and the control
group
• Strength: This design is good because the participants only encounter the
experimental design setting once. They are therefore unlikely to respond
due to demand characteristics. Also there are no order e ects.
• Weakness: More participants are needed. One problem is that there might
be individual di erences between participants (participant variables) that
might in uence the ndings. This risk can be reduced by random
allocation of participants to di erent conditions.
• Strength: The main advantage is that each person acts as their own
baseline. Any di erence between participants that could in uence their
performance and therefore the DV, will a ect both levels of the IV in the
same way. Individual di erences (or participant variables) are therefore
unlikely to bias the ndings
MEAN SCORES
p1 p2 p3 p4
IV 1: Loud noise=
3+4/2
3.5
IV 1
Test score: Test score:
Loud noise 3/10 4/10 IV 2: no noise=
8+9/2
8.5
Participants
scored more
when their
attention was
IV 2 tested in the ‘no
No noise Test score: Test score: noise’ condition
as compared to
8/10 9/10
the loud noise
condition
INDEPENDENT VARIABLE
DEPENDENT VARIABLE
IV 1: loud noise
EXTRANEOUS VARIABLE
INDEPENDENT VARIABLE
DEPENDENT VARIABLE
IV 2: no noise (control)
MEAN SCORES
p1 p2 p3 p4
IV 1: Loud noise=
3+4+2+4/4
IV1 IV2 IV1 IV2
3.25
(loud noise) (no noise) (loud noise) (no noise)
DAY 1
Test score: Test score: Test score: Test score: IV 2: no noise=
7+8+8+9/4
3/10 8/10 2/10 9/10
8.0
Participants
scored more
IV2 IV1 IV2 IV1 when their
(no noise) (loud noise) (no noise) (loud noise) attention was
DAY 2 tested in the ‘no
noise’ condition
Test score: Test score: Test score: Test score:
as compared to
7/10 4/10 8/10 4/10 the loud noise
condition
INDEPENDENT VARIABLE
INDEPENDENT VARIABLE
DEPENDENT VARIABLE
IV 2: no noise (control)
matched
Group A with IV1
group B
matched
Group B with IV 2
group A
Strengths and weaknesses of experimental design:
Practise questions:
Bella is conducting a laboratory experiment to find out about people’s fear of animals. She
asks each participant if they have any phobias. If the participant says yes they are asked to
leave. The remaining participants are shown photographs while their pulse rate is measured.
The independent variable (IV) is whether the photographs are of animals with eight legs or
animals with four legs.
The experiment used a repeated measures design.
Explain one advantage of this design in this study.
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
.................................................................................................................................................................[2]
Ed and Jim are planning a field experiment about learning. They want to know whether
young children learn to use mobile (cell) phones from older brothers or sisters. They each
offer their phone to a younger brother or sister and also to a cousin who has no older
brothers or sisters. They time how long it takes each child to find a game to play with on the
phone.
Explain the experimental design being used in this experiment.
..................................................................................................................................................................
..................................................................................................................................................................
..................................................................................................................................................................
.................................................................................................................................................................
[2]
Richard is conducting an experiment to investigate whether daydreams are affected by the type of
background noise. He has two groups who are both given a boring task in a room. For one group
there are children laughing outside the open window. For the other group there are adults talking
loudly in the corridor. Different participants are used in each group.
(a)
(i) Identify Richard’s experimental design. Include a reason for your answer.
......................................................................................................................................................................
......................................................................................................................................................................
......................................................................................................................................................................
.................................................................................................................................................................[2]
measures
• tools like pulse meters, MRI scans are
objective and therefore reliable
- how objective were
the tools used to
• self reports: close ended questionnaire +
structured interviews + structured
measure the DV or IV
observations provide objective and
quantitive data and are therefore improve the
reliability of a study
Reliability
consistency
procedure
- how standardised was
the procedure?
• if within each level of the IV the procedure is
consistent the study will have reliability
ecological validity
the extent to which the results of the study can be
addresses whether the results are applied to real world contexts
true/applicable to the real world
INTERNAL ECOLOGICAL
VALIDITY VALIDITY
FIELD EXPERIMENTS
INTERNAL ECOLOGICAL
VALIDITY VALIDITY
Factors that impact Internal Validity:
Participant variables: (independent measures)
• individual di erences between participants (such as age, personality and intelligence) that could a ect
their behaviour in a study. They could hide or exaggerate di erences between levels of the IV.
Participant variables such as IQ can e ect studies on memory studies.
• Solution: random allocation: a way to reduce the e ect of confounding variables such as individual
di erences. Participants are put in each level of the IV such that each person has an equal chance of
being in any conditions
• It is di cult to control participant variables in the eld, in the laboratory the researcher can be more
selective about the participants and therefore set up controls.
Situational variables:
can be things like the time of day, room temperature or how much sleep a participant has had.
Solution: Standardisation: Situational variables are easier to control and standardise in the lab, but
harder to control in the eld.
the e ect of boredom, fatigue or learning (the practise e ect) when participants go through di erent IV
conditions in an order. Can e ect validity in repeated measures design.
Solution: Counterbalancing: participants are divided into two groups, one group does the IV
conditions in one order (IV1 then IV2) and the other does the IV conditions in the reverse of that order
(IV2 then IV1)
ff
ff
ffi
ff
ff
fi
ff
ff
fi
ff
ff
ff
ff
ff
Demand characteristics (higher in repeated measures):
Demand characteristics are features of the experimental situation which give away the aims.
They can cause participants to try to change their behaviour, which reduces the validity of the
study.
Solution: a blind study- participants do not know whether they are in the experimental group or
the control group
Experimenter bias:
When the researchers expectations in uence the experiments results. It might be subtle clues in
the way they phrase things, or body language in the experimenter that e ect the participants
behaviour. For example: An experimenter who wants to see a drug passed
Solution: a double-blind technique- the experimenter and the participant both don’t know how
was in the control group and who was in the experimental group.
when people behave in a certain way because they want to portray themselves in a better light.
Social desirability is likely to be higher in the lab as participants know they are being studied.
Solution:
covert observation: where participants are not aware of being observed or studied
ller questions: in questionnaires help reduce social desirability bias by masking the true
purpose of the survey and diverting attention from sensitive or leading questions
fi
fl
fi
ff
Factors that impact Ecological validity:
• Control over variables: if the variables are controlled such as
in a lab experiment - the condition becomes so arti cial (which
improves internal validity) and unlike real life that it can loose its
applicability to real life (ecological validity)
Bedroom 1:
Lab low mundane Ecological
steel bed, tube lights, realism validity is low
white paint, plain room
Bedroom 2:
mundane
Lab Ecological
realism
cozy bed, rug, lamp, side slightly higher validity is better
table
Bedroom 1:
mundane
Field Ecological
realism
participant sleeps in their high validity is high
own bed
fi
Kinds of experiments:
1. Laboratory experiment: an experiment carried out
in a controlled/arti cial environment
The researcher can assess responses by Practical and ethical constraints- It may be
qualitative or quantitative means. unethical or impossible to assign people to
groups. For example: Unethical to deliberately
deprive children of sleep. Impossible or
expensive to send people on trips to study jet
lag
ff
ff
Describe the differences between a field experiment and a laboratory experiment, using any
examples.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
................................................................................................................................................[6]
Ed and Jim are planning a field experiment about learning. They want to know whether
young children learn to use mobile (cell) phones from older brothers or sisters. They
each offer their phone to a younger brother or sister and also to a cousin who has no
older brothers or sisters. They time how long it takes each child to find a game to play
with on the phone.
(a) Suggest why Ed and Jim chose a field experiment as their research method.
............................................................................................................................................................
............................................................................................................................................................
............................................................................................................................................................
............................................................................................................................................................
[2]
Gavin is testing the hypothesis that ‘People are more likely to drop rubbish on the ground when there is
already rubbish on the ground’. He is conducting a field experiment in a quiet street. His independent
variable is whether the street already has rubbish on the ground or not. Gavin’s dependent variable is
whether people drop a leaflet given to them by a stooge onto the ground or not.
(c) Explain one reason why Gavin chose to conduct a field experiment rather than a laboratory experiment.
......................................................................................................................................................................
.......................................................................................................................................................................
.......................................................................................................................................................................
......................................................................................................................................................................
…………………………………………………………………………………………………………………………[2]
......................................................................................................................................................................
.......................................................................................................................................................................
.......................................................................................................................................................................
......................................................................................................................................................................
…………………………………………………………………………………………………………………………[2]
Hypotheses in
experimental research
a hypothesis is a testable statement based on the
aims of a research
* Make sure that you always state all/both of the levels of the IV (e.g. IV1 & IV2) and
the DV otherwise your null hypothesis will not make sense. For example, the null
hypothesis: ‘There is no di erence between mind maps and revision apps’ is meaningless=
0 marks - because the DV is not mentioned.
ff
ff
ff
ff
ff
ff
ff
Type of
Definition Example
Hypothesis
1. Mindfulness and
Anxiety
There will be a difference in Students who practice There will be no difference in
anxiety levels (DV) between mindfulness (IV1) will have anxiety levels (DV) between
Students who practice students who practice lower anxiety levels (DV) than students who practice
mindfulness (IV1) and mindfulness (IV1) and those students who do not practice mindfulness (IV1) and those
students who do not who do not (IV2). mindfulness (IV2). who do not (IV2).
practice mindfulness (IV2)
with anxiety levels (DV).
3. Introverts/Extroverts
and Social Behavior There will be a difference in There will be no difference in
Extroverts will engage in more
social behaviour between social behaviour during group
social behavior during group
Introverts and extroverts introverts and extroverts tasks between introverts and
tasks than introverts.
with social behavior during during group tasks. extroverts.
group tasks (DV).
Penny is using cats and parrots in her experiment. Penny houses each animal alone
and only gives them their daily food every evening. Her independent variable is the
species. She thinks that parrots will share food because they are social animals
whereas cats live on their own. To test this, two animals of the same species are put
together with a small bowl of food every afternoon.
(a) Penny’s dependent variable is whether the animals share the food. Suggest how
Penny could operationalise this dependent variable.
............................................................................................................................................................
............................................................................................................................................................
............................................................................................................................................................
………………………………………………………………………………………………………
[2]
• food sharing could be operationalised as how much food each animal eats;
• see whether one animal waits for the other/see if they both get something to eat;
(b) 2 marks for a fully operationalised non-directional hypothesis, correctly using cats and parrots, and saying how food
sharing will be measured
• Cats (IV1) and parrots (IV2) will differ in the time they spend sharing food (DV) (2)
• There will be a difference in the number of pieces of food (DV) shared between the cats (IV1) and parrots (IV2) (2)
• There will be a di erence in food sharing between the cats and parrots (1)
• There will be a di erence in helpfulness between the cats and parrots (1)
• Adults are more likely to play internet-based games than younger people = 1 (experimental not
operationalised)
The older you are the more likely you are to play internet games = 1 mark (correlational not
operationalised)
• Older people/adults are more likely to play internet based games for more than 2 hours per day
than younger people / children. = 2 marks
• 12 year olds are more likely to play internet based games than people over 40 = 2 marks
• The older you are in years the more likely you are to play an internet game. = 2 marks
• The older you are the less likely you are to play an internet game for more than 8 hours a day. = 2
marks
An experiment is testing the aim that smiling a ects helpfulness. The procedure is to
investigate whether people leaving a shop will hold a door open for people who are
smiling.
(a) Suggest an operationalised directional (one-tailed) hypothesis that Otto could test.
……………………………………………………………………………………………………………
............................................................................................................................................................
............................................................................................................................................................
……………………………………………………………………………………………………………[2]
(b) Suggest an null hypothesis for this experiment.
………………………………………………………………………………………………………………
…………………………
……………………………………………………………………………………………………………[1]
ff
(a)
People help more when they are smiled at but don’t when they are not = 2 marks.
This has smiling (IV1) and not smiling (IV2) there is the DV (help more) and there is 'helping more when smiled
at' so is one tailed.
People are more helpful so hold doors open more often when they are smiled at = 2 marks [more helpful/hold
doors (DV) when smiled at (IV1) but no IV2 so this can only be 1 mark]
People are more helpful than when they are not smiled at = 1 mark.
More helpful (in what way?) than when (with no detail) not smiled at?]
People are more helpful so hold doors open more often when they are smiled at than when they are not
smiled at = 2
People leaving a shop will hold the door open for people who smile = 0 marks (too vagueWrite a null hypothesis
for this experiment. 1
(b) Has to have two levels of IV but DV does not have to be operationalised
Any di erence between how helpful people when they are smiled at or not is due to chance. = 1 mark
There is no di erence between helpful people when they are smiled at or not. = 1 mark
No marks for statements that are not hypotheses, or are experimental/ alternative hypotheses.
ff
ff
Carol is interested in theory of mind. She is comparing adults in ‘caring’ jobs such as
nursing with adults in ‘non-caring’ jobs that have less contact with the public. Carol
believes that adults in caring jobs will score higher on the Eyes Test than those in non-
caring jobs.
(a) Write a null hypothesis for Carol’s study.
............................................................................................................................................................
............................................................................................................................................................
............................................................................................................................................................
…………………………………………………………………………………………………[1]
1 mark for a null hypothesis (whether operationalised or not)
0 marks for an experimental/ alternative hypothesis / correlational null.
e.g. There will be no difference between eyes test scores for caring job adults and non-
caring job adults = 1
e.g. Any difference between theory of mind in nurses and typists is due to chance = 1
e.g. There will be no difference between theory of mind and caring or non- caring jobs = 0
(nonsense)
e.g. Any difference between eyes test scores and adults job type is due to chance = 0
(nonsense)
e.g. There will be a difference between eyes test scores for caring job adults and non-caring job
adults = 0 (H1)
e.g. Theory of mind in nurses will be better than in typists = 0 (H1)
Gavin is testing the hypothesis that ‘People are more likely to drop rubbish on the
ground when there is already rubbish on the ground’. He is conducting a field
experiment in a quiet street. His independent variable is whether the street already has
rubbish on the ground or not. Gavin’s dependent variable is whether people drop a
leaflet given to them by a stooge onto the ground or not.
(a) State the type of hypothesis Gavin is testing. Include a reason for your answer.
............................................................................................................................................................
............................................................................................................................................................
............................................................................................................................................................
............................................................................................................................................................
[2]
1 mark for type of hypothesis.
1 mark for the reason (even if name is incorrect). This
may or may not be linked.
(a) Describe how Sara can conduct a laboratory experiment to test whether
people with a higher IQ are better at concentrating than people with a lower
IQ. [10 marks]
(b) Identify one practical weakness/limitation with the procedure you have
described in your answer to part (a) and suggest how your study might be
done di erently to overcome the problem.
can be used to ask open or closed questions can be used to ask open or closed questions
but mostly used to ask closed questions but mostly used to ask open questions
mostly used to collect Quantitative data (QL) mostly used to collect Qualitative data (QL)
• May su er from response bias and social • May su er from interviewer bias and
desirability bias. social desirability bias.
Data can be qualitative, but may also be quantitative depending on type of question
Strengths
Asking people directly means that participants are given the opportunity to express their feelings and explain their behaviour
rather than the researcher trying to work out reasons for their behaviour from other methods e.g observation
Weaknesses:
Some participants may provide socially desirable responses/ not give truthful answers/respond to demand characteristics.
Researchers have to be careful about use of leading questions; it could a ect the validity of the data collected.
Quantitative data can be scored (objective and reliable) and compared to all other people (statistically analysed)
completing the questionnaire.
Closed/ xed choice questions may force people into choosing answers that do not re ect their true opinion and
therefore may lower the validity.
Quantitative data o ers depth and detail into an individuals perspective or experiences, the responses of the di erent
participants are di cult to compare
Data take time and e ort to analyse (thematic analysis) and therefore sample sizes are likely to be kept small
Data Collection Unstructured Interviews, open-ended tools (e.g. pulse meters, MRI), closed-ended
Methods questionnaires questionnaires, structured observations
Focus In-depth exploration of individual Identi cation of patterns and trends in groups
experiences
Objectivity An issue with qualitative data (QL) A strength of quantitative (QT) data gathered
gathered from open ended questions and from close ended questions (and structured
unstructured observations, that require observations) that are based on quanti cation.
interpretation.
Examples Interview on coping with stress Number of hours slept correlated with stress
fi
fi
Questionnaires
open questions ask for descriptions/detail closed questions ask for answers from
speci c options
• Ask questions that begin with why, what, • Di erent kind of questions:
how
• Dichotomous questions (YES/NO)
• e.g: How would you describe your • multiple choice questions (MCQs)
relationship with your mother?
• Rating scales
• e.g. Why are you a vegetarian?
• e.g. What motivates you to exercise? • Likert scales
ff
fi
fi
close-ended questions
Rating scale
Qualitative data provides in-depth and detail Quantitative data allows for statistical
data analysis and comparisons
Harder to give to a larger sample and Can be given to a larger sample and therefore
therefore not as generalisable o er generalisable results
Require thematic analysis which is time Statistical analysis - is quicker and easier to
consuming give to a large sample
Are subjective because responses require Provide objective data because the answers
interpretation - reduce validity do not require interpretation - improve validity
ff
fi
ff
fi
Questionnaires
Dif culty in interpreting answers can lead to Reliable as the same questions are answered
a lack of reliability by each participants (standardised) and can
be compared
Solution: Solution:
inter-rater reliability/intra-rater reliability can be tested internal consistency (split half test) and inter-rater
to see if the results are reliable reliability
The content of the questions are important - if not stated properly can be misunderstood and
therefore the study might lack face validity.
Reliability
close ended questions
• internal consistency (across items- measured using the split-half test)
• test-retest reliability (across time- measured using test-retest)
open ended questions
• inter-rater consistency (agreement between different raters)
• intra-rater consistency (consistency within the same rater over time)
fi
fi
fi
fi
fi
fi
Reliability in self reports
test-retest
test-retest reliability e.g if Baron-Cohen et
- giving the same test
al. used the AQ on the
- over time twice, to see if the
scores correlate well same group twice,
they should get the
same AQ each time
Reliability
consistency
refers to the
if one teacher grades consistency of a
intra-rater reliability the same essay twice single rater when
- same researcher at high intra-rater they assess the
di erent times consistency means they same subject, task,
give a similar score both or phenomenon at
di erent points in
time.
construct validity
Construct validity is improved by thoroughly testing the questionnaire
The extent to which a against established theories and ensuring it accurately measures the
Validity questionnaire accurately intended construct. It worsens when questions fail to capture the full
measures the theoretical scope of the concept or measure unrelated factors.
accuracy concept or construct it is
intended to assess.
The BDI improves construct validity by being grounded in cognitive
theories of depression i.e Beck’s work on cognitive distortions and
negative bias.
concurrent validity
QUESTIONNAIRES INTERVIEWS
schedule + new
Open ended schedule no schedule
questions
likely consist of close close + open ended consist mostly of open
ended questions questions ended questions
Quantitative
Quantitative + Qualitative less Qualitative
Standardised free- owing
data data Standardised data
Subjective
Objective thematic Valid
statistical Reliable Objective less reliable analysis have depth
analysis requires and detail
interpretation
STRENGTHS
1.Standardised - the questions, their order, interviewers clothes etc are xed for
every participant hence the measure is reliable and the results of the
interview comparable between participants
2.The questions can be close ended or open ended, therefore both QT and QL
data can be collected.
3.The interviewer can clarify the meaning of questions where needed.
WEAKNESSES
STRENGTHS
1. Some questions are xed which makes sure that there is some similar information from
every participant. This allows for comparisons and averages to be calculated
2. It is also possible to ask questions speci c to the participant. This allows the researcher
to develop ideas and explore issues that are particular to that person.
3. They offer more detail and possibly more validity than structured interviews but are not
as in-depth as unstructured interviews and so the extent of further knowledge gained is
not as much as with unstructured interviews.
4. Encourages two-way communication. Provides an opportunity so that interviewers can
learn answers to questions and the reasons behind the answers.
5. Allows respondents time to open up about sensitive issues.
6. Provides qualitative data to compare to previous and future data
WEAKNESSES
STRENGTHS
1.Produce very valid results because the researcher gets time to build a rapport with the
participant and gain their trust. This can lead to more honest and natural responses
2.Topics that require a detailed, sensitive and in depth exploration of the subjective reality of
respondents and a description of their experiences are very well covered by unstructured
interviews.
3.As there are no restrictions on the interviewer frequently the study yields knowledge that is
novel and insightful.
4.Although the social desirability effect is there because of the mere presence of the
researcher - the researcher has the opportunity to mediate it by building a rapport
WEAKNESSES
1. The study will lack objectivity because the results are not measurable or quantitative and
really on the interpretation of the researcher.
2. Furthermore, the researcher and subject are both deeply involved in the study which can lower
objectivity.
3. The data is dif cult to organise and interpret.
4. Usually two interpreters will be required to analyse/interpret the data/results.
fi
1.3 CASE STUDIES
PAPER 2 RESEARCH METHODS
Case studies
a case study is a detailed investigation of a single instance, usually just
one person although it could be a single family or institution.
useful for
ndings from case studies are highly valid - as individuals are studied in great depth and
detail
Weaknesses
As the sample is small and speci c the ndings can usually not be generalised.
The validity of case studies risk being compromised because of the close relationship that can
form between the researcher and participants which may reduce the objectivity of there
researcher.
The close bond between the researcher and participant may also reduce reliability, as a
di erent researcher might interpret data di erently.
The level of detail can also be an ethical threat as the researcher may compromise the
participants privacy and the details might make it hard to maintain con dentiality (disguise
identity)
fi
ff
fi
fi
ff
fi
Suggest one strength of using a variety
of of techniques in a case study
Can collect qualitative and quantitative data; (advantage)
Valid results: - in-depth data (about the history / symptoms / therapy) can
be collected about a subject's experience.
Application: Case studies are one of the best ways to stimulate new
research. A case study can be completed, and if the ndings are valuable,
they can lead to new and advanced research in the eld. There has been
a great deal of research done that wouldn't have been possible without
case studies. Oftentimes there are theories that may be questioned with
case studies.
fi
fi
Weaknesses
Generalisability: the sample is very small therefore generalisability is limited. If
one person is studied and they may be rare/unique/unusual, so researchers are
less likely to be able to generalise to the wider population / less representative
content of
e.g. depth / detail: e.g. types of doodles, frequency, e.g. associate with emotions /
information
being with friends/being outdoors, examples of rating scales etc etc.
collected
two or more
techniques for observation, interviews, questionnaires (rating scales)
data collection
Describe how Jason could conduct a case study on Karen to nd out about her doodling behaviour.
To conduct a simple and replicable case study, Jason could use a mixed-method approach (triangulation), combining
one quantitative and one qualitative method to collect data.
Quantitative Method: Observation with Behavioral Categories
Jason could use structured observations as the quantitative method to objectively record Karen's doodling behavior.
He can create a set of behavioral categories to note how frequently and under what conditions Karen doodles during
lectures. Some example categories could include:
• Doodling Frequency: Number of times Karen engages in doodling per class.
• Duration of Doodling: Length of time Karen doodles during a specific period.
• Situational Context: Whether Karen is doodling during challenging topics, during lecture breaks, or during more
relaxed discussions.
Jason would observe Karen over several lectures (e.g., five sessions) and record the frequency and duration of her
doodling using a simple tally chart or a time log. These behavioral categories can later be subjected to statistical
analysis, such as calculating averages or frequencies, to quantify her doodling habits.
Qualitative Method: Semi-Structured Interview
In addition to observation, Jason could conduct a semi-structured interview with Karen to gain deeper insights into
her thoughts and motivations behind doodling. This would serve as the qualitative method, allowing Karen to explain
why she engages in doodling. Sample interview questions could include:
1. “Can you describe what you experience while you are doodling?”
2. “What emotions or thoughts do you have when doodling during class?”
3. “Does doodling help you concentrate, or is it more of a relaxing activity?”
These open-ended questions allow Karen to express her subjective experience in her own words, providing rich
qualitative data.
fi
Jason is a university lecturer and is interested in doodling behaviour. One day, Jason sees one of his
students, Karen, doodling. Jason is especially interested because Karen has a visual impairment that
means she cannot see. Jason decides to conduct a case study on her.
Describe how Jason could conduct a case study on Karen to nd out about her doodling behaviour.
Data Analysis
• Quantitative Data: The data from the observations (e.g., doodling
frequency) can be analyzed using basic statistical analysis. For
instance, Jason could calculate the average number of doodles per
class or determine if doodling frequency increases during specific
lecture contexts.
• Qualitative Data: The interview responses can be analyzed using
thematic analysis to identify recurring themes in Karen's motivations
or feelings about doodling. Jason would transcribe the interview, then
code it for key themes, such as "doodling as a focus tool" or "sensory
experience."
number of passengers in a train carriage during a 7 minute period (Piliavin et al) (metric
number of people)
the time it took for the rst helper to respond (metric: seconds)
typically gathered from structured observations
Qualitative data
Controlled observation: took place in a laboratory where all the variables and measurements were
controlled, while the behaviour of participants was observed and recorded.
Participants level of obedience was measured through observation. This was operationalised as the
maximum voltage given in response to the orders. Observers also noted the participants body
language and any verbal comments or protests made throughout the procedure.
Milgram
fi
This was a eld experiment, where the independent variable was gender (male or female),
observations were used as a technique to measure the dependent variable of activities with the
toys.
Hasset et al For each social group, seven trails, each lasting 25 minutes, were observed using two video
cameras. Each trial began with all the moneys in the group indoors while one plush toy and one
wheeled toy were placed 10 meters apart in the outdoor enclosure. The video tape was analysed
by two observers working together to achieve consensus.
They identi ed each animal interacting with a toy and coded speci c behaviours directed towards
the toys using a behavioural checklist. The exact time at which activity occurred was also recorded
so in addition to the frequency of behaviour, duration was also recorded.
fi
fi
fi
BANDURA ET AL (BOBO DOLL)
• Bandura et al conducted an experiment where the dependent variable (DV) was the learning the child
displayed. This was measured through a controlled observation of the children and measures of
aggressive behaviour were recorded.
• Prior to the experimental part of the study, the children were observed in their nursery school by the
experimenter and a teacher who knew them well. They were rated on four ve-point scales
measuring physical aggression, verbal aggression, aggression to inanimate objects and aggression
inhibition (anxiety). They were then assigned to three groups, ensuring that the aggression levels of
the children in each group were matched. Of the 51 children rated by both observers (the rest were
rated by only one observer), similar ratings were generally produced. Their ratings were compared as
a measure of ‘inter-rater reliability’, which showed a high correlation between the observers, of r =
0.89.
• A test of the child’s aggression then followed in which the child was observed for 20 minutes using a
one-way mirror (covert). For the aggressive model group, this was a test of delayed imitation. This
experimental room contained a three foot (92 cm) Bobo doll, a mallet and some other objects. It also
contained some non- aggressive toys, including a tea set, crayons, etc. These toys were always
presented in the same order (controlled)
• The children’s behaviours were observed in ve second intervals (240 response units per child).
There were three ‘response measures’ of the children’s imitation, with a range of possible activities in
each. Partially imitative aggression was scored if the child imitated these behaviours incompletely.
fi
fi
• striking the Bobo doll with the mallet, sitting on the doll and punching it in the nose,
Imitation of physical
kicking the doll, and tossing it in the air.
aggression:
Imitative verbal • repetition of the phrases, ‘Sock him’, ‘Hit him down’, ‘Kick him’, ‘Throw him in the air’
aggression: or ‘Pow’.
Imitative non- • repetition of ‘He keeps coming back for more’ or ‘He sure is a tough fella’.
aggressive verbal
responses:
mallet aggression: • striking objects other than the Bobo doll aggressively with the mallet
sits on Bobo doll: • laying the Bobo doll on its side and sitting on it, without attacking it.
aggressive gun play: • shooting darts or aiming a gun and ring imaginary shots at objects in the room.
non-imitative • physically aggressive acts directed toward objects other than the Bobo doll and any
physical and verbal hostile remarks except for those in the verbal imitation category (e.g. ‘Shoot the
aggression: Bobo’, ‘Cut him’, ‘Stupid ball’, ‘Horses ghting, biting’ ‘Knock over people
fi
fi
OBSERVATIONs can be conducted in many di erent ways, for example
Structured/Unstructured
Controlled/Naturalistic
Overt/Covert
Participant/Non-participant
1. structured - the observers were measuring latency i.e. the time it took for
the rst helper to help and the number, sex and race of passengers in the
critical area etc.
3. participant - the observers were part of the social setting; they pretended
to be passengers and made conversation with other passengers while
recording their observations
4. covert - the role of the two female observers was not made obvious to the
passengers
fi
Strengths of naturalistic observation:
1. High Ecological Validity: Observing behaviors in a natural setting provides a
Naturalistic more realistic view of how participants behave in everyday contexts.
2. Reduced Demand Characteristics: Participants are often unaware they’re
Conducted in the participants being observed, so they’re more likely to act naturally, enhancing the validity of
normal envirnment without the findings.
interference from researchers in
either the social or physical Weaknesses of naturalistic observation:
environment
1. Low Reliability: Lack of control over variables can make it difficult to replicate
the study or achieve consistent findings, reducing reliability.
2. Potential Ethical Issues: Observing people in natural settings without their
knowledge may raise ethical concerns, particularly around informed consent
and privacy.
Unstructured 1. High Validity: Researchers record all behaviours as they occur, capturing the
complexity and natural flow of interactions for richer data.
the observer records the whole 2. Flexible and Adaptive: Allows researchers to explore new behaviours or
range of possible behaviours, patterns that emerge, providing a more holistic view of participants.
which is usually con ned to a
pilot stage at the beginning of a Weaknesses of unstructured observations:
study to re ne the behavioural
categories to be used 1. Low Reliability: Without a structured system, observations can vary widely
between researchers, making replication challenging.
2. Data Overload: The abundance of unfiltered data can complicate analysis, and
important insights may be lost in the process of managing extensive
information.
participants are unaware of the 1. Validity: as participants are not aware of being observed their behaviour is
true nature of the observation/ unlikely to be affected by being observed, so demand characteristics and social
the role of the observer is not desirability are reduced.
obvious
Weaknesses of covert observations:
e.g because they are hidden or 1. Ethical issue: partipcants are being watched without their permission which
disguised or they are are breaks the ethical guidelines of consent, they are also denied the right to
unaware of the role of the withdraw, also if their work out the role of a participant covert observer it may
observer (have not given cause them distress, it is also difficult and sometimes impossible to debrief
informed consent- are being participants.
deceived) 2. Practical issue: the observer must be either hidden, far away or disguise their
role, they may make data collection more difficult, potentially reducing the
validity and reliability of the study.
Evaluation
refers to the degree of agreement or consistency
between di erent raters or judges when assessing
the same subject, task, or phenomenon.
inter observer reliability
- across di erent
researchers
Reliability
consistency
behavioural categories ensure that researchers
interpretation is not required while recording data
behavioural categories therefore making the data collection process
- prede ned/ pre objective and the results reliable i.e. if someone
determined categories else counted the same instances they would get
the same results
Participant/Non-participant: is the researcher interacting with, or part of the social group of the
How participants?
Covert/Overt: are the participants aware of being observed?
Ethical considerations: if required by the question i.e consent, no harm, debrie ng etc.
Evaluation Validity: improves if participants natural behaviour is recorded
Reliability: inter-observer reliability (using two observers)
fi
fi
fi
Jim is planning an observational study about the way people respond to facial expressions. He intends
to go to a train station and either smile or frown at people. He wants to see how people respond to him.
(a) Describe how Jim could conduct an observational study at a train station to nd out how people
respond to smiling and frowning.
To conduct an observational study on responses to facial expressions, Jim could implement a covert,
structured, participant observation in a controlled setting at a train station.
SAMPLING: Using opportunity sampling, Jim would record responses from individuals he encounters
during each interval.
WHEN: Jim will go to a train station every Monday, Wednesday, Saturday, morning (9am-10am) and evening
(5pm-6pm) to collect data on a diverse sample of people.
HOW: He would engage with individuals without disclosing his role as a researcher (covert). While people
would see him smiling or frowning, they would not know they are part of a study, minimising demand
characteristics and promoting authentic reactions. By smiling or frowning, Jim actively manipulates the social
situation to observe immediate and unaltered responses to his facial expressions (controlled).
DATA COLLECTION: Jim would use a structured observation approach by creating a pre-defined
checklist/behavioural categories of potential responses, such as ‘smiling back’, ‘frowning’, ‘looking away’,
or verbal responses. This structured design enhances reliability, ensuring consistent data collection across
all participants and making the study easier to replicate.
The procedure would involve defining specific time intervals (e.g., every 15 minutes) to alternate between
smiling and frowning. He would approach participants from a set distance (e.g., five feet away), maintaining
uniformity in interactions by making brief eye contact and displaying a distinct smile or frown for
approximately three seconds. He will set another control by wearing the same clothes for each observation.
DATA ANALYSIS: in a note book he will record how the participant responds e.g ‘smiling back’ and then
tally/count and make a table of the frequency of each response. He will then make a bar chart, where on the
x axis will be the different behavioural categories, and on the y axis the no of times each category was
observed. He will then be able to statistically compare the participants reaction to smiling and frowning.
fi
Dr Birkin is planning an observational study of children’s
imaginary play behaviour (for example, a child might use a
leaf as a plate, or a big cardboard box as a boat or a house).
smiling 15 5 25 10
frowning 5 15 30 5
30
23
15
0
smiling back frowning looking away verbal response
1.5 CORRELATIONS
PAPER 2 RESEARCH METHODS
De nition: Correlational analysis is a statistical technique used to examine the relationship between
two measured variables (co-variables).
• A negative correlation – as one variable increases, the other decreases (has a coe cient below
0 down to -1)
Purpose: It identi es patterns and trends between variables but does not establish cause and
e ect.
Use Case: Especially useful when variables cannot be manipulated (as in an experiment) due to
ethical or practical reasons.
Example 1: Measuring children's exposure to violent television and correlating it with their
aggression levels in school, since experimentally increasing exposure would be unethical.
Example 2: Studying the link between sleep duration and academic performance, as it might be
impractical to control students' sleep patterns directly.
Key Insight: Helps researchers understand potential links between variables in real-world settings.
ff
fi
fi
ffi
ffi
Measuring variables in correlational
studies:
NUMERICAL DATA - Quantitative Data
each variable must exist over a range, sometimes called ‘continuous data’, and it must be possible measure them
numerically.
numbers from a rating scale (how happy are you? 0-5, 0= not at all, 5= extremely happy)
• Variables:
◦ In correlational research we do not (or at least try not to) in uence any variables but only measure them and
look for relations (correlations) between some set of variables, such as blood pressure and cholesterol
level.
◦ In experimental research, we manipulate some variables and then measure the e ects of this manipulation
on other variables; for example, a researcher might arti cially increase blood pressure and then record
cholesterol level.
◦ A correlational study, however, just investigates whether two variables are interdependent i.e. whether they
are related to each other. There is, however, no expectation of causation in such studies.
◦ If a researcher nds that there is a correlation between two variables, it means that there is a possible
causal relationship etween these variables.
• Data Analysis:
◦ Data analysis in experimental research also comes down to calculating “correlations” between variables,
speci cally, those manipulated and those a ected by the manipulation.
◦ However, only experimental data can conclusively demonstrate causal relations between variables. For
example, if we found that whenever we change variable A then variable B changes, then we can conclude
that “A in uences B.”
◦ Data from correlational research can only be “interpreted” in causal terms based on some theories that we
have, but correlational data cannot conclusively prove causality.
fi
fl
fi
b
fi
ff
ff
fi
ff
fl
ff
ff
Correlational analysis Experiments
one variable is manipulated (IV)
two measured variables (e.g,
and the other variable is
variable 1 and variable 2)
measured (DV)
explanations
describe
(state it’s cause)
Hypotheses in correlational
studies
A hypothesis in a correlational study needs to
operationalise the co-variables as it predicts what
there will be be a relationship between them.
For example:
"There will be a
"Increased social media use "There will be no correlation
Social Media correlation between
will be positively correlated with between social media use
and Anxiety social media use and
higher anxiety levels." and anxiety levels."
anxiety levels."
"There will be a
"Higher exercise frequency will "There will be no correlation
Exercise and correlation between
be negatively correlated with between exercise frequency
Stress exercise frequency and
stress levels." and stress levels."
stress levels."
"There will be a correlation "Higher exercise frequency per week "There will be no correlation
Exercise and between exercise frequency per will be negatively correlated with between exercise frequency per
Stress week and stress levels as stress levels as measured by a week and stress levels as
measured by a stress scale." stress scale." measured by a stress scale."
Income and "There will be a correlation "Higher annual income will be "There will be no correlation
between annual income and job positively correlated with higher job between annual income and job
Job satisfaction as indicated by a satisfaction as indicated by a satisfaction as indicated by a
Satisfaction satisfaction survey score." satisfaction survey score." satisfaction survey score."
Evaluating correlations
Validity - (de nition): a correlational study can only
be valid if the measures of both variables test real
phenomena in e ective ways - to achieve this the
variables must be clearly de ned and relate directly
to the relationship being investigated.
pulse rate -
• What:
• variable 1 (correct operationalisation and quanti cation of rst correlational variable e.g. a repetitive
behaviour)
• variable 2 (correct operationalisation and quanti cation of second correlational variable e.g. understanding
of lesson)
• How: –technique for producing/collecting data i.e. procedure (e.g. tests, observations, questionnaires).
The minors are:
• ethical issues
fi
fi
fi
1.6 LONGITUDINAL STUDIES
PAPER 2 RESEARCH METHODS
Longitudinal studies
a research method that follows a group of participants
(cohort) over time, week to decades, looking at changes
in variables to explore development or changes due to
experiences, such as interventions, drugs or therapies
• Cross-sectional studies are valuable for assessing the state of a population or phenomena
at a given moment but cannot establish causal relationships or investigate changes over
time, as they do not involve follow-up measurements. This design is often used to examine
the prevalence of certain traits, conditions, or behaviours within a population and to identify
potential associations or di erences between various variables at that particular time.
• Design: Similar to a repeated measures design, longitudinal studies follow the same participants over a period of time,
collecting data at multiple points.
• Strengths: This approach allows for examining changes in individuals over time, providing insights into development, progression,
and causal relationships. There are fewer participant variables as the same participants are studied throughout the process.
• Weaknesses: They can be time-consuming, expensive, and subject to participant dropouts (sample attrition), which may bias
results. There can be order e ects, if for example participants are lling the same questionnaire repeatedly over time.
• Example: Tracking a group of children’s cognitive development from early childhood through adolescence.
Cross-Sectional Studies:
• Design: Resembling an independent measures design, cross-sectional studies compare di erent groups of participants at one
point in time, rather than following the same individuals.
• Strengths: These studies are quicker and more cost-e ective than longitudinal studies, as they gather data in a single instance,
and are useful for identifying correlations between variables across age groups or populations.
• Weaknesses: They do not track changes over time and may be a ected by cohort e ects, where di erences between groups are
due to generational or cultural factors rather than age or development i.e. the results may be a ected by participant variables.
• Example: Comparing cognitive abilities in 5-year-olds, 10-year-olds, and 15-year-olds in a single study.
ff
ff
ff
fi
ff
ff
ff
ff
EXAMPLES
6 weeks
baseline/pre intervention Intervention Post-intervention
0 1 2 3 4 5 6 7 8 9 10 11 12
month month month month month month month month month month month month month
Enable researchers to track changes. Longitudinal studies are the most valid test of developmental changes because
in cross sectional studies there may be confounding variables.
Allow researchers to determine/ provide evidence for the e cacy of interventions by comparing data pre and post an
intervention
As the group of participants are the same (cohort), participant variables do not confound the results improving the
validity of the study.
Longitudinal studies like repeated measure retest the same individuals. this means the researcher can be con dent that
any di erences found (over time) are not because of participant variables.
Weaknesses:
It can be di cult to standardise the measures used at di erent time points (e.g. interviewers might change if a
researcher leaves, new and improved measures may be introduced, etc) which impacts the reliability of the study.
Over time the researcher might form a relationship with the participants which may bias the results.
To improve reliability the researcher might test participants repeatedly over time using the same measures (e.g. BDI)
this may lead to demand characteristics being revealed or order e ects (practise e ects or fatigue e ects)
Participants can drop out of the study over time for many di erent reasons this sample attrition reduces the
representativeness of the sample there by reducing the generalisability of the study. Longitudinal studies may not be
generalizable to other populations or contexts, as they are typically focused on a speci c group of participants over
time.
They may also be in uenced by historical or cultural factors that change over time, making it di cult to interpret
changes in behaviour or development as due to the variables of interest.
If children were initially recruited then consent was likely taken from parents or guardians, as the individual grows they
might withdraw their consent.
It can be di cult to maintain con dentiality as researchers need to store personal information in order to be able to re-
contact participants.
ff
ffi
ffi
fl
fi
ff
ffi
ff
ff
ff
fi
ffi
ff
fi
Jack is conducting a longitudinal study on the effectiveness of CBT to treat Generalised Anxiety
Disorder. He conducts an experiment where he compares a group of subjects who were given
CBT for 12 weeks with another group that were given befriending sessions. He collects data
about them at baseline, then immediately after treatment, and then 9-months post treatment.
(a) Describe what is meant by a ‘longitudinal study’, using an example from Jack’s study. .
.......................................................................................................................................................
………………………………………………………………………………………………………………
...................................................................................................................................................[2]
(b) Explain the advantages of using the longitudinal method in Jack’s study.
………………………………………………………………………………………………………………
………………………………………………………………………………………………………………
………………………………………………………………………………………………………………
.......................................................................................................................................................
.......................................................................................................................................................
……………………………………………………………………………………………………………[4]
(c) Suggest one practical difficulty Jack might face in conducting his study.
.......................................................................................................................................................
...................................................................................................................................................[2]
(d) Define "baseline measurement" and explain its importance in Jack's study.
………………………………………………………………………………………………………………
………………………………………………………………………………………………………………
...................................................................................................................................................[2]
(e) Identify and describe one potential extraneous variable that could affect Jack’s study.
.......................................................................................................................................................
.......................................................................................................................................................
……………………………………………………………………………………………………………[2]
(f) Jack collects data at three different time points: baseline, post-treatment, and 9 months
after treatment. Explain why it is important for Jack to include the 9-month follow-up in his
study design.
………………………………………………………………………………………………………………
.......................................................................................................................................................
...................................................................................................................................................[2]
(g) If Jack finds that the CBT group shows more improvement at the 9-month follow-up
compared to the befriending group, what conclusions might he draw about the long-term
effectiveness of CBT?.
………………………………………………………………………………………………………………
………………………………………………………………………………………………………………
...................................................................................................................................................[2]
(h) Evaluate the use of self-report questionnaires as Jack’s method of data collection for
measuring anxiety levels.
.......................................................................................................................................................
.......................................................................................................................................................
……………………………………………………………………………………………………………[2]
(i) Critically assess one ethical issue that Jack must consider in his study and explain how he
could address it.
………………………………………………………………………………………………………………
.......................................................................................................................................................
...................................................................................................................................................[2]
(j) In Jack's study, the group receiving befriending sessions serves as the control group. Explain the role of a
control group and why it is important in this experiment
.………………………………………………………………………………………………………………………..……………
.……………………………………………………………………………………………………………………………………...
.…………………………………………………………………………………………………………………………………[2]
(k) Discuss one way in which Jack can improve the reliability of his measurements.
.………………………………………………………………………………………………………………………..……………
.……………………………………………………………………………………………………………………………………...
.…………………………………………………………………………………………………………………………………[2]
(l) If Jack's results show that both the CBT and befriending groups improved significantly from baseline to the post-
treatment measurement, what type of statistical analysis could he use to determine if the improvement in the CBT
group is significantly greater than in the befriending group?
.………………………………………………………………………………………………………………………..……………
.……………………………………………………………………………………………………………………………………...
.…………………………………………………………………………………………………………………………………[2]
(l) Jack's data show that some participants dropped out before the 9-month follow-up. How could attrition impact
the validity of his results?
.………………………………………………………………………………………………………………………..……………
.……………………………………………………………………………………………………………………………………...
.…………………………………………………………………………………………………………………………………[2]
Dr Felix believes that adults between the ages of 55 and 65 cope less well with their job regardless of whether they stay in the same job or
change jobs. Dr Felix plans to use a sample of adults aged 55 at the beginning of the study. She will re-contact the participants by telephone
as she has a record of each individual’s telephone number and will be following appropriate ethical guidelines.
(a) Describe how Dr Felix could conduct a longitudinal study to investigate how well adults between the ages of 55 and 65 cope with their jobs.
Do not describe how Dr Felix would re-contact her participants, the sample/sampling technique or ethical issues/guidelines in your answer.
Dr. Felix will conduct a longitudinal study that spans ten years, collecting data at multiple points to observe changes in coping mechanisms of adults
between the ages of 55 and 65 cope with their jobs over time.
Tests/Tasks: Measures Used
At the beginning of the study (cohort: age 55), Dr. Felix would administer both self-report: questionnaires and interviews. A suitable tool for
measuring coping could be the close ended questionnaire, which quantifies stress levels, coping strategies, job satisfaction, and emotional well-
being. The questionnaire would include questions like:
"On a scale of 1-10, how well do you think you handle stress in your current role?"
"How frequently do you experience job-related anxiety?" (never, rarely, sometimes, often, always)
Additionally, Dr. Felix could conduct semi-structured interviews to gather qualitative data on how participants feel about their work and whether
they perceive any age-related challenges, providing richer insight into their personal experiences.
Scoring: How Data Will Be Scored or Analysed
For the quantitative data collected from the questionnaire, Dr. Felix could total the scores for each participant. This would enable her to track changes
in the scores over time. She could calculate averages for each time point to determine general trends in coping among the sample. For the qualitative
data gathered from interviews, Dr. Felix would use thematic analysis to identify recurring themes, such as "increased job stress with age" or
"improved work-life balance."
Frequency/Interval: Pattern of Testing Over Time
Dr. Felix would re-test participants at regular intervals—every two years—over a ten-year period (at ages 57, 59, 61, 63, and 65). By using the same
tests and interview questions during each data collection point, she can ensure consistency in the data collection process and accurately measure
how participants' coping mechanisms evolve. The two-year intervals will allow sufficient time for any changes in job circumstances (e.g., job change or
job retention) to influence coping strategies.
Controls/Standardisation:
To ensure consistency, Dr. Felix should standardize the testing conditions across all time points. This could include using the same questions and
scoring system at each data collection point to maintain reliability. Additionally, interviews should be conducted in similar settings (e.g., over the phone
or in a private office) to reduce environmental variations that could influence participant responses. Dr. Felix would also provide the same instructions
to each participant at every stage to avoid introducing variability in how the tasks are approached.
Describe one practical/methodological strength of the procedure you have
described in your answer to part (a). Do not refer to re-contacting the
participants, sampling or ethics in your answer.
One practical weakness of Dr. Felix's study is the potential for social desirability bias
in self-report questionnaires, where participants may give responses they think are
more socially acceptable rather than their true feelings. This can result in inaccurate
data regarding coping mechanisms and job satisfaction which would reduce validity.
To overcome this issue, Dr. Felix could ensure that the questionnaire allows
participants to be anonymous which may encourage participants to provide more
honest responses thus enhancing the validity of the data collected.
Explain why the feature of the procedure you have identi ed in (i) is a strength.
Do not refer to re-contacting the participants, sampling or ethics in your answer.
(2 marks)
The use of anonymous data collection methods in Dr. Felix's study is a strength
because it encourages participants to provide more honest and accurate responses
about their coping mechanisms and job satisfaction. By minimising the fear of
judgment, participants are likely to share their true feelings, leading to more reliable
data.
This, in turn, enhances the validity of the results, as the collected data will better
re ect the participants' genuine experiences over time.
fl
fi
1.7 THE DEFINITION,
MANIPULATION, MEASUREMENT
AND CONTROL OF VARIABLES
PAPER 2 RESEARCH METHODS
Aim of a study
• The intention of the study, the idea being tested or the purpose of
the research, such as to investigate a question or solve a problem.
in experiments
independent variable
dependent variable
controlled/uncontrolled variables
in correlations
two co-variables
Operationalising variables
Variables must be operationalised
When writing hypotheses, you should ideally operationalise the variables you are referring to.
In an experiment for example testing the e ect of age on susceptibility to false memories.
The IV would be age, with, for example, ‘young’, ‘middle-aged’ and ‘old’ groups. It is
important to know how old the people in the groups are; this is operationalisation. You
might operationalise
• a variable which either acts randomly (extraneous), a ecting the DV in all levels of the IV or
systematically, i.e. on one level of the IV (called a confounding variable) so can obscure the e ect
Uncontrolled of the IV, making the results di cult to interpret.
variable • Good control of extraneous variables raises validity
• Extraneous variables are those uncontrolled variables that have a consistent e ect on the DV, and
therefore confound/confuse the results.
• Confounding variables act on the DV selectively in one level of the IV so can interfere with the
results one of two ways.
Confounding • Confounding variables can work against the e ect of the IV, counteracting its e ect on the
DV, thus preventing the real e ect from being identi ed.
variables
• Alternatively, they can increase the apparent e ect of the IV and so suggest that there is an
e ect that doesn't really exist.
• These variables are the most important to control.
ff
ff
ff
ffi
ff
ff
ff
ff
fi
ff
ff
ff
ff
ff
• an experimental procedure where neither the participant nor the researcher
is aware of which condition the participant is in (1 mark)
double-blind:
• so a participant could be in the nalmefene (or naltrexone) group or the
control (placebo) group and not know which group they are in (2 marks)
• Other extraneous variables, which have a random e ect on the DV across all
levels of the IV, are not so problematic.
• The di culty is to identify which variables it will be important to control
Pilot study
before the experiment starts.
• This is one function of a pilot study, a preliminary test of the procedures of a
study.
ffi
ff
ff
Situational variable
Experiments
- it is important to control extraneous -a factor of the environment e.g the lighting in
variables a class room
- - extraneous variables that have a
consistent e ect are called
Controlling confounding variables Participant variable
variables - -Other extraneous variables, which
have a random e ect on the DV -features of the individual e.g a participant
ability to do math
across all levels of the IV, are not so
problematic.
-
e.g. participant variable in Andrade could have been how much the individual normally
doodled;
e.g. participant variable in Milgram was experience with electricity (made them disobey
earlier)
e.g. situational variable in Piliavin et al. was the behaviour / comments of passengers;
Situational variables are reduced through controls on the environment where possible
ff
ff
ff
fl
ff
1.8 SAMPLING OF
PARTICIPANTS
PAPER 2 RESEARCH METHODS
The group, sharing one or more characteristics from which a sample is
drawn.
A sample • Is the group of people who participate in a study that are taken from
the population and should ideally be representative of that group so
that the ndings of the study are generalisable.
Sampling The method used to obtain the (sample= participants for a study from a
technique population)
fi
Generalisability How widely ndings apply, e.g. to other settings and populations
the larger the sample size the more representative the sample, the
Representativeness more generalisable the results
the more diverse the sample the more representative the sample,
the more generalisable the results
fi
Opportunity (convenience) sampling
Description:
• participants are chosen because they are available, for example university students are selected because
they are present at the university where the research is taking place.
Strengths:
• Quicker and easier than other methods as the participants are available, therefore a larger sample can be
readily obtained.
Weaknesses:
• Likely to be non-represented as the variety of people available is likely to be limited, so they will tend to
be similar because they have been recruited from the same context (time and place), and the sample
could therefore be biassed.
• For example, many studies are conducted using university students as they are convenient for the
researchers. However this means that the sample will be predominantly young, often Westernised,
Educated, from Industrialised, Rich Democracies (WEIRD) and this may restrict the applicability of a study
worldwide.
ff
Volunteer (self select) sampling
Description:
• participants are invited to participate, for example through advertisements via email or notices. Those who reply
become the sample.
Strengths:
• Participants are likely to be committed e.g. willing to return for repeated testing
• ethical because participants approach the researchers themselves (no pressure which might be present in opportunity
sampling) which is especially important for drug trails
• It is a useful technique when looking for participants who are unusual/unique in some way for example Baron-Cohen
et al's study where people with autism spectrum disorder were needed.
Weaknesses:
• Likely to be non-representative as people who respond to requests may be similar (volunteers) e.g have free time, are
motivated, interested to participate- bias.
• The nature of the invitation for volunteer sampling might exclude some groups of people.
• Imagine a researcher is looking for a sample of students at a school and puts an advert for volunteers on the library
notice board. Students who never go to the library cannot be included so the sample might be biassed towards
those who work the hardest.
Random sampling
Description:
• sampling all members of the population (i.e. possible participants) are allocated numbers and a xed amount
of these are selected in an unbiased way, for example by taking numbers from a hat, or use a random number
generator to choose the participants.
Strengths:
• Likely to be representative as all types of people in the population are equally likely to be chosen.
• unbiased
Weaknesses:
• recruitment can be di cult as the researchers need access to names and details of all members in your
possible population
• sometimes random generation can also exclude certain categories of people e.g certain minority ethnicities
within the population
• In reality everyone may not be equally likely to be chosen e.g. if they cannot be accessed (if the original list is
incomplete) or if mainly one type of participant e.g. girls, happen to be selected. This is particularly important if
the sample is small.
ffi
fi
Description Description Generalisability Practical concerns Why use it? Excludes
Advantage of Random sampling is more likely to produce a representative sample of the target
random sampling population compared to opportunity sampling. This is because every member of the
over opportunity population has an equal chance of being selected, reducing sampling bias and increasing
sampling the generalizability of the results.
Opportunity sampling is typically quicker and easier to conduct than volunteer sampling because
participants are readily available and do not need to self-select or respond to an advertisement. This saves
Advantage of time and e ort for the researcher.
opportunity
Opportunity sampling can sometimes be more representative than volunteer sampling because it includes
sampling over participants who are available at the time, rather than only those who are motivated to volunteer. Volunteer
volunteer sampling sampling often attracts a speci c type of person (e.g., highly motivated or interested individuals), which can
lead to bias. Opportunity sampling, while still not fully representative, may capture a wider range of
individuals.
ff
fi
Possible questions:
a title
Student group
Quantitative data indicates the quantity of a psychological measure, such as the amount or
strength of a response and tends to be measured on scales, such as time, or as numerical
scores on tests such as IQ or personality. Quantitative data is associated with experiments
and correlations which use numerical scales but it is also possible to obtain quantitative data
from observations, questionnaires or interviews. For example, a record of the number of times
a behaviour is seen or the total of responses to a closed question in an interview would be
quantitative data. The sources of quantitative data are typically highly objective, as the scales
or questions used need little if any interpretation, making them high in validity. In addition, the
measures used are generally highly reliable, as the measures are xed quantities.
Qualitative data indicates the quality of a psychological characteristic. Such data is more in-
depth than quantitative data and includes detailed observer accounts and responses to open
questions in questionnaires, interviews or case studies. Although there is a risk of subjectivity
in the interpretation of such data by the researcher, qualitative data may be more
representative as the participant can express themselves fully, so in some senses qualitative
data can also be valid.
fi
Quantitative Qualitative
uses numbers
Qualitative Data
• Qualitative data (QL) where words or descriptors are taken and converted to quantitative data (QT).
• Labels that are given numerical value
• Note that nominal data that has no order. • Note that for ordinal data order matters.
Nominal Data: Ordinal Data:
Counted Measured
Cannot be divided into smaller parts Can be subdivided into smaller pieces
For example, eye colour or gender can be An example of a continuous variable is weight or height - a
considered a discrete variable because individuals person doesn't have to be either 150 pounds or 151
are either part of a category or they aren't - there is pounds. They could be 150.6 or 150.99999 pounds
no range of answers in between.
Examples: Examples:
1. Number of students in a class (cannot have 4. Amount of time required to complete a task (1.5
‘half’ a person) hours)
12
Frequency of
participants
Bar Chart 6
0
0 1 2 3
Histogram
Observations
12
Frequency of
participants
Bar Chart 6
0
0 1 2 3
Number of complaints
Histogram
Biological measures
12
Frequency of
participants
Bar Chart 6
0
60-70 71-80 81-90 91-100
Histogram
Thought box:
Scenario: A psychologist observes a volleyball game using a structured
observation method with prede ned behavioral categories. They collect
data by:
Question
Reason:
◦ Discrete data consists of speci c, separate values, and in this case, you
cannot have fractional counts like 3.5 females or 2.7 disagreements.
Question
Reason:
Question
Continuous Data
Reason:
DV
IV DV
ti
te
te
12
9
Number of people
0
CBT Psychoanalysis Art therapy Group therapy
12
9
Number of people
0
0-10 11-20 21-10 31-40
Grade Intervals
In Experiments:
100
75
50
25
0
April May June July
Central tendency
A measure of central tendency (also referred to as
measures of centre or central location) is a summary measure
that attempts to describe a whole set of data with a single
value that represents the middle or centre of its distribution.
Median Ordinal scale Preferable for when data ordinal Bar charts
(requires an order) (has an order) OR is not
symmetrical
Middle
value -is not impacted by outliers
-does not represent outliers
Advantages
Disadvantages
Disadvantages
• As it does not take into account outlying values it is less representative
than the mean.
• Unlike the mode, the median cannot be identi ed for categorical nominal data,
it is only used with numerical data on a linear scale (i.e. points in a sequence).
• Unlike mean, median is not amenable to further mathematical calculation and
hence is not used in many statistical tests.
fi
fi
(2) Median: the middle score of a data set
Advantages
Participant Rating
1 5
2 4
3 3
4 4
5 2
6 3
7 5
8 1
9 4
10 3
fi
fi
fi
fi
Dataset Example: Likert Scale Ratings with Two Outliers
Participant Rating
1 5
2 4
3 3
4 4
5 2
6 3
7 5
8 1
9 4
10 1
11 (Outlier) 20
12 (Outlier) 25
fi
fi
fi
fi
Median is preferred to mean when:
• Animals should be protected from pain, e.g. relating to surgery using appropriate anaesthesia and analgesia, and
killed (euthanised) if su ering lasting pain.
harm
• This does not relate to the Fagen et al. study as the elephants were not at risk of catching a life threatening
disease in fact, the study was designed to prevent elephants from developing tuberculosis.
• In the Fagen et al study chopped bananas were used as part of the SPR training which is based on the principles of
positive reinforcement aka rewards and purposely sought to prove the e cacy of SPR over punishments therefore
Reward,
promoting and maintaining the welfare of the elephants.
deprivation
and • The one aversive stimuli used was the syringe and saline, but it was justi ed as it was necessary for the diagnosis of
aversive tuberculosis and therefore in the long run bene cial. Furthermore the elephants were gradually desensitised to the
stimuli
syringe and saline through gradual exposure, starting with a drop and building up in small increments (ranging from
1-15ml) to reach a tolerance of 60ml of saline for sample collection.
fi
fi
ff
ff
fi
fi
fi
fi
fi
ff
fi
fi
ffi
ff
ff
fi
• Only the minimum number of animals needed to produce valid and reliable results should be used.
To minimise the number, pilot studies, reliable measures of the dependent variable, good
number of experimental design and appropriate data analysis should all be used.
animals
• Only a minimum number of animals should be used to produce valid results. In Fagen et al. only
ve animals were used, four baby elephants and one older elephant. The data might not be
generalisable but it makes the study ethically sound as least amount of animals faced the suffering.
• Fagen et al.'s research is based on elephants, a species highly sentient and socially complex,
which could experience signi cant distress in captivity.
species and • However, their study aims to use positive reinforcement (without the use of punishment) to teach
strain elephants to perform a trunk wash procedure that could medically offer great bene t as it would
allow their caretakers to timely diagnose and therefore treat the elephants with tuberculosis
therefore minimising their suffering and even death.
• The elephants’ diet consisted mainly of fresh grasses and dhana (packets of grain, nutritional
supplements, and grasses).
• The elephants had access to the river for water during their grazing time but were otherwise not
food offered water outside of the training protocol.
• These husbandry conditions are the standard practice at the stable and no alterations were made
for the purposes of the study.
• Researchers should consider replacing animal experiments with alternatives, such as videos from
previous studies or computer simulations.
replacement
• The elephants could not have been replaced as it was their illness being treated through the
experiment.
fi
fi
fi
• Research causing death, disease, injury, physiological or psychological distress or discomfort should be
avoided. Where possible, designs which improve rather than worsen the animals’ experience should be used to
normal rather than early deprivation). Alternatively, naturally occurring instances may be used (e.g. where stress
arises naturally in the animal’s environment or lifetime).
• Given the limited freedom permitted by their chains, the elephants could clearly indicate a preference to not
participate in training sessions by walking to the other side of their stalls or simply turning away from the
trainer.
• Mahouts were present at all sessions and stood on the periphery for the safety of the trainer, but they were
procedures: clearly instructed not to speak to or signal the elephants in any way during the sessions to maintain the integrity
pain and of the training. The mahouts complied with this request. Their presence also ensured safety due to the fact that
distress elephant were familiar with the mahouts
• During research, attention should be paid to the animals’ daily care and veterinary needs and any costs to the
animals should be justi ed by the scienti c bene t of the work
• Saline was slowly introduced to the elephants through the process of desensitisation to reduce the distress
caused by saline washing; over a series of repetitions the syringe was brought closer to the elephant, made
contact with its trunk and then with little increments of ml the saline was given to the elephants. This process
• Between testing , animals should be housed with enough space to move freely.
• Elephants went into the jungle to graze under the control of their mahouts from 5 a.m. to 7 a.m. and 10:30 a.m.
to 4 p.m. each day, and they were leg-chained to posts in open stalls for the remainder of the day and night.
• Leg chains were normally placed with both front legs chained together or on a single front leg, with a chain
approximately 6 ft to 8 ft long (1.8 m to 2.4 m) between them and the post. This setup allowed enough laxity in
housing the chains for the elephant to shu e in a diameter that was 6 ft to 8 ft around her.
• The arti cial environment needed to recreate aspects of the natural environment important for the welfare and
survival.
• The elephants spent most of the day grazing in the jungle miming their natural environment.
• They spent the rest of the day leg-chained in a stable
fi
fi
ffl
fi
fi
Why is it important to get consent
from children and parents/
guardians?
Consent is agreeing to participate
Parents
Children
G
GENERALISABILITY
- can the ndings of the
• how large was the sample size?
• how representative (age, gender, ethnicity) was the
study be generalised to
sample?
the general population? • how realistic was the setting/measure?
R
- consistency • are there any extraneous or confounding variables?
- if the study was • is it replicable?
repeated how likely are • how many controls were there?
you to get the same • how objective were the measures?
results?
• does it provide evidence for a theory?
VALIDITY
V
accuracy • how likely is it that the IV caused the change in the
DV (internal validity) - depends on controls
- did the study accurately
• to what extent can the results be applied to real life
measure what it set out to situations (ecological validity)? - depends on
measure? mundane realism
E
ETHICS
were participants deceived during the study?
- did the researchers
could participants be at risk of psychological harm?
follow the ethical
were participants given the right to withdraw and
guidelines?
debriefed after the experiment
fi
Objectivity: the impact of an unbiased external viewpoint on, for
example, how data is interpreted.
• A strength of quantitative (QT) data gathered from close ended
questions and structured observations that are based on
quanti cation.
Representativeness
Population validity
External validity - whether results of one sample - how large was the
i.e. generalisability of sample?
can be generalised to other
ndings - how representative was
populations the sample?
Reliability
consistency
procedure
ecological validity
the extent to which the results of
the study can be applied to real
addresses whether the
world contexts
results are true/applicable to
the real world
internal validity
the extent to which the researcher can be sure that the
changes measured in the DV were caused by the
addresses whether causality
manipulation of the IV.
was established
ecological validity
the extent to which the results of the study can be
addresses whether the applied to real world contexts
results are true/applicable to