Prelims 2024-06-29 13 - 28 - 55
Prelims 2024-06-29 13 - 28 - 55
OF STATISTICS
3
General Purposes
• Statistics are used to organise and
summarise the information so that the
researcher can see what happened in
the research study and can
communicate the results to others.
6
Descriptives
» Disorganized Data
» C - ollection
» O - rganization
» P - resentation
» A - nalysis
» I - nterpretation
» G - eneralization of data
Whoa!
Conducting Basic and Advanced Statistical Data Analysis using SPSS and JAMOVI
19
20
21
Populations and Samples
» A researcher may want to know what
factors are associated with academic
dishonesty among college students?
23
24
Parameters and Statistics
• Parameter • Statistic
– A value, usually a – A value, usually a
numerical value, that numerical value, that
describes a population describes a sample
– Derived from – Derived from
measurements of measurements of
the individuals in the individuals in
the population the sample
26
Estimating Sample Size
» Raosoft
» Survey System
» Power Analysis using G Power
» Inverse Squareroot method
» Gamma Exponential Method
Variables and Data
• ex. Bakhshi, Kanuparthy, and Gilbert (2014)
wanted to determine if the weather is related to
online ratings of restaurants. As the weather
changes, do people’s moods also change?
29
Variables and Data
• A datum (singular) is a single
measurement or observation and is
commonly called a score or raw score
30
Discrete and Continuous Variables
• Discrete variable
– Has separate, indivisible categories
– No values can exist between two neighboring
categories
• Continuous variable
– There are an infinite number of possible
values between any two observed values.
– Is divisible into an infinite number of parts
Types of Data
32
Descriptive & Inferential Statistics
• Descriptive statistics • Inferential statistics
– Summarize data – Study samples to make
– Organize data generalizations about
– Simplify data the population
– Interpret experimental
• Familiar examples
data
– Tables
• Common terminology
– Graphs
– “Margin of error”
– Averages
– “Statistically significant”
Descriptive Statistics
• Descriptive statistics are methods for
organizing and summarizing data.
• For example, tables or graphs are used
to organize data, and descriptive values
such as the average score are used to
summarize data.
• A descriptive value for a population is
called a parameter and a descriptive
value for a sample is called a statistic.
34
Inferential Statistics
• Inferential statistics are methods for using
sample data to make general conclusions
(inferences) about populations.
• Because a sample is typically only a part of
the whole population, sample data provide
only limited information about the
population. As a result, sample statistics
are generally imperfect representatives of
the corresponding population parameters.
35
Figure 1.3
Role of statistics in experimental research.
Sampling Error
• The discrepancy between a sample
statistic and its population parameter is
called sampling error.
• Defining and measuring sampling error is
a large part of inferential statistics.
37
39
40
41
Seat work 001
1. A researcher is interested in the effect of
amount of sleep on high school students’ exam
scores. A group of 75 high school boys agree
to participate in the study. The boys are…
Learning Check
• Decide if each of the following statements
is True or False. (2 and 3)
Seat work 002
• Participant Variables
– these are characteristics such as age, gender,
and intelligence that vary from one individual
to another
• Environmental Variables
– these are characteristics of the environment
such as lighting, time of day, and weather
conditions.
Experimental Method
• In the experiment, a group of college students was
told that they were participating in a manual dexterity
study. The researchers then created two treatment
conditions by manipulating the material that each
participant would be handling. Half of the students
were given a stack of money to count and the other
half got stack of blank pieces of paper. After the
counting task, the participants were asked to dip
their hands into bowls of painfully hot water (122 F)
and rate how uncomfortable it was. Participants who
had counted money rated the pain significantly lower
than those who had counted paper.
Experimental Method
• Participant Variables
– In the money-counting experiment, for
example, suppose that the participants in the
money condition were primarily females and
those in the paper condition are primarily
males.
• Environmental Variables
– Using the money-counting experiment, suppose
that the individuals in the money condition are all
tested in the morning and the individuals in the
paper condition were all tested in the evening.
Control in the Experimental Method
• Methods of control
– Random assignment of subjects
– each participant has an equal chance of being assigned
to each of the treatment condition
– goal: to distribute the participant characteristics evenly
between the two groups so that either group is noticeably
smarter than the other.
– Matching of subjects
– to ensure equivalent groups or equivalent environments.
ex. ensuring that every group has the same participants;
both groups have 60% females and 40% males
– Holding level of some potentially influential variables
constant
– ex. if an experiment uses only 10-year-old children as
participants, then the researcher can be certain that one
group is not noticeably older than another.
Independent and Dependent Variables
• A scientist studies how many days people can eat soup until
they get sick. The independent variable is the number of days
of consuming soup. The dependent variable is the onset of
illness.
Example
• In the experiment, a group of college students was
told that they were participating in a manual dexterity
study. The researchers then created two treatment
conditions by manipulating the material that each
participant would be handling. Half of the students
were given a stack of money to count and the other
half got stack of blank pieces of paper. After the
counting task, the participants were asked to dip
their hands into bowls of painfully hot water (122 F)
and rate how uncomfortable it was. Participants who
had counted money rated the pain significantly lower
than those who had counted paper.
Example
• Control condition
– Individuals do not receive the experimental
treatment.
– They either receive no treatment or they receive
a neutral, placebo treatment.
– Purpose: to provide a baseline for comparison
with the experimental condition
• Experimental condition
– Individuals do receive the experimental
treatment
Nonexperimental Methods
• Nonequivalent Groups
– Researcher compares groups
– Researcher cannot control who goes into which
group
• Pre-test / Post-test
– Individuals measured at two points in time
– Researcher cannot control influence of the
passage of time
• Independent variable is quasi-independent it
is not manipulated but used in creating groups.
Figure 1.7
Two examples of nonexperimental studies
Scales of Measurement
• Measurement assigns individuals or events to
categories
– The categories can simply be names such as
male/female or employed/unemployed
– They can be numerical values such as 68 inches
or 175 pounds
• The complete set of categories makes up a scale
of measurement
• Relationships between the categories determine
different types of scales
Levels/Scales of Measurement
⦿ Measurement – application of rules for
assigning numbers to objects. The rules are
the specific procedures to transform qualities
of attributes into numbers.
⦿ Types of scales/levels:
● Nominal
● Ordinal
● Interval
● Ratio
Levels/Scales of Measurement
A. Nominal Scale
● It simply represents the qualitative or categorical
differences of the variables measured. It is just for
labelling purposes, nothing more.
● Examples:
71
Levels/Scales of Measurement
B. Ordinal Scale
● The categories have different names and are
organized sequentially. As the name implies, it
give order or rank to subjects or items.
● Examples:
○ Socioeconomic status – upper, middle and lower
○ Contest winners – 1st place, 2nd place, etc.
○ T-shirt sizes – small, medium, large, extra large
Ordinal data
data arranged in
some order, but the
differences between Position within an
data values cannot academic organization
1 for President
be determined or 2 for Vice President
are meaningless. 3 for Dean
4 for Department Chair
5 for Faculty member
73
Levels/Scales of Measurement
C. Interval Scale
● The categories have different names, are organized
sequentially and show the degree of difference.
● Examples:
○ IQ (There is no such thing as zero IQ.)
○ Test scores (A zero score does not mean zero
knowledge or the student did not learn anything.)
○ ºC and ºF scales (They have zero value but it does not
mean that there is no temperature at all.)
Interval Data
– Distances between
consecutive integers
are equal Examples:
– Relative magnitude of ❍Temperature on the
numbers is
meaningful Fahrenheit scale.
– Differences between ❍Intelligence Quotient
numbers are
comparable
– Location of origin,
zero, is arbitrary
75
Levels/Scales of Measurement
D. Ratio Scale
● The categories have different names, are organized
sequentially, show the degree of difference and can
hold an absolute zero value.
● Examples:
○ Weight
○ Height
○ Income
○ Age
Ratio Scale
– Highest level of
measurement Examples:
– Relative magnitude •Height, Weight,
of numbers is and Volume
meaningful •Monetary
– Differences between Variables, such as
numbers are Profit and Loss,
comparable Revenues, and
– Location of origin, Expenses
zero, is absolute
(natural) •Financial ratios
77
Scales of Measurement
79
1.5 Statistical Notation
– 10 6 7 4
– ΣX = 27 and N = 4
Order of Mathematical Operations
• Any calculation contained within parentheses is
done first.
• Squaring (raising to other exponents) is done
second.
• Multiplying and/or dividing is done third. A
series of multiplication and/or division
operations should be done in order from left to
right.
• Summation using the Σ is done next.
• Finally, any other addition and/or subtraction is
done
Summation Notation
• (ΣX)2
• ΣX = 15
• (ΣX)2 (15)2 = 225
Summation Notation
– 10 6 7 4
– ΣX = 27 and N = 4
Order of Mathematical Operations
• Any calculation contained within parentheses is
done first.
• Squaring (raising to other exponents) is done
second.
• Multiplying and/or dividing is done third. A
series of multiplication and/or division
operations should be done in order from left to
right.
• Summation using the Σ is done next.
• Finally, any other addition and/or subtraction is
done
Chapter 2 Learning Outcomes
2.1 Introduction to Frequency
Distributions
• A frequency distribution is
– An organized tabulation
– Showing the number of individuals located in
each category on the scale of measurement
• Can be either a table or a graph
• Always shows
– The categories that make up the scale
– The frequency, or number of individuals, in
each category
2.1 Introduction to Frequency
Distributions
• Uses of Frequency Distribution
– Allows the researcher to see “at a glance” the
entire set of scores
– It shows whether the scores are generally high
or low; whether they are concentrated in one
area or spread out across the entire scale
– Allows you to see the location of any individual
score relative to all of the other scores in the
set.
2.2 Frequency Distribution Tables
Frequency Central
Variability
Distribution Tendency
Regular and Mean, Median, Range, Standard
Grouped Mode Deviation
Research Questions
Research Question 1
• Click Analyses
• Exploration
• Descriptives
What is the demographic profile of the respondents in terms of:
1.1 Sex
• Summation notation
• Frequency distributions
Learning Outcomes
3.1 Overview of central tendency
• Central tendency
– A statistical measure
– A single score to define the center of a distribution
• Purpose: find the single score that is most typical or
best represents the entire group
ƩX 45 + 51 + 43 + 57 + 49 + 50 + 46 + 46 + 55
M = =
n 9
M = 49.11
Example 2: Mean: Dividing the total
equally
• Mean as the amount of each individual
receives when the total (ΣX) is divided equally
among all of the individuals.
Md = 26
Median (Md)
⦿ Example:
● If the pieces of data is even, then the median is the mean
of the two middle data values in the ordered list.
Md = (17 + 26) / 2
Md = 21. 5
3.4 The Mode
1. Extreme Scores of
skewed distributions
when a distribution has a
few extreme scores, scores
that are very different in
value from most of the
others, then the mean may
not be a good
representative of the
majority of the distribution.
3.5 Selecting a Measure of
Central Tendency
When to use the Median
2. Undetermined values
the scores are unknown due to some errors.
No. of Pizzas f
5 or more 3
4 2
3 2
2 3
1 6
0 4
3.5 Selecting a Measure of
Central Tendency
When to use the Mode
1. Nominal scales
categories that make up a nominal scale are
differentiated only by name. Since nominal
scales do not measure quantity (distance or
direction), it is impossible to compute the mean
or a median.
3.5 Selecting a Measure of
Central Tendency
When to use the Mode
2. Discrete variables
since it is composed of whole, indivisible
categories, we can easily identify the most
typical case.
3.5 Selecting a Measure of
Central Tendency