Statistical Evaluation of Data-New

This document provides an overview of key statistical concepts used in data analysis and research. It discusses descriptive statistics which summarize and organize sample data, and inferential statistics which use samples to make generalizations about populations. Key statistical measures covered include frequency distributions, measures of central tendency, variability, correlations, regression, hypothesis testing, and types of errors in hypothesis testing. The goal of statistics is to use samples to draw meaningful conclusions about populations.


Statistical Evaluation of Data

Chapter 15

1 /52
Descriptive / inferential
• Descriptive statistics are methods that help
researchers organize, summarize, and simplify
the results obtained from research studies.

• Inferential statistics are methods that use the
results obtained from samples to help make
generalizations about populations.

2 /52
Statistic / parameter
• A summary value that describes a sample is
called a statistic (e.g., M = 25, s = 2).

• A summary value that describes a population
is called a parameter (e.g., µ = 25, σ = 2).

3 /52
Frequency Distributions
One method of simplifying and organizing a set
of scores is to group them into an organized
display that shows the entire set.

4 /52
Example

5 /52
Histogram & Polygon

6 /52
Bar Graphs

7 /52
Central tendency
The goal of central tendency is to identify the
value that is most typical or most representative
of the entire group.

8 /52
Central tendency
• The mean is the arithmetic average.
• The median measures central tendency by
identifying the score that divides the
distribution in half.
• The mode is the most frequently occurring
score in the distribution.
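A quick sketch of the three measures using Python's statistics module (the scores are the ten-value sample used in the variability example later in the deck):

```python
# Mean, median, and mode with Python's built-in statistics module.
from statistics import mean, median, mode

scores = [10, 7, 9, 8, 7, 6, 5, 4, 3, 1]

print(mean(scores))    # arithmetic average: 60 / 10 = 6
print(median(scores))  # middle of the sorted scores: (6 + 7) / 2 = 6.5
print(mode(scores))    # most frequent score: 7 (appears twice)
```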

9 /52
Variability
Variability is a measure of the spread of scores
in a distribution.

1. The range is the difference between the
minimum and maximum scores.
2. The standard deviation describes the average
distance of scores from the mean.
3. The variance measures variability by computing
the average squared distance from the mean.
10 /52
Variance = the index of variability.
SD = SQRT(Variance)
Variance = (Sum of Squares) / N

 X    X - M   (X - M)²
10      4       16
 7      1        1
 9      3        9
 8      2        4
 7      1        1
 6      0        0
 5     -1        1
 4     -2        4
 3     -3        9
 1     -5       25

Total = 60, Mean = 6      SS = 70
Variance = 70/10 = 7      SD = SQRT(7) = 2.65
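The worked example above can be reproduced in a few lines of Python (same ten scores; the variance uses N in the denominator, as on the slide):

```python
# Reproduces the worked example: SS / N with N = 10 scores.
scores = [10, 7, 9, 8, 7, 6, 5, 4, 3, 1]

m = sum(scores) / len(scores)             # mean = 6
ss = sum((x - m) ** 2 for x in scores)    # sum of squares = 70
variance = ss / len(scores)               # 70 / 10 = 7
sd = variance ** 0.5                      # sqrt(7) ≈ 2.65

print(m, ss, variance, round(sd, 2))
```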
11 /52
Non-numerical Data
Proportion or percentage in each category.
For example,
• 43% prefer Democrat candidate,
• 28% prefer Republican candidate,
• 29% are undecided

12 /52
Correlations
A correlation is a statistical value that measures
and describes the direction and degree of
relationship between two variables.

13 /52
Types of correlation

Y \ X            Quantitative X        Ordinal X                          Nominal X
Quantitative Y   Pearson r             Biserial r_b                       Point Biserial r_pb
Ordinal Y        Biserial r_b          Spearman rho / Tetrachoric r_tet   Rank Biserial r_rb
Nominal Y        Point Biserial r_pb   Rank Biserial r_rb                 Phi, C, λ (Lambda)

• Phi is for dichotomous data only.
• C is Pearson's contingency coefficient.
• Cramer's V coefficient.
• λ is the Goodman and Kruskal lambda coefficient.
http://www.andrews.edu/~calkins/math/edrm611/edrm13.htm

14 /52
Regression

15 /52
Regression
• Whenever a linear relationship exists, it is
possible to compute the equation for the
straight line that provides the best fit for the
data points.
• The process of finding the linear equation is
called regression, and the resulting equation is
called the regression equation.

16 /52
Where is the regression line?

[Scatter plot: STRENGTH (70-120) plotted against WEIGHT (140-220)]

17 /52
Which one is the regression line?

[Scatter plot: STRENGTH (70-120) plotted against WEIGHT (140-220)]

18 /52
regression equation
All linear equations have the same general
structure and can be expressed as
• Y = bX + a (for example, Y = 2X + 1)
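The best-fitting slope and intercept can be computed with the usual least-squares formulas; a minimal sketch with made-up data chosen to fall exactly on Y = 2X + 1:

```python
# Least-squares fit for Y = bX + a (illustrative data, not from the slides).
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]  # exactly Y = 2X + 1

n = len(xs)
mx = sum(xs) / n
my = sum(ys) / n
# slope = sum of cross-products of deviations / sum of squared X deviations
b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
a = my - b * mx  # the line passes through (mean X, mean Y)
print(b, a)  # 2.0 1.0
```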

19 /52
standardized form
• Often the regression equation is reported in
standardized form, which means that the
original X and Y scores were standardized, or
transformed into z- scores, before the
equation was computed.
ẑY = β zX
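As a sketch of the standardized form: if X and Y are converted to z-scores before fitting, the resulting slope β equals the Pearson correlation (simple regression only; the data here are illustrative, not from the slides):

```python
# Standardizing X and Y before fitting makes the slope equal to Pearson r.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 6]

def zscores(v):
    m = sum(v) / len(v)
    sd = (sum((x - m) ** 2 for x in v) / len(v)) ** 0.5  # population SD
    return [(x - m) / sd for x in v]

zx, zy = zscores(xs), zscores(ys)
# least-squares slope on the standardized scores
beta = sum(a * b for a, b in zip(zx, zy)) / sum(a * a for a in zx)
print(round(beta, 3))
```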

20 /52
Multiple Regression

21 /52
22 /52
INFERENTIAL STATISTICS

23 /52
Sampling Error

Random samples
No treatment

24 /52
Is the difference due to a sampling
error?
Random samples
Violent /Nonviolent TV

25 /52
Is the difference due to a sampling
error?
• Sampling error is the naturally occurring
difference between a sample statistic and the
corresponding population parameter.

• The problem for the researcher is to decide
whether the 4-point difference was caused by
the treatments (the different television
programs) or is just a case of sampling error.
26 /52
Hypothesis testing
• A hypothesis test is a statistical procedure that
uses sample data to evaluate the credibility of
a hypothesis about a population.

27 /52
5 elements of a hypothesis test
1. The Null Hypothesis
The null hypothesis is a statement about the
population, or populations, being examined, and
always says that there is no effect, no change, or no
relationship.

2. The Sample Statistic
The data from the research study are used to
compute the sample statistic.

28 /52
5 elements of a hypothesis test
3. The Standard Error
Standard error is a measure of the average, or standard,
distance between a sample statistic and the corresponding
population parameter.
The "standard error of the mean," s_M, is the standard
deviation of the distribution of sample means taken from
a population.

4. The Test Statistic
A test statistic is a mathematical technique for comparing
the sample statistic with the null hypothesis, using the
standard error as a baseline. For two sample means:

t = (M1 - M2) / s_M
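A minimal sketch of the test statistic above for two independent samples, using a pooled estimate of the standard error (toy data, not from the text):

```python
# Toy data to illustrate t = (M1 - M2) / s_M, where s_M is the pooled
# standard error of the mean difference for independent samples.
g1 = [8, 9, 7, 10, 6]
g2 = [5, 6, 4, 7, 3]

def mean(v):
    return sum(v) / len(v)

def ss(v):
    m = mean(v)
    return sum((x - m) ** 2 for x in v)

n1, n2 = len(g1), len(g2)
pooled_var = (ss(g1) + ss(g2)) / (n1 - 1 + n2 - 1)  # pooled variance estimate
sm = (pooled_var / n1 + pooled_var / n2) ** 0.5     # standard error of M1 - M2
t = (mean(g1) - mean(g2)) / sm
print(round(t, 2))  # 3.0
```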
29 /52
5 elements of a hypothesis test
5. The Alpha Level (Level of Significance)
The alpha level, or level of significance, for a
hypothesis test is the maximum probability that the
research result was obtained simply by chance.

A hypothesis test with an alpha level of .05, for
example, demands that there is less than a 5% (.05)
probability that the results are caused only by chance.
30 /52
Reporting Results from a Hypothesis
Test
• In the literature, significance levels are
reported as p values.

For example, a research paper may report a
significant difference between two treatments
with p < .05. The expression p < .05 simply means
that there is less than a .05 probability that the
result is caused by chance.

31 /52
Errors in Hypothesis Testing
If a researcher is misled by the results from the
sample, it is likely that the researcher will reach
an incorrect conclusion.
Two kinds of errors can be made in hypothesis
testing.

32 /52
Type I Errors
• A Type I error occurs when a researcher finds evidence
for a significant result when, in fact, there is no effect (
no relationship) in the population.
• The error occurs because the researcher has, by chance,
selected an extreme sample that appears to show
the existence of an effect when there is none.

• The consequence of a Type I error is a false report. This
is a serious mistake.
• Fortunately, the likelihood of a Type I error is very
small, and the exact probability of this kind of mistake
(the alpha level) is known to everyone who sees the
research report.

33 /52
Type II error
• A Type II error occurs when sample data do
not show evidence of a significant effect
when, in fact, a real effect does exist in the
population.
• This often occurs when the effect is so small
that it does not show up in the sample.

34 /52
Factors that Influence the Outcome of
a Hypothesis Test
1. The sample size.
The difference found with a large sample is
more likely to be significant than the same result
found with a small sample.
2. The size of the variance.
When the variance is small, a mean difference
between the two treatments shows up clearly
and is more likely to be significant.

35 /52
Effect Size
• Knowing the significance of difference is not
enough. We need to know the size of the
effect.

36 /52
Measuring Effect Size with Cohen’s d
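Cohen's d is commonly computed as the mean difference divided by the pooled standard deviation; a sketch with assumed toy data (the formula is standard, but the numbers are not from the slides):

```python
# Cohen's d = (M1 - M2) / pooled SD, with toy data.
g1 = [8, 9, 7, 10, 6]
g2 = [5, 6, 4, 7, 3]

def mean(v):
    return sum(v) / len(v)

n1, n2 = len(g1), len(g2)
ss1 = sum((x - mean(g1)) ** 2 for x in g1)
ss2 = sum((x - mean(g2)) ** 2 for x in g2)
pooled_sd = ((ss1 + ss2) / (n1 + n2 - 2)) ** 0.5
d = (mean(g1) - mean(g2)) / pooled_sd
print(round(d, 2))  # 1.9, a large effect by Cohen's conventions
```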

37 /52
Measuring Effect Size as a Percentage
of Variance ( r2)
The effect size can also be measured by calculating the
percentage of variance in the scores that is accounted
for by the treatment effect. For a two-group t test:

r² = t² / (t² + df), where df = (n1 - 1) + (n2 - 1)
38 /52
Examples of hypothesis tests
report
• Two-Group Between-Subjects Test
df = (n1 - 1) + (n2 - 1)

• Two-Treatment Within-Subjects Test
df = (n - 1)

39 /52
ANOVA reports
• Comparing More Than Two Levels of a Single
Factor (ANOVA)
• df(between) = k - 1
• df(within) = (n1 - 1) + (n2 - 1) + (n3 - 1) + …
• For a within-subjects (repeated-measures) ANOVA,
df(error) = (k - 1)(n - 1)

40 /52
Post Hoc Tests

Are necessary because the original ANOVA
simply establishes that mean differences exist,
but does not identify exactly which means are
significantly different and which are not.

41 /52
Factorial Tests Report
The simplest case, a two-factor design, requires
a two-factor analysis of variance, or two-way
ANOVA. The two-factor ANOVA consists of three
separate hypothesis tests: the main effect of
each factor and the interaction between them.

42 /52
Comparing Proportions
chi- square

43 /52
Reporting χ²

For example: χ²(3, n = 40) = 8.70, p = .02

df = (C1 - 1)(C2 - 1)

• The report indicates that the researcher obtained
a chi-square statistic with a value of 8.70, which
is very unlikely to occur by chance (probability
equal to .02). The numbers in parentheses
indicate that the chi-square statistic has degrees
of freedom (df) equal to 3 and that there were
40 participants (n = 40) in the study.
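A sketch of how a chi-square statistic is computed, here in its goodness-of-fit form (the observed counts are hypothetical; a two-way table would use df = (C1 - 1)(C2 - 1) as above):

```python
# Chi-square = sum of (O - E)^2 / E over categories.
# Four categories gives df = 4 - 1 = 3; counts are hypothetical, n = 40.
observed = [18, 10, 7, 5]
n = sum(observed)
expected = [n / len(observed)] * len(observed)  # equal expected frequencies

chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
print(round(chi2, 2))
```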
44 /52
Evaluating Correlations

• r = 0.65, n = 40, p < .01

• The report indicates that the sample
correlation is r = 0.65 for a group of n = 40
participants, which is very unlikely to have
occurred if the population correlation is zero
(probability less than .01).
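The correlation itself is the covariance scaled by the two standard deviations; a sketch with illustrative data (not the study's actual scores):

```python
# Pearson r = sum of cross-products / (sqrt(SSx) * sqrt(SSy)), toy data.
xs = [1, 2, 3, 4, 5, 6]
ys = [2, 1, 4, 3, 6, 5]

n = len(xs)
mx, my = sum(xs) / n, sum(ys) / n
sp = sum((x - mx) * (y - my) for x, y in zip(xs, ys))  # sum of cross-products
sx = sum((x - mx) ** 2 for x in xs) ** 0.5
sy = sum((y - my) ** 2 for y in ys) ** 0.5
r = sp / (sx * sy)
print(round(r, 2))  # 0.83
```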

45 /52
Reliability & Validity
• Reliability refers to the consistency of a
measurement, typically evaluated by the
relationship between two sets of measurements.

46 /52
split-half
Evaluate the internal consistency of the test by
computing a measure of split-half reliability.

However, the two split-half scores obtained for each
participant are based on only half of the test items,
which underestimates the reliability of the full-length
test. The Spearman-Brown formula corrects for this.

47 /52
Kuder-Richardson
• The Kuder-Richardson Formula 20
estimates the average of all the possible split-
half correlations that can be obtained from all of
the possible ways to split a test in half.

48 /52
Cronbach’s Alpha
• One limitation of the K-R 20 is that it can only
be used for tests in which each item is scored
dichotomously (for example, true/false or yes/no).

• Cronbach's alpha can also be used for items with
scaled scores (for example, rating-scale responses).

49 /52
Inter-rater reliability
• Inter-rater reliability is the degree of
agreement between two observers who have
independently observed and recorded
behaviors at the same time.

• The simplest technique for determining inter-rater
reliability is to compute the percentage of agreement:

percent agreement = (number of agreements / total number of observations) × 100
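The percent-agreement computation described above, sketched with hypothetical ratings:

```python
# Percent agreement = agreements / total observations * 100 (hypothetical data).
rater_a = ["hit", "miss", "hit", "hit", "miss", "hit", "hit", "miss", "hit", "hit"]
rater_b = ["hit", "miss", "hit", "miss", "miss", "hit", "hit", "hit", "hit", "hit"]

agreements = sum(a == b for a, b in zip(rater_a, rater_b))
percent_agreement = 100 * agreements / len(rater_a)
print(percent_agreement)  # 80.0
```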

50 /52
Cohen’s kappa
• The problem with a simple measure of
percent agreement is that the value obtained
can be inflated by chance.
• To correct for chance agreement:

kappa = (PA - PC) / (1 - PC)

where PA is the observed percent agreement and PC
is the percent agreement expected from chance.
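Cohen's kappa follows directly from the two quantities defined above, kappa = (PA - PC) / (1 - PC); a sketch with hypothetical ratings, where PC is computed from the raters' marginal proportions:

```python
# Cohen's kappa = (PA - PC) / (1 - PC), with hypothetical ratings.
ratings_a = ["yes", "yes", "no", "yes", "no", "yes", "no", "no", "yes", "yes"]
ratings_b = ["yes", "no", "no", "yes", "no", "yes", "yes", "no", "yes", "yes"]

n = len(ratings_a)
# PA: observed proportion of agreement
pa = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

# PC: chance agreement, the sum over categories of the product of marginals
categories = set(ratings_a) | set(ratings_b)
pc = sum((ratings_a.count(c) / n) * (ratings_b.count(c) / n) for c in categories)

kappa = (pa - pc) / (1 - pc)
print(round(kappa, 2))  # 0.58
```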

51 /52
Group Discussion
• Identify the two basic concerns with using a
correlation to measure split-half reliability and
explain how these concerns are addressed by
Spearman-Brown, K-R 20, and Cronbach’s
alpha.
• Identify the basic concern with using the
percentage of agreement as a measure of
inter-rater reliability and explain how this
concern is addressed by Cohen’s kappa.
52 /52
