1 - BIOL2163 Intro To Statistics
1 - BIOL2163 Intro To Statistics
Biostatistics
Email : Luke.Rostant@sta.uwi.edu
Lecture Times : Wed, 12-1pm, BBC
Thu, 12-1pm, BBC
Office Hours : online by appointment
1
For those without access to a
computer off campus:
• Please note the following guidelines concerning the Loaning of Laptops:
• Students in need of a laptop are asked to email
deputy.principal@sta.uwi.edu.
• Students who request a laptop will be asked to sign a loan agreement.
• There is no cost involved.
• Only registered and financially cleared students will be allowed to borrow
these devices.
• Loans will be for up to 28 days at a time and can be renewed.
• Loans will be arranged by The Alma Jordan Library (AJL), via the main
office at Sir Arthur Lewis (SAL) Hall.
• Look out for further updates and for more information about the Office of
the Deputy Principal, please visit https://sta.uwi.edu/deputyprincipal/
2
Class representatives
• Alexei Sanguinette
• alexei.sanguinette@my.uwi.edu
• Karine Khan
• karine.khan@my.uwi.edu
3
BIOL 2163
Biostatistics
Myelearning will be the only medium for distributing
material.
For students experiencing problems with myelearning or
their my.uwi email:
Please check MySecureArea in the Student Portal for any
existing holds on your account that may prevent access to
Myelearning. Please check the Registration Page for the list
of holds and whom to contact for resolution.
1-1 Overview
1-1 Supplemental - Statistical Thinking
1-2 Types of Data
1-2 Supplemental - Critical Thinking
1-3 Design of Experiments
5
Section 1-1
Overview
6
Preview
Polls, studies, surveys and other data collecting
tools collect data from a small part of a larger
group so that we can learn something about the
larger group. This is a common and important
goal of statistics:
Learn about a large group by examining data
from some of its members.
7
Preview
8
Data
Data
collections of observations (such as
measurements, genders, survey responses)
9
Statistics
Statistics
is the science of planning studies and
experiments, obtaining data, and then
organizing, summarizing, presenting,
analyzing, interpreting, and drawing
conclusions based on the data
10
Population
Population
the complete collection of all individuals
(scores, people, measurements, and so on)
to be studied; the collection is complete in
the sense that it includes all of the
individuals to be studied
11
Census versus Sample
Census
Collection of data from every member of
a population
Sample
Subcollection of members selected from
a population
12
Chapter Key Concepts
13
Section 1-1 Supplemental
Statistical Thinking
14
Key Concept
15
Key Concept (continued)
16
Context
17
Source of data
20
Practical Implications
21
Statistical Significance
22
Section 1-2
Types of Data
23
Key Concept
24
Parameter
Parameter
a numerical measurement describing some
characteristic of a population.
population
parameter
25
Statistic
Statistic
a numerical measurement describing some
characteristic of a sample.
sample
statistic
26
Parameters vs. Statistics
A freshwater pond is stocked with 500
tilapia. All 500 fish are weighed.
Parameter: Average weight of the 500 fish
28
Categorical Data
30
Quantitative Data - Discrete Data
Discrete data
result when the number of possible values is either
a finite number or a ‘countable’ number
(i.e. the number of possible values is
0, 1, 2, 3, . . .)
32
Levels of Measurement
33
Nominal Level
Can be used for counts, but should not be used for calculations such
as averages, totals etc., even if coded as numbers
34
Ordinal Level
Ordinal level of measurement
involves data that can be arranged in some order, but
differences between data values either cannot be
determined or are meaningless
Can be used for counts, but not be used for averages, totals etc.
35
Interval Level
Interval level of measurement
like the ordinal level, with the additional property that
the difference between any two data values is
meaningful, however, there is no natural zero starting
point (where none of the quantity is present)
Years 1000, 2000, 1776, and 1492 (Year zero does not mean
time did not exist before it).
38
Recap
39
Section 1-2 Supplemental
Critical Thinking
40
Key Concepts
Success in the introductory statistics course
typically requires more common sense than
mathematical expertise.
Improve skills in interpreting information based
on data.
This section is designed to illustrate how common
sense is used when we think critically about data
and statistics.
Think carefully about the context, source,
method, conclusions and practical implications.
41
Misuses of Statistics
To correctly interpret a graph, you must analyze the numerical information given in the
graph, so as not to be misled by the graph’s shape. READ labels and units on the axes!
43
Pictographs
44
Bad Samples
Voluntary response sample
(or self-selected sample)
45
Correlation and Causality
46
Small Samples
47
Percentages
48
Loaded Questions
51
Missing Data
53
Precise Numbers vs. Accurate Numbers
Percentage of voters who support Donald
Trump.
Precise figure: 43.15%
Imprecise figures: Approximately 40%,
Between 40 to 45%
Because a figure is precise, many people
incorrectly assume that it is also accurate.
A precise number can be an estimate, and it
should be referred to that way. 54
Deliberate Distortion
55
Recap
56
Section 1-3
Design of Experiments
57
Key Concept
If sample data are not collected in an
appropriate way, the data may be so
completely useless that no amount of
statistical torturing can salvage them.
Method used to collect sample data
influences the quality of the statistical
analysis.
Of particular importance is simple random
sample.
58
Bad Samples
Voluntary response sample
(or self-selected sample)
59
Basics of Collecting Data
60
Observational Study
Observational study
observing and measuring specific
characteristics without attempting to modify
the subjects being studied
61
Types of Observational Studies
Cross sectional study
data are observed, measured, and collected at one
point in time
Experiment
apply some treatment and then observe its effects
on the subjects; (subjects in experiments are called
experimental units) – i.e. we attempt to modify the
subjects being studied
Confounding
occurs in an experiment when the experimenter is
not able to distinguish between the effects of
different factors.
66
Controlling Effects of Variables in Experiments
To Prevent Confounding: Use a Randomized Block Design
67
Controlling Effects of Variables in Experiments
Randomized Block Design
68
Controlling Effects of Variables in
Experiments
Blinding
is a technique in which the subject doesn’t know
whether he or she is receiving a treatment or a
placebo. Blinding allows us to determine whether
the treatment effect is significantly different from a
placebo effect, which occurs when an untreated
subject reports improvement in symptoms.
69
Controlling Effects of Variables in
Experiments
Double-Blind
Blinding occurs at two levels:
(1) The subject doesn’t know whether he or she is
receiving the treatment or a placebo
70
Controlling Effects of Variables in Experiments
Completely Randomized Experimental Design
assign subjects to different treatment groups through a
process of random selection
Randomized Block Design
a block is a group of subjects that are similar, but blocks
differ in ways that might affect the outcome of the
experiment
Rigorously Controlled Design
carefully assign subjects to different treatment groups, so
that those given each treatment are similar in ways that are
important to the experiment
Matched Pairs Design
compare exactly two treatment groups using subjects
matched in pairs that are somehow related or have similar
characteristics 71
Replication
Replication
is the repetition of an experiment on more than
one subject. Samples should be large enough so
that the erratic behavior that is characteristic of
very small samples will not disguise the true effects
of different treatments. It is used effectively when
there are enough subjects to recognize the
differences from different treatments.
Randomization
is used when subjects are assigned to different
groups through a process of random selection. The
logic is to use chance as a way to create two groups
that are similar.
73
Summary
Why Sample?
75
Sampling Strategies
Types of Samples
76
Sampling Strategies
Probability Sample
selecting members from a population in such a way
that each member of the population has a known
(but not necessarily the same) chance of being
selected
77
Sampling Strategies
Random Sample
members from the population are selected in such
a way that each individual member in the
population has an equal chance of being selected
79
Systematic Sampling
Select some starting point and then
select every kth element in the population
80
Stratified Sampling
subdivide the population into at
least two different subgroups that share the same
characteristics, then draw a sample from each subgroup (or
stratum)
81
Cluster Sampling
divide the population area into sections
(or clusters); randomly select some of those clusters; choose all
members from selected clusters
82
Multistage Sampling
83
Sampling Strategies
Non-Probability Samples
84
Sampling Strategies
Non-Probability Samples
85
Convenience Sampling
use results that are easy to get
86
Methods of Sampling - Summary
Nonsampling error
sample data incorrectly collected, recorded, or
analyzed (such as by selecting a biased sample, using
a defective instrument, or copying the data
88
incorrectly)
Recap
89