Biostatistics3 2
Biostatistics3 2
3
Percentages and Proportions
Report relative size.
Compare the number of cases in a
specific category to the number of cases
in all categories.
Compare a part (specific category) to a
whole (all categories).
The part is the numerator (f ).
The whole is the denominator (N).
Percentages and Proportions: Example
• What % of social science majors is male?
• of (whole) = all social science majors
• 97 + 132 = 229
• is (part) = male social science majors
• 97
• (97/229) * 100 = (.4236) * 100 = 42.36%
• 42.36% of social science majors are male
Ratios
• Ratio = f1 / f2
• If a town of 2300 had 17 births last year, the birth rate is:
• (17/2300) * 1000 = (.00739) * 1000 = 7.39
• The town had 7.39 births for every 1000 residents.
Percentage Change
• Measures the relative increase or decrease in a variable
over time.
11
Bimodal Distributions
• When a distribution has two “modes,” it
is called bimodal
For example, in the following set the numbers both the numbers
5 and 7 appear twice.
2, 9, 5, 7, 8, 6, 4, 7, 5
5 and 7 are both the mode and this set is said to be bimodal.
12
Multimodal Distributions
• If a distribution has more than 2 “modes,” it is called
multimodal
3, 8, 7, 6, 12, 11, 2, 1
All the numbers in this set occur only once therefore there is
no mode in this set. 13
When To Use the Mode
There is no mode!
The Median
• It is the score in the middle; half of the scores are larger than the
median and half of the scores are smaller than the median
• Conceptually, it is easy to calculate the median
17
Median Example - 2
• What is the median of the following scores:
24 18 19 42 16 12
• Sort the scores:
42 24 19 18 16 12
• Determine the middle score:
middle = (N + 1) / 2 = (6 + 1) / 2 = 3.5
• Median = average of 3rd and 4th scores:
(19 + 18) / 2 = 18.5
18
When To Use the Median
• The median is often used when the distribution of scores is
either positively or negatively skewed
• The few really large scores (positively skewed) or really
small scores (negatively skewed) will not overly influence
the median
19
Mean Calculating the Mean
20
When To Use the Mean
• You should use the mean when
• the data are interval or ratio scaled
• Many people will use the mean with ordinally scaled data too
• and the data are not skewed
$2225
$2225 / 7 = $317.86
23
Quartiles, Deciles and Percentiles
(Measure of Position)
• The median splits the data into equal sized halves
Deciles 1/10
10% 10% 10% 10% 10% 10% 10% 10% 10% 10%
Percentile Computation
• To formalize the computational procedure, let Lp refer to the location of a
desired percentile. So if we wanted to find the 33rd percentile we would
use L33 and if we wanted the median, the 50th percentile, then L50.
Percentiles - Example
Locate the median, the first quartile, and the third quartile for the
below data:
Step 2: Compute the first and third quartiles. Locate L25 and L75 using:
25 75
L25 (15 1) 4 L75 (15 1) 12
100 100
Therefore, the first and third quartiles are located at the 4th and 12th
positions, respective ly
L25 $1,721
L75 $2,205
Quartile Measures
Calculating The Quartiles: Example
Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22
(n = 9)
Q1 is in the (9+1)*25/100 = 2.5 position of the ranked data,
so Q1 = (12+13)/2 = 12.5
Boxplot - Example
Boxplot Example
Step1: Create an appropriate scale along the horizontal axis.
Step 2: Draw a box that starts at Q1 (15 minutes) and ends at Q3 (22
minutes). Inside the box we place a vertical line to represent the median (18
minutes).
Step 3: Extend horizontal lines from the box out to the minimum value (13
minutes) and the maximum value (30 minutes).
Distribution Shape and
The Boxplot
Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3
Quartile Measures:
The Interquartile Range (IQR)
― The IQR is Q3 – Q1 and measures the spread in the middle 50% of the data
― Measures like Q1, Q3, and IQR that are not influenced by outliers are called
resistant measures
The Interquartile Range
Example:
Median X
X Q1 Q3
(Q2) maximum
minimum
25% 25% 25% 25%
11 12.5 16 19.5 22
Interquartile range
= 19.5 – 12.5 = 7