03 Statistics in Analytical Chemistry
03 Statistics in Analytical Chemistry
ANALYTICAL
CHEMISTRY
Statistics in Analytical Chemistry
Mean
Average of a set of values
Median
Mid-point of a set of values
Population
A collection of an infinite number of measurements
carrying the property being studied
N infinity
Sample
A finite set of measurements which represent the
population
Some useful statistics terms
Collective Outliers
A subset of data objects collectively deviate
significantly from the whole data set, even if the
individual data objects may not be outliers
Outliers
Collective Outliers
Detection of collective outliers
Consider not only behavior of individual objects, but also
that of groups of objects
Need to have the background knowledge on the
relationship among data objects, such as a distance or
similarity measure on objects
Outliers
Simple example
Global outlier
A fist-size meteorite impacting a house in your
neighbourhood
It’s a truly rare event that meteorites hit buildings
Contextual outlier
Your neighbourhood getting buried in two feet of snow
If snowfall happened in middle of summer and you
normally don’t get any snow outside of winter
Outliers
Simple example
Collective outlier
Every one of your neighbours is moving out of the
neighbourhood on the same day
Although it’s definitely not rare that people move from
one residence to next, it is very unusual that an entire
neighbourhood relocates at same time
Rejecting Outliers
=10.10
Precise, biased
= 10.01
Imprecise, unbiased
= 9.90
Imprecise, biased
= 10.01
Precise, unbiased
Experimental Errors
Gross error
They are so serious that there is no alternative to
abandoning the experiment and making a completely
fresh start
E.g.
A complete instrument breakdown
Accidentally dropping or discarding a crucial sample
Discovering during the course of the experiment that a
supposedly pure reagent was in fact badly contaminated
Sources of Errors
E.g.,
Consider a titration in which same 25.00 mL pipette is
used to dispense portions of sample for replicate
determinations
If the volume of pure water delivered by the pipette at
a specified temperature is ±0.03 mL
Itmeans, volume might be 24.97 mL (a systematic error
of -0.03 mL) or 25.03 mL (a systematic error of +0.03
mL)
In theory error can be determined by calibrating it
through weighing replicate volumes dispensed by the
pipette, and converting the mass of water to volume
Error and Uncertainty
Significant Figures
Significant figures reflect the accuracy and precision of
a given result
A result should always be rounded to the number of
figures that are consistent with the confidence that can
be placed on it
Thus, the number of significant figures is the number
of digits remaining after the data is rounded
Reported values should contain only significant figures
Reporting Analytical Data
Significant Figures
A value is made up of significant figures when it
contains all digits known to be true and one last digit
in doubt
E.g.,if a value is reported as 18.86 mg/l, the 18.8 must be
firm while the 0.06 is somewhat uncertain
Final zeros after a decimal point are always meant to
be significant figures
E.g. 9.800 g would be considered as having 4 significant
figures
Reporting Analytical Data
Significant Figures
Zeros before a decimal point with non-zero digits
preceding them are assumed to be significant
With no preceding non-zero digit, a zero before the
decimal point is not considered significant
With no non-zero digits preceding a decimal point, the
zeros after the decimal point but preceding other non-
zero digits are not considered to be significant. These
zeros only indicate the position of the decimal point
Reporting Analytical Data
Significant Figures
Final zeros in a whole number may or may not be
significant (e.g., in 1,000 g, zeros may not be
significant and only indicate magnitude of the number)
A good way to determine if zeros interspersed in a
number are significant or not is by determining
whether they can be dropped by expressing the number
in exponential form
E.g., no zeros can be dropped when expressing 100.08 g
in exponential form; therefore, the zeros are significant
However, 0.0008 g can be expressed in exponential form
as 8 x 10−4 g; therefore, the zeros are not significant
Reporting Analytical Data
Significant Figures
Once the number of significant figures obtainable from
a given type of analysis is established, data resulting
from such analyses are reduced according to set rules
for rounding
Reporting Analytical Data
xi = individual measurement
= mean of infinite number of measurements (true
value)
n = number of measurements
Standard Deviation ()
Standard deviation
Estimated Standard Deviation, s (n < 30)
s=
For finite sets the precision is represented by ‘s’
Calculating the Standard Deviation
100%
Relative Standard Deviation
100%
Standard Error of mean
SEM (Smean) =
Test of Significance