Basic Mathematics For Biochemists
Basic Mathematics For Biochemists
for Biochemists
To Joy and Jenny
Basic
Mathematics
for Biochemists
ATHEL CORNISH-BOWDEN
Lecturer in Biochemistry,
University of Birmingham
ISBN-13: 978-0-412-23010-3
This title is available in both hardbound and paperback editions. The paperback
edition is sold subject to the condition that it shall not, by way of trade or
otherwise, be lent, re-sold, hired out, or otherwise circulated without the
publisher's prior consent in any form ofbinding or cover other than that in which
it is published and without a similar condition including this condition being
imposed on the subsequent purchaser.
All rights reserved. No part of this book may be reprinted, or reproduced or
utilized in any form or by any electronic, mechanical or other means, now
known or hereafter invented, including phytocopying and recording, or in any
information storage and retrieval system, without permission in writing from
the publisher.
Cornish-Bowden, Athel
Basic mathematics for biochemists.
1. Biological chemistry 2. Mathematics
I. Title
510'.2454 QD42
ISBN-\3: 978-0-412-23010-3 e-ISBN-\3: 978-94-011-6523-5
DOl: 10.1007/978-94-011-6523-5
Contents
Page
Preface ix
3 Differential Calculus 48
3.1 Co-ordinate geometry 48
3.2 Slope of a curve 50
3.3 Rapid differentiation 52
3.4 Derivatives of sums and products 54
3.5 Derivative of a 'function of a function' 57
3.6 Derivative of a ratio 58
3.7 Higher derivatives 59
3.8 Notation 59
3.9 Maxima and minima 61
3.10 A note on terminology 64
3.11 Points of inflexion 65
3.12 Sketching curves 68
3.13 Problems 70
4 Integral Calculus 72
4.1 Increases in area 72
4.2 Definite and indefinite integrals 74
4.3 Simple integrals 78
4.4 Integral of Ilx 80
4.5 Differential equations 82
4.6 Numerical integration: evaluating the area under a curve 85
4.7 Problems 88
5 Solving Equations 91
5.1 Linear equations in one unknown 91
5.2 Rearranging equations 92
5.3 Simultaneous linear equations 93
5.4 Determinants 96
5.5 Quadratic equations 99
5.6 Graphical solution of equations 102
5.7 Newton's method 107
5.8 Approximate methods 109
5.9 Problems 113
Contents / vii
6 Partial Differentiation 117
6.1 Meaning of a partial derivative 117
6.2 Exact and inexact differentials 118
6.3 Least-squares fitting of the Michaelis-Menten equation 120
6.4 Problems 124
Index 132
Preface
1.1 Introduction
Lord Kelvin once remarked that all of science could be divided into
physics and stamp collecting. Although this rather patronizing
comment of a physicist is not one that will appeal to all biochemists, it
has an element of truth in it. A science can hardly claim to be a science
as long as it remains no more than a catalogue of unrelated
observations. Only when general laws can be proposed and tested by
experiment can it be said to have passed from mere description into
science. In chemistry the transformation from stamp collecting into
science corresponded with the development of thermodynamics and,
later, the atomic theory and theories of chemical bonding; in
biochemistry, the gradual realization that understanding of life
processes requires a foundation of physical chemistry and not just a
list of metabolic reactions has played a corresponding role. It is no
coincidence that mathematics has been central in all of these
developments, and it is now almost impossible to comprehend even
elementary biochemistry without a grasp of elementary mathematics.
Fortunately for non-mathematically minded biochemists, however,
the mathematics necessary for an undergraduate course in bio-
chemistry is nearly all elementary, and nearly all of it has been touched
on in every science student's previous education. Little more is
required, therefore, than to identify the parts of elementary math-
ematics that are important in biochemistry and to reinforce them with
appropriate examples.
In mathematics itself, the transformation from description into
science is paralleled by the development from arithmetic, which is
concerned with numbers and their manipulation, into algebra.
Arithmetic is very useful, but it is much too limited to satisfy all of the
needs of science. In arithmetic, every problem is a new and separate
2 / The Language of Mathematics
problem, and it is difficult to make useful generalizations and hence to
express scientific laws. Let us consider the simple biochemical
example in Table 1.1, which shows a set of rates of a reaction
measured at the substrate concentrations given. As it stands the table
is no more than a description of the results of a particular set of
measurements and as long as we treat the numbers just as numbers we
cannot make a general or useful statement about the enzyme to which
they refer. The table tells us the rates observed at substrate
concentrations of 5 mM and 10 mM but offers no guidance about
what rate to expect at 8 mM; it does not tell us whether the system
studied was behaving in accordance with some general law; it offers no
clue as to what general law there might be. To remedy these omissions
we must move beyond arithmetic into algebra, because only then will
we be able to recognize a pattern or regularity in the numbers, and
express it so that it can be recognized again if it occurs with another
system. If the rates in Table 1.1 are represented as v and the substrate
concentrations as s, then the following equation expresses a law that
defines all of the numbers in the table:
lOs
v=--
4+s
This equation has two advantages over the table: first, it allows a
summary of all of the information while occupying much less space;
secondly, it predicts what v values we might expect to observe at s
values that are not included in the table. For example, it answers the
question above by predicting that v = 6.67 jiM s - 1 when s = 8 mM.
This is clearly more useful than just listing a set of numbers recorded
on a particular occasion.
One can proceed one stage further with this example by noting that
the equation is typical of what is reported for many enzymes, and so if
Substrate concentration
(mM)
1 2.00
2 3.33
5 5.56
10 7.14
20 8.33
Priority rules for operators / 3
we replace the numbers 10 (JlM s -1) and 4 (mM) by V and K m ,
respectively, we have an equation that expresses a generalization
about enzymes:
Vs
v=---
Km+ s
Again, replacing numbers with symbols has increased the generality of
what we want to say. In addition to the symbols v, V, sand K m , which
represent numbers, either particular ones or generalized ones, the
equation contains three operators: one is represented by the addition
sign +; one is shown by the horizontal line between Vs and Km + s;
and the third is implied by the juxtaposition of Vand s, but could have
been made explicit by writing V· s instead of Vs. Each operator
specifies something to be done to the numbers or symbols operated
on: the + sign requires Km and s to be added together; the horizontal
line requires Vs to be divided by Km + s; the juxtaposition of Vand s
(or a dot between them) requires them to be multiplied together.
As long as no ambiguity is possible mere juxtaposition is sufficient
to indicate multiplication, but if several numbers are to be multiplied
together, or if we allow algebraic symbols consisting of more than one
letter each (as in Fortran and many other computer languages), or if
ambiguity is possible for some other reason, multiplication can be
indicated by a dot or a cross. The dot is more common for pairs of
symbols (where no confusion with the decimal point is possible) and
the cross is more common for pairs of numbers, but the two symbols
have the same meaning in most contexts, i.e. V x s means the same as
V . s. (In some specialized applications, such as in vector algebra, it is
convenient to assign distinct meanings to . and x, but these need not
concern us in elementary biochemistry). In current usage the dot
should be written above the line and the decimal point on the line, e.g.
5.1· 8.7 = 44.37 not 5·5.8·7 = 44·37 but this is rather a recent
convention so far as British books are concerned: in older British (but
not American) work one is likely to encounter exactly the opposite
convention.
A * A/B/C/D + AlB * C
would be unambiguous in a Fortran program and would have the
meaning
IX i =X I +X 2 +X 3 + ... +X.
i= I
The limits i = 1 and n written above and below the sign mean 'start
adding at i = 1 and continue until i = n'. If the limits are obvious, as
for example in statistical calculations where one often has to sum over
all of a set of observations, they can be omitted.
Although the summation sign is a considerable convenience when
the underlying calculation is understood, it can sometimes obscure
the meaning when it is not. Indeed, one of the main reasons why more
advanced mathematics than one is familiar with can appear much
more difficult than it actually is, is that it often uses special notation to
express results more compactly. For example, the whole of matrix
algebra is a way of expressing very complicated relationships in an
extremely compact way: very convenient when one is familiar with the
symbolism but baffling when one is not. Whenever obscurity
threatens for this sort of reason it is often helpful to translate the
compact expressions into a more long-winded form, and then their
meanings are likely to become clearer.
A corresponding sign for products also exists, although it is much
less often encountered than the summation sign. It is written as the
Greek capital pi, Il, and is used in an exactly analogous way, i.e .
•
Il
i= I
Xi=X I X2 X 3'" X.
1.4 Functions
A mathematical function can be regarded as a set of instructions to
carry out a series of operations on a variable or set of variables. For
Functions f 7
example, if we define v in terms of s as
Vs
v=---
Km+ s
where V and Km are constants, then we are defining v as afunction of s,
by defining what operations have to be carried out on s to obtain v. We
can also have functions of more than one variable. For example, v may
be determined not solely by a single concentraion s but may depend
both on s and on another concentration i:
Vs
v=------
Km(1 + if K i ) + s
and now we say that v is a function of both sand i.
Sometimes we may wish to symbolize the existence of a depen-
dence of one variable on another without specifying what the
dependence is. We then often use the symbol I( ) or something
similar, e.g.
v = f(s, i)
which states that v depends on sand i but does not indicate whether
the dependence follows the equation given above or some other
equation.
There are a number of functions that are so often required in
mathematics that they are given special symbols. For example, if y is
always given by multiplying x by itself we say that y is the square of x
and symbolize the relationship as
y = x2
where the superscript 2 indicates that 2 x's need to be multiplied
together. Conversely, x in this example is the square root of y, which
we may write as
x=Jy
or, more commonly and for reasons that I shall discuss in Chapter 2,
we may express the same relationship as
X = Y'l
v = Vs/(K m + s)
K = exp(AS O _ AHo )
R RT
where exp( ) is the exponential function (Chapter 2), and ASo, AHo
and R are constants. Thus although K may be constant at constant T
it varies with T and so if we are concerned with changes in
temperature we must treat K as a variable.
Dimensional analysis / 9
It will be evident that the distinction between a constant and a
variable is not absolute, because any variable becomes a constant if
the conditions that determine its variation are kept constant. In some
contexts the distinction is so difficult or inconvenient to make that we
introduce a third term, parameter, to denote a quantity that is treated
as variable for some purposes and constant for others. This occurs
especially when we use statistical methods to estimate unknown
constants: although we may believe it to represent a true physical
constant we must treat it as a variable during the process of
estimation. Suppose, for example, that we have a measured quantity y
that depends on a controlled variable x according to the following
linear equation:
y = a+bx+e
in which a and b are the physical constants whose values we require
and the third term e on the right-hand side represents experimental
error. This error term prevents us from calculating a and b exactly
from measurements of y at two values of x. Instead, we would
measure y at several values of x and try to find values of a and b that
predicted the observed y values as closely as possible (I shall discuss a
similar problem in more detail in Chapter 6). During the analysis we
are free to try any values of a and b that we like: in other words, we
treat them as variables, even though we may believe that some
physical reality exists in which they have constant, but unknown,
values. To express this duality we use the word parameter for a
quantity such as a or b that controls the dependence between our
observable variables.
x
Intercept has dimensions of x
(al
"
~ Intercept has
dimensions of !:I
Intercept has
dimenSions of X
x
(b)
Fig. 1.1. Dimensional analysis as applied to graphs: (a) straight line; (b) arbitrary
curve.
10 20 30 40 x
50
O~--~--~---L--~--~~
(b)
o 10 20 30 40
x
50
!f
(e)
Fig. 1.2. Common faults in plotting graphs: (a) wasted space caused by poorly
chosen scales; (b) crowding caused by poor choice of abscissa scale; (c) satisfactory plot
of the same data;
(d)
(e)
x
!J3 !J3
•
•
2 •
•
•
•
o~--~--~----~. 00
o 2 3
x
If) (9)
Fig. 1.2. (d) spurious inflexion caused by failing to note a sudden change in scale;
(e) misleading axes suggest a much larger slope than actually exists; (f) straight line
drawn through points that demand a curve; (g) same data without the line;
16 / The Language of Mathematics
3
x
(h)
!I 0-8
0-6
0-4
0-2
0
4 6 8
(i) pH
Fig. 1.2. (Con/d.) (h) systematic lack of fit evident in all three plots; (i) points
connected by meaningless straight lines.
way, the labelling must make it obvious that the break exists, to avoid
giving a misleading impression. To connect the points with a line
across the break, as in (d), is grossly misleading.
In (e), the graph may be quite legitimate, but often graphs that
resemble this one are intentionally misleading and therefore dis-
honest. To the casual eye the plot suggests that changing x brings
about a substantial change in y. Only by checking the labelling does
one see that the change in y is very small. Very much worse than what is
shown here is to present the same sort of graph but to omit some or all
of the labelling. Politicians and advertisers are fond of this kind of
graph but it has no place in science.
Plotting graphs / 17
The graph in (f) shows a very common fault: a straight line is drawn
through points that demand a curve. The same points are shown in
(g) without the line, and the curvature is somewhat more obvious: a
line tends to bias the eye and it is prudent, therefore, to examine a set
of points carefully before drawing a line through them. It is
illuminating to take any issue of any research journal of biochemistry
at random and search for examples of straight lines drawn through
points that do not fit straight lines. They are not hard to find. A useful
habit to cultivate is to examine all experimental graphs - your own
and other people's - with your eye close to the paper and looking
along the plotted line (Fig. 1.3). This greatly emphasizes any sys-
tematic failure of the points to lie on the line. In real experiments, of
course, there is always some experimental error, so we should not
expect an exact fit. But experimental error should be random, and so
experimental points should be randomly scattered about the line, not
systematically.
When there are very few points on each line, as in (h), it is very
difficult to distinguish between random scatter and systematic lack of
fit. One answer is of course to obtain more observations, but if several
lines show the same sort oflack of fit, as in (h), there is good reason to
believe it is systematic.
Finally, (i) shows a fault that seems to be particularly common in
plots of pH dependences. Although any reasonable model would
place the points on a smooth curve, they have been connected by
~ l~
L-'
(~
Fig. 1.3. How to inspect a graph for lack of fit.
18 / The Language of Mathematics
straight-line segments. This is not only unattractive, it also mis-
represents the form of the dependence, especially in regions of high
curvature: in (i) the maximum ought to be at pH 5.85, not at pH 6.1,
and it should be at a height of 0.82, not 0.79. In experiments where
there has been no attempt to account for the observations mathemati-
cally it may sometimes be justifiable to connect points by straight-line
segments simply to display them more prominently, but this should
never be done in work with any mathematical pretensions.
1.8 Precision
One of the commonest faults in scientific writing is to report results
with an inappropriate degree of precision. For example, if a sample of
100 ml of blood is found to contain 93 mg of sugar, the concentration
in mM (expressed in terms of glucose, of relative molecular mass 180)
is 1000 x 93/(100 x 180), which can be variously written as 5 mM (to
one significant figure), or 5.17 mM (to three significant figures), or
even 5.1666667 mM (to 8 significant figures), etc. Which of these is
appropriate for reporting the result? Common sense tells us that
5 mM is rather more vague than the data would permit, whereas
5.1666667 mM is ridiculously precise, and 5.17 mM seems a reason-
able compromise; but how can we justify such a compromise
logically? If we examine the four numbers used in the calculation, we
see that 1000, the factor for converting from M to mM, is exact and we
could write it as 1000.000 ... if we wished, 93 mg and 100 ml are of
unknown accuracy, and 180 is correct to 3 significant figures, i.e.
the true value is closer to 180 than to 179 or 181. Without knowing the
conditions of the experiment we cannot know how accurately the
mass and volume were measured, but it seems reasonable to guess that
the volume was chosen to be 100 ml by the experimenter and that it is
correct to at least 3 significant figures. This leaves the 93 g as the
probable main source of inaccuracy and the main one we have to
consider: just as a chain is no stronger than its weakest link, the result
of a calculation (in simple cases at least) is no more accurate than the
most inaccurate value used in it. Presumably the true mass was closer
to 93 g than to 92 g or 94 g, but there is no reason to suppose it was
93.00 g rather than, for example, 92.68 g, but this latter value would
give a cakulated concentration of 5.1488889 mM (to 8 significant
figures). Comparing this with the original value of 5.1666667 mM we
see that only the first two digits are reliable, though the third is not
Precision / 19
worthless: consequently 5.17 mM seems a reasonable way of express-
ing the concentration, with the understanding that the last (or least
significant) digit is unreliable. To include any more than three
significant figures in this result would be to claim that it is more
accurate than it is. As a general rule one should express a result with
no more (or only a little more) precision than the accuracy of the data
from which it was calculated.
Including too many significant figures may seem to be a harmless
fault - after all, the reader can always ignore the unwanted digits. It is
not harmless, however, because it can mislead the reader into thinking
you have measured more than you have, and it can make you look
a fool. Expressing the concentration calculated above as
5.1666667 mM is just as silly as claiming that the fact that grass is
green proves that it contains chlorophyll. The observation and the
conclusion in this case are both true, and they are relevant to ont!
another; nonetheless, the conclusion does not follow from the
observation. Similarly, the true concentration of glucose could
conceivably be 5.1666667 mM, but the available information does
not justify saying so.
Inaccuracy can result not only from inaccuracy in the data; it can
also arise from imprecise arithmetic during the calculation, especially
if several steps are involved. It is advisable therefore to carry out the
intermediate stages of a calculation with more precision (more
significant figures) than one expects to retain at the end. However, it is
not only very tedious to carry out every step with, for example, fifteen
significant figures if only three are to be retained at the end; it is also
unnecessary and it encourages mistakes. A fairly reliable rule is to use
one more significant figure in the calculations than one expects to
retain in the final result. This rule needs to be broken, however, in
calculations that involve small differences between nearly equal
numbers: for example, if a beaker weighs 10.3814 g empty but
10.493 1 g after addition of a sample of a chemical it would be unwise
to calculate the weight of chemical by substracting 10.4 g from 10.5 g
because this would give only one significant figure in the answer
whereas the data would support four.
At the end of a calculation the unwanted digits have to be discarded.
This can be done either by truncation or by rounding; thus 9.358 1 can
be truncated to 9.35 or it can be rounded to 9.36. In truncation we
simply omit the unwanted digits; in rounding we increase the last
retained digit by 1 if the portion of the number to be discarded begins
20 / The Language of Mathematics
with 5,6,7,8 or 9. Rounding is slightly more accurate than truncation,
and is the usual practice in science.
In our own measurements we know (or should know) how precise
they are, but in other people's there may be ambiguity. We had an
example of this at the beginning of this section: the value of 180 given
for the relative molecular mass of glucose is actually correct to three
significant figures, but there is no way of knowing this from the way
the number is written; it would still be written as 180 if the true value
were 177 but for some reason was given to only two significant figures.
If it is important to avoid this ambiguity we can state the precision
explicitly, for example by giving the value as '180 (correct to three
significant figures)', but this is cumbersome for ordinary use and we
normally assume that we can gauge the precision by counting the
number of significant figures. This is somewhat more subtle than it
sounds, because although all non-zero digits are counted, zeroes may
or may not be counted depending where in the number they occur.
The simplest way of counting significant figures (which is easier to do
than to describe) is to proceed as follows: (1) identify the first
significant digit, which is the first non-zero digit encountered on
reading the number from left to right; (2) identify the last significant
digit, which is the last non-zero digit if the decimal point is not
explicitly included, but the last digit of any kind if the decimal point is
included; and (3) count all the digits (including zeroes), starting from
the first and finishing with the last significant digit. This procedure
may be illustrated by the following examples, in which significant
zeroes are shown in italics and the number of significant figures is
shown in parenthesis after each number: 107 (3); 13.064 (5); 0.120 (3);
0.12 (2); 100 (1); 100.0 (4); 003 (1). (Zeroes are not usually written in
front of integers, as in 003, which would normally be written simply as
3, but if they are written they are not significant.)
We do not usually italicize significant zeroes and it is cumbersome
to add a parenthesis specifying the precision. There is, however, a
third way of indicating that a value such as 100 ml is intended to have
three significant figures: we can choose different units that make the
precision clear, e.g. we can write it as 0.100 litre, or we can achieve the
same result by relating the value to a power of 10 (see Section 2.1, p.
25), e.g. by writing it as 1.00 x 102 ml. These representations are often
used in scientific writing.
Statistical methods are not discussed in this book, but a mention of
them is appropriate here because they can provide estimates of the
Problems / 21
precision of calculated values with less of the guesswork and
plausibility arguments that have characterized much of this section.
When such a precision estimate exists it can be written after the value
prefixed by a ± sign ('plus-or-minus') thus: 5.17 ± 0.08 mM. There is
rarely any point in reporting a precision estimate with more than one
or two significant figures, and the value it qualifies should be reported
with the same number of decimal places, i.e. the same number of
significant digits after the decimal point. If there is no decimal point
the two numbers should have the same number of insignificant zeroes.
Thus we might write 81.031 ±0.006 or 135200 ± 1400, but there
would be little point in writing 135217 ± 1400 because the right-hand
17 would clearly be meaningless and so better expressed as 00.
Finally, I should say a word about the difference between accuracy,
which is concerned with the truth of what we say, and precision, which
is concerned with how we say it. For example, the constant 1! has a
value of about 3.14159, and so the expression 1! = 3.58937 may be
very precise but it is not at all accurate, whereas the expression 1!
= 3.14 is much less precise but much more accurate. Precision is
always within our control, but accuracy often is not. The main theme
of this section has been that we should aim to express results so that
their precision is only a little greater than their accuracy: less precision
than this discards good information; more makes us look foolish.
1.9 Problems
(1.1) If Xl = 1, x 2 = 3, X3 = 2, x 4 = 7, Xs = 6, X6 = 9, what are the
values of the following expressions?
4 6
(c) L ix i ; (d) L xf
i = 1 i =1
(1.2) The sample variance S2 of a set of n numbers, Xl' X 2 . . . X n ' can
be defined as
(b) The intercept on the [S]/v axis of a plot of [S]/v against [S] is
Km/ V
(c) The slope of a plot of v against viES] is -1/ Km.
(1.5) Evaluate the following expressions:
(a) 8 +6/3 +4
(b) 8+6/(3+4)
(c) (8 + 6)/(3 + 4)
(d) 2 3 + 32
(e) 23 x 32
(f) 232
(g) (2 3)2
(1.6) Different methods of determining the average relative mole-
cular mass Mr of macromolecules in solution yield different kinds of
average. For example, osmotic-pressure measurements yield a number
average M n' light-scattering measurements yield a weight average M w'
and equilibrium sedimentation yields a Z-average M z. These are
defined as follows:
"n.M.
M =_L._'_'.
n L.nj
" '
Problems / 23
where ni is the number of mol of a species with Mr = M j and the
summations are carried out over all solute species. Evaluate all three
averages for a solution of protein existing as 27 nM monomer, 17 nM
dimer and 1.3 nM tetramer, assuming that Mr = 72000 for the
monomer.
(1.7) Give the results of the following calculations with appropriate
precision:
(a) 17.3 x 1.38221
(b) 2.571/(6.331- 2.111)
(c) 1.3359 x (35.6579 - 35.6112)
(d) 2.1 x 4.361
(1.8) How many significant figures does each of the following values
contain?
(a) 23.007050
(b) 0.007050
(c) 135000
(d) 1.350 x 105
(e) 10.37
2 Exponents and Logarithms
We get similar results for any number, not just 2, raised to the ith
power, and in general we can define:
a3 = aaa
a2 = aa
a1 = a
aO = 1
a- 1 = 1/a
a- 2 =1/a 2
a- i = lid
The relationship aO = 1 must be noted particularly. Although it may
seem surprising at first sight it arises quite naturally from the series
that was discussed above.
The main use of integer powers greater than the second or third in
biochemistry is for convenient representation of very large or very
small numbers. For example, the number of molecules in 1 mol of a
substance is about 600 000 000 000 000 000 000 000, whereas the num-
ber of gram-ions of H+ in a litre of 1M-NaOH is about
0.00000000000001. Numbers such as these are very inconvenient to
manipulate and even to read if they are written in the ordinary way.
(This is perhaps borne out by the fact that in typing the manuscript of
this book I needed three attempts at typing this last number before it
was correct.) In practice, therefore, it is usual to relate very large or
very small numbers to the nearest power of 10: thus, the first of the
two numbers mentioned is 6 times greater than 1023 , and is therefore
usually written as 6.0 x 10 23 , whereas the second is equal to 10- 14 . In
practice in experimental science we usually simplify even further, at
least with dimensioned quantities, by using multiples or submultiples
26 / Exponents and Logarithms
of units: for example, the biochemist often has to deal with
concentrations around 0.000 01 M. This could be written as 10- 5 M,
but it leads to fewer mistakes in arithmetic if we use a smaller unit of
concentration and write it as 10 IlM, where the Il (spoken aloud as
'micro' in this use) is a standard prefix meaning 10- 6.
In principle we could relate large and small numbers in this way to
powers of any number, but it is normally convenient to use powers of
10 because 10 happens to be the basis of our ordinary number system.
To avoid confusion we always use powers of 10 even for numbers that
have simple expressions as powers of other numbers. Thus we would
express 2401 as 2.401 x 103 even though it happens to be precisely 74 .
t = 0, n = no
t = r, n = 2no
t = 2r, n =4no
t = 3r, n = 8n o
and, in general, at
t= Zt, n = 2zno
60
40
20
(a) Generations
6
In (nino)
3
2
2
2 3 4 5
(b) Generations
Fig. 2.1. Logarithms to base 2 provide a convenient way of relating population size
to number of generations, for populations that double in size for each generation.
one can obtain the equilibrium constant for the complete process by
multiplying all of the individual equilibrium constants together:
where R = 8.31 J mol- 1 is the gas constant and Tis the temperature in
kelvins. Nonetheless although ~GO has a fundamental thermo-
dynamic meaning the factor - 2.3RT is irrelevant to many of the
biochemical contexts in which a logarithmic scale is useful. Living
systems hardly ever exploit temperature effects - indeed they usually
try to avoid them altogether - and so in metabolism - 2.3RT is just a
constant, with a value of about - 5700 J mol- 1 (at 25°C). Thus each
- 5.7 kJ mol- 1 in a ~Go value corresponds to a factor of lOin the
corresponding equilibrium constant.
RT [Fe 3 +] [Fe 3 +]
E = 0.77 + fi' In [Fe2+] = 0.77 + 0.060 log [Fe2+]
where the coefficient 0.060 V is the value of 2.303 RTj fi' at 303 K
(30 0 C). This tells us that the actual potential E is determined by two
separate quantities, not only by the standard potential EO but also by
the ratio of concentrations of the two components: not surprisingly, a
couple becomes more powerfully oxidizing as the proportion in the
oxidized state increases. Rather confusingly, the Nernst equation is
Thus at any [CH 3CHO]/[C 2 H s OH] ratio the value of E is 0.42 V less
at pH 7 than at pH o.
Of the couples of biochemical importance, some, for example
cytochrome c (Fe3+) \cytochrome c (Fe 2+), resemble Fe3+ \Fe2+ in
having redox potentials that are independent of pH; a few, such as
acetaldehyde\ethanol, show a decrease of 0.06 V for each unit increase
in pH; but many show intermediate behaviour, because one or both
components ionize in the pH-range of interest (as I shall discuss in the
next section). It follows that EO values if applied naively can be a very
misleading guide to redox behaviour at neutral pH values. For
example, cytochrome c(Fe3+)\cytochrome c(Fe 2+) and acetaldehy-
de\ethanol have nearly equal EO values, 0.250 V and 0.258 V,
respectively, but under metabolic conditions the former is a much
more powerfully oxidizing couple than the latter, because of the large
decrease in potential shown by the latter on raising the pH. To avoid
this confusion biochemists commonly redefine the standard as the
observed potential at a specified pH (usually pH 7) when the two
components are at equal concentrations. This potential, often called
the mid-point potential, is given the symbol E 01 , and the Nernst
equation becomes
E = E 01 + RT In [ ox]
n $' [red]
Proton concentrations are not included in this equation and so it is
42 / Exponents and Logarithms
valid only at the pH at which E 01 is defined. At pH 7 we have E 01 =
- 0.163 V for acetaldehydelethanol, but still EOI = EO = + 0.250 V
for cytochrome c(Fe3+)lcytochrome c(Fe2+).
If two redox couples are allowed to interact, whether by being
mixed together in the presence of a suitable catalyst or by connecting
them electrically, they can react stoichiometrically until their poten-
tials are identical. For example, in the reaction catalysed by alcohol
dehydrogenase the acetaldehydelethanol couple reacts with the
NAD+INADH couple:
E 01 = - 0.320 V (pH 7, 30°C)
When the two E values are equal, i.e. at equilibrium, we can write
[CH 3 CHO] [NAD+]
- 0.163 + 0.03 log [CzHsOH] = - 0.320 + 0.03 log [NADH]
for which EO = - 0.11 V and the pKa ( = -log Ka) of the oxidized
form acetic acid is 4.73. Omitting water (i.e. following the convention
that the standard state of water is the state that exists in the solution of
interest), the Nernst equation can be written as follows:
and so
Finally, one can form an impression of the behaviour between the two
limiting regions by putting [H +] = Ka:
E0 = -O.lI-0.06pKa-0.0310g2= -0.40V
1
([H+]=Ka)
From these last three expressions it is clear that the dependence of the
mid-point potential on pH is as shown in Fig. 2.2: at low pH, EOI
decreases linearly by 0.06 V for each unit increase in pH; at high pH it
decreases linearly by 0.09 V for each unit increase in pH; the two linear
-004
-0·5
-0·6L----L--~----~--~----~---L--~L--
o
Fig. 2.2. Dependence of mid-point potential on pH for the acetic acid Iacetaldehyde
couple.
Problems / 45
regions are joined by a curve, but the extrapolated straight lines
intersect at pH = pK a•
I have discussed redox potentials and their dependence on pH in
rather more detail than one would usually expect in an elementary
mathematics book because it is a topic that causes a great deal of
misunderstanding and muddle, not only among students but also,
regrettably, among the authors of biochemistry textbooks. Most of
this misunderstanding can be avoided by realizing the following: (i)
the Nernst equation at pH 0 is normally expressed in terms of a single
oxidizing form and a single reducing form; (ii) in many couples of
03
02
(pyruvic aCid)
0-1
,
\~
\\
,
0-0 t\ \~
pka
(lactic acid)
01
0-2~--~--~--~--~--~--~--~
o 2 3 4 5 6 7
pH
Fig. 2.3. Dependence of mid-point potential on pH for the pyruvic acid Ilactic acid
couple.
46 / Exponents and Logarithms
interest ionization of one or both forms occurs between pH 0 and pH
7; (iii) in ordinary experiments one controls the total concentrations of
oxidized and reduced forms not the concentrations of any particular
ionic states; (iv) one therefore needs to re-express the Nernst equation
in terms of total concentrations; (v) this can be done by means of the
usual equations for ionic equilibria. By following these rules it is quite
straightforward to deal with couples in which more than a single
ionization must be considered. For example, the couple pyruvic
acidllactic acid,
2.15 Problems
(2.1) Evaluate the following expressions:
(a) 33 (b) 4- z
(c) 271 (d) 4- t
(e) 16° (f) 10- 3
(g) 3.29 6 X 3.29- 4 X 3.29- 3 x 3.29 (h) 2.37
(i) 4! (j) antilog 3 3
(k) log 100 (I) logz 8
(m) antilog 3
30
!I
o~--~--~----~--~----~
o 3 4 5
x=2 x
equation lie on a straight line, as shown in Fig. 3.1, and indeed any
equation of the form
y = a+bx
where a and b are constants, defines a straight line if y is plotted
against x. This more general straight line is shown in Fig. 3.2.
It is obvious that y = a if x = 0, because then the term bx is zero.
Consequently the straight line cuts the y-axis a distance a from the
point (0, 0), the point where the axes intersect, which is known as
the origin. Because of this relationship a is known as the intercept
on the y-axis. The intercept on the x-axis is less obvious but may
readily be found by putting y = 0: this gives a + bx = 0, and so x =
- alb, which is therefore the intercept on the x-axis.
!I=a+bx
!l2
!l1
t o X2 x
-alb
Thus
!I 60
!I = 2 + 3x + X 2
40
30
o 2 3 4 5 6
x
Fig. 3.3. Plot of the parabola y = 2 + 3x + x'.
Then
y+Ay = 2+3(x+Ax)+(x+Ax)2
= 2 + 3x + 3Ax + x 2 + 2xAx + (AX)2
Subtracting, we have
Ay = 3Ax + 2xAx + (AX)2
So
Ay
-= 3+2x+Ax
Ax
As we have said nothing about how large a change in x we are
discussing, we have no grounds for simplifying this expression, but
suppose we specify that Ax is a small change in x, i.e. small compared
with (3 + 2x), and let us emphasize this by replacing Ax with Jx (a
lower-case delta instead of a capital). Then in the similar expression
Jy
- = 3+2x+Jx
Jx
we can justifiably make the approximation of ignoring Jx in
comparison with (3 + 2x):
52 / Differential Calculus
by
- ~ 3+2x
bx
If we make bx so small that it is indistinguishable from zero the
approximation becomes exact. We can express this by saying that the
limit of by/bx as !5x approaches zero is 3 + 2x, or
Lim ~y = 3 +2x
{)X~O uX
dy == Lim !5y
dx {)x~o!5x
This is called the derivative of y with respect to x; it can also be
described as the result of differentiating y with respect to x. Quantities
that are arbitrarily close to zero while remaining distinguishable from
zero are called infinitesimal.
d
-(Ax 3 ) = 3Ax 2
dx
d d
- (A/x) = -(Ax- 1 ) = - Ax- 2 = - A/X2
dx dx
d L
-(AX2) = rAX-2
1 L
= A/2x't
dx
and so on. With the aid of the rules for differentiating combinations of
terms that I shall discuss in the next three sections, this general rule
permits the differentiation of almost all the expressions encountered
in elementary biochemistry. Only two other simple cases are worth
committing to memory, the derivatives of logarithms and
exponentials:
d
dx (Alnx) = A/x
d:(Ae X ) = A~
54 / Differential Calculus
3.4 Derivatives of sums and products
Suppose we wish to differentiate a sum, such as y = u + v, where u and
v are both functions of x that can readily be differentiated with respect
to x. By the arguments we have used before, a small change bx in x
brings about changes in u and v as follows:
du
bu ~ -bx
dx
dv
bv ~ -bx
dx
Clearly if y = u + v the change in y must be the sum of those in u and v:
by = bu +bv
So
by du dv
-~-+
bx dx dx
In the limit the approximation becomes exact, i.e.
dy . by du dv
-= Lim - =-+--
dx bx-Obx dx dx
This result can be extended to the sum of three or more functions of x,
and in general the derivative of a sum is equal to the sum of the
derivatives. The derivative of a difference follows directly:
d du dv
dx (u - v) = dx - dx
dy = Lim by = u dv +vdu
dx c!x ~0 bx dx dx
This derivation is illustrated graphically in Fig. 3.4: the change in uv is
represented by the sum of the areas in the three rectangles labelled
v+ ~v ~----------------------~~~~
',t:11:i -.. -. :~~. :~.>
1.I 0V :~ ~I.I ~v "'.
v~----------------------f~:~'~'-
' -"_':~l~'
uv v8u
oL-----------------------I.IL------u-+~o·u----
(a)
uv v v~ u
O~-------------------d'~~J-+-o-u-
(b)
Fig. 3.4. Explanation of the formula for the derivative of a product uv. In (a) the
increments.5v and .5u are appreciable and the shaded area is by no means negligible, but
as the increments become smaller, as in (b), the shaded area decreases very rapidly, not
only absolutely, but also in relation to the areas labelled u.5v and v.5u, which become
smaller much more slowly.
56 j Differential Calculus
u(jv, v(ju and (ju(jv. It is clear from inspection that as (ju and (jv become
small, (ju(jv becomes smaller very much faster, and in the limit can be
ignored.
If we divide the left-hand side of the above equation by y and the
right-hand side by uv (= y) we have
1 dy 1 du 1 dv
--=--+--
ydx udx vdx
but if we accept (without proof) that :x (In x) = Ijx then we can write
1 1
d In x as -dx, din y as -dy, etc., so
x Y
1 dy 1 du 1 dv
--=--+--
ydx udx vdx
dy = -Au- 2
du
But how can we combine the results to get the derivative of y with
respect to x? This is an example of the problem of how to differentiate
a function of a function - y is a function of u, which is itself a function
of x. I shall now consider this as a general problem before returning to
the specific example of the Michaelis function.
For y = F(u), u = f(x), we have
dy du
by ~ -bu, bu ~ -bx
du dx
and so
dydu
by ~ --bx
dudx
58 / Differential Calculus
and
by :!:: dydu
bx dudx
and in the limit
dy ,by dydu
- = Llm-=--
dx 6x .... obx dudx
dy dy du -2 -2 ABx- 2 -AC
-d =-d-d =-Au (-Bx +C)=(1 B 1 C)2
X ux +x+x
dy= u d- v
- (-1)
+v _ldu
-
dx dx dx
in which V-I is a function of a function, with derivative
Higher derivatives J 59
So
du dv
v--u-
dy _2 dv _ 1 du dx dx
-uv -+ v - = ---=---
dx dx dx v2
dy = 3-4x+3x 2
dx
~ (d Y )
dx dx
= -4+6x
dy = B
dx
d2y
dx 2 = 0
3.8 Notation
The notation that I have used hitherto in this chapter is based on that
of the great German mathematician Leibniz. Although it is the most
60 / Differential Calculus
generally useful and the most widely used there are circumstances in
which it becomes cumbersome, and one should have some familiarity
with three other systems that are sometimes used.
(1) In mathematical tables, which one may wish to consult to
obtain unfamiliar results, it is usually obvious what variable one is
differentiating with respect to, and compactness is achieved by writing
d/dx as D. Many people also find this a useful form for memorizing
the standard results for the derivatives of products and ratios:
D(uv) = uDv + vDu
D(u/v) = (vDu - uDv)/v 2
(2) Another system for achieving compactness when it is obvious
what one is differentiating with respect to is to write the first derivative
of y with respect to x as y', and the second derivative as y". This is
especially convenient when one does not want to define a particular
variable (y in this example) as the function, e.g. one can write the first
derivative of f(x) as f'(x), and the second as f"(x).
(3) Calculus was developed independently by Newton in England
and Leibniz in Germany. Newton's dot notation is in general much
less convenient than that of Leibniz, partly because it does not allow
for the possibility that one may wish to differentiate with respect to
more than one variable. Newton was most concerned with functions
of time, or t, and when dealing with time (e.g. in enzyme kinetics) it is
still occasionally convenient to write dy/dt as yand d 2 y/dt 2 asy. Even
in kinetics, however, this notation is much less common than that of
Leibniz.
The various alternative systems are listed in Table 3.1. As indicated
in the table, one can continue differentiating beyond the second
derivative, but the biochemist rarely wants to do this and so I have not
Table 3.1 Notations for expressing derivatives
60 slope =0 at maximum
v
°0~-----10~----~2~0-----3~0------4~0~
s
Fig .3.5. Plot of v against s for a system showing substrate inhibition. The plot
illustrates the point that we can define a maximum as a point at which the slope is zero,
but this definition would be incomplete because it would also include minima (see
Fig. 3.6).
62 / Differential Calculus
substrate inhibition. A striking feature of the curve is that instead of
increasing indefinitely with s (as in simpler kinds of kinetic behaviour)
v increases to a maximum and then decreases as s is increased further.
A major use of the differential calculus is that it enables one to
calculate where a maximum will occur. From inspection of the graph
it can be seen that at the maximum the slope is zero. This is true in
general, i.e. at any maximum the slope is zero (although the converse is
not, because a point of zero slope may also represent a minimum).
Accordingly, we can find the maximum in Fig. 3.5 by differentiating v
with respect to s and finding a value of s at which dv/ds is zero. This
can be done by applying the rule for differentiating a ratio:
and substitution of this result into the original equation shows the
maximum value of v to be
Maxima and minima j 63
The Michaelis function that we considered earlier in this chapter is of
exactly the same form as this equation for substrate inhibition,
because if
A
y = 1 + (Bjx)+Cx
~ = dy ~ dlnx =2.303xdy
d log x dx d In x d log x dx
(because dlnxjdx = 1jx and logx = 2.3031nx approximately).
Thus, provided x is finite,a zero value of dyjdx must occur at the same
value of x as a zero value of dyjd logx, and so the maximum in the
plot of y against log x occurs at the same values of x and y as in the
plot of y against x.
Exactly the same criteria can be used to find a minimum of a
function, because this also corresponds to a zero value of the first
derivative (Fig. 3.6). Clearly, therefore, the existence of such a zero
value does not guarantee that a maximum has been found. It shows
only that one has found a stationary point, a more general term that
embraces both maxima and minima. In most elementary biochemical
circumstances this is not a problem because it is usually obvious
whether one is dealing with a maximum or a minimum. In the more
general case there may be ambiguity, but this can be resolved by
examining the sign of slope on either side of the stationary point. At a
64 J Differential Calculus
!I
.~--""-------
slope =0 at minimum
Fig. 3.6. Minimum of an arbitrary curve. Again, the slope is zero at the minimum
(see Fig. 3.5), but it is increasing through zero as x increases.
v = Vsj(Km + s)
dy = 9 -12x + 3x 2
dx
d2 y
dx 2 = -12+6x
!I 60
40
Point of inflexion
20 Maximum
t
dy 40
dx
20
0r-~~~~~~-----------
O~------~~-------------------
I I ~
o 2 3 4 5
x
6
Fig. 3.7. Point of inflexion. At a maximum or minimum the first derivative is zero;
at a point of inflexion the first derivative is a maximum or a minimum and the second
derivative is zero.
[NaOAc]
= pKa +0.4343ln [BOAc]
x
= pKa + 0.4343ln - A
-x
because as long as x is less than A essentially all of the added OB-
ions convert BOAc molecules stoichiometrically into OAc- ions.
68 / Differential Calculus
(0.4343 is the reciprocal of 2.303, or I/ln 10). Differentiating with
respect to x, we have
dpH -_ 0.4343
d x
. (A -x) [(A -x)-x(2
x (A -x)
-1)J
0.4343 A
x(A -x)
Thus a buffer is most effective at the pH equal to the pKa of the acid
involved in the titration.
,
, /
, /
o
t
F.ig. 3.8. Sketch of a complex curve (defined in the text) obtained by considering the
value of q and its first and second derivatives with respect to t at various values of t. The
broken line is a straight line and represents the limit approached by the curve as t
increases.
70 / Differential Calculus
behaviour of the slope:
dq k+2k+3eO k+ 2k+ 3eOexp[ - (k+2 +k+3)t]
dt k+2+k+3 k+2+k+3
k+ 2k+ 3eo {1-exp[ - (k+2 + k+3)t]}
k+2+k+3
d 2q
dt 2 = k+2k+3eOexp[ - (k+2 + k+3)t]
As exp[ - (k+ 2 + k + 3)t] must have a value between 1 (at t = 0) and 0
(for t -> CX)) for any positive value of t, it follows that dq /dt cannot be
negative and is zero only at the origin. Thus the plot of q against t
cannot contain a maximum or a minimum except at the origin.
Moreover d 2q/dt 2 is positive at all values of t, although its value
becomes negligible at large values of t. When all of this information is
combined it becomes clear that the form of the curve must be as
shown in Fig. 3.8.
3.13 Problems
(3.1) Differentiate each of the following expressions with respect
to x:
(a) y = x3 (b) y = X 1 / 2
(c) Y = 5e (d) y = e 5x
(e) y = 31nx (f) y = X 1 / 2 + X- 1 / 2
(g) y = In x - 2X2 (h) y = x 2 ex
(i) y = In (x 3 ) (j) y = (x 2 + 3)1/2
(k) y=(x+l)/(x-l)
(3.2) Two of the derivatives obtained in problem (3.1) should be
identical. Why?
(3.3) Each of the following functions displays one minimum and one
maximum. Identify both:
(a) y = 7 + x 2 - 8x + In (XO)
(b) y = 15-x-l/(x-5)
(3.4) Michaelis and Menten analysed their data in terms of the
equation that bears their names, v = Vs/(K m +- 8), by plotting v against
log s. They used the facts that this plot has a point of inflexion at
Problems / 71
s = Km and that the maximum slope is 0.576 V. Prove that these facts
are correct.
(3.5) Differentiate the equation v = Vs/(K m + s) with respect to s.
What are the values of dv/ds when (a) s = 0; (b) s = Km; (c) s
approaches infinity?
(3.6) Consider a pH profile defined by the following equation, in
which h represents the hydrogen-ion concentration [H+] and y is a
constant:
y
Y= 1 + (h/Kd + (K2/h)
(a) At what value of h is y a maximum?
(b) Show that this solution is equivalent to pH = (pK 1 + pK 2 )/2.
(3.7) Under certain limiting conditions the binding of a small ligand
to a protein can be approximated by an equation of the following
form:
KXh
Y=------,-
1 +KXh
in which Y is a measure of the extent of binding, x is the concentration
of ligand, and K and h are constants. Putting K = 1 (arbitrary unit),
h = 2, sketch the form of curve given by this equation for a plot of Y
against x (for positive x).
(3.8) For the equation given in problem (3.7), what would be the slope
of a plot of log [Y/ (1 - Y)] against log x?
(3.9) Differentiate the following equation with respect to t, treating p
and t as variables and V, Km and So as constants:
Vt = p + Kmln [so/(so - p)]
4 Integral Calculus
!f
~ ----------------------
Xo
x
Fig. 4.1. Area under a curve. Ignoring the shaded area, the increase in A as x
increases from XI to XI +"x is YI"X.
calculate the area under any curve whose algebraic expression can be
recognized as the derivative of some other function. First, however,
we must see how a large area can be treated as the sum of many small
areas.
As long as bx is small enough for the relationship bA ~ ybx to be a
good approximation we can write
AA = Lim
n- 00
I ybx = fYdX
then
Sydx = x 3 + Xl + 5x + IX
where IX is a constant, because differentiation of this second function
with respect to x gives the first function.
r
a definite integral:
(3x 2 + 2x + 5)dx
J(3x 2 + 2x + 5)dx = x 3 + x 2 + 5x + IX
= 7+1X at x=l, 22+1X at x = 2
da
-= -ka
dt
J =tt da
- = -k
1=0 a
ftt dt
0
[ Jt=tt
Ina,=o=-kt o
[Jtt
If we define a = ao at t = 0 and a = al at t = t l , this becomes
Ina l -lnao = -ktl
which can be rearranged to
a l = aoexp( -ktd
f~
da
= -kSdt
and so
In a = - kt + In ao
and rearranging gives us the same result as before,
a = ao exp ( -kt)
fAdX = Ax+a.
fAXdX = tAx2 + a.
fAX2dX = tAx 3 + a.
f.
or in general,
Axi+ 1
Ax'dx = - - - + a.
i+ 1
The rule for terms of polynomials is as follows: increase the index by 1,
divide by the new index, and add a constant.
The integral ofa sum (or difference) is also straightforward: ifu and
v are functions of x then
J(u+v)dx = Judx+ Jvdx
Finally, two other simple functions occurred as derivatives in Chapter
3 and consequently can readily be integrated:
f exp(x)dx = exp(x) + a.
f _dX---,--
A+Bx
= ~ In (A + Bx) + a.
B
Simple integrals / 79
This can readily be confirmed by differentiating the right-hand side as
a function of a function. Products of such functions also occur in
kinetics and can be integrated by realizing that they can be expressed
as sums of simple fractions. Thus:
AI(a2+b2x)+A2(al +b1x) =1
The identity sign ( = ) emphasizes that the relationship must apply
regardless of the value of x, and we can use this requirement to assign
values to the two constants. For example, because the relationship
must apply when x = 0 and also when x = 1 it follows that
and
AI(a2+b2)+A2(al +b l ) =1
Thus we have two simultaneous equations that can be solved for the
unknowns Al and A 2 • The two fractions produced in this way, which
are known as partial!ractions, can be integrated separately as above.
Another integral that is often needed in kinetic problems is the
following:
fAX:~X
The easiest way to deal with this is to define u = A + Bx, so that
du = Bdx and x = (u-A)/B
and then
xdx = _1 f(U - A)du
f A+Bx B2 u
= ;2 [
U - A In uJ + 0:
80 / Integral Calculus
f.
Table 4.1 Common integrals
AX;+l
Ax'dx=--+a (for any value of i except -1)
i+ 1
f dX
~ = Inx+a
f xdx 1
- - =2[A +Bx-A In(A+Bx)]+a
A+Bx B
In all cases x is a variable. A. Band i are constants. u and v are functions of x. and cr. is a constant of
integration. All of the integrals in this table apart from the last are worth memorizing.
f a
1
dx
- =
X
F(a)
Integral of l/x / 81
!I
°O~----~--------~a----~------------x~
o h
Fig. 4.2. Plot of the hyperbola y = l/x. The u-axis is an alternative abscissa axis
related to the x-axis such that u = x/a. The area under the curve between x = a and
x = ab is unaffected by calling it the area between u = 1 and u = b.
without specifying anything about the nature of F(a). Then the area
bounded by the curve, the x-axis and the vertical lines x = 1 and x = a
is given by
J-a
1
dx
x
= F(a)
Jab dx = F(ab)
1 X
J
when x = a and so
ab dx b du
J a -X = 1 -=F(b)
U
82 / Integral Calculus
So
F(ab) = F(a)+F(b)
fdX
~ = Inx +IX
(although it does not follow from the argument used here that the
result is In x rather than, for example, loglox; this is true nonetheless,
for reasons that I shall not discuss here).
The logarithm of a constant is another constant. It is legitimate,
therefore, and often convenient, to write the last result in the
following way (with IX replaced by In A):
f dX
~ = In x + In A = In Ax
dp = kab
dt
where a, band p are the concentrations of A, Band P, respectively, at
time t and k is a constant. Although this relationship is easy to arrive at
and easy to understand it does not tell us very much as it stands; in
Differential equations / 83
particular, it does not tell us the extent of reaction as a function of
time. To convert it into a more informative form we must first
decrease the number of variables from 4 to 2. This can be done by
means of the stoichiometric requirement that (a + p) and (b + p) must
both be constants. If we define these as ao and bo respectively (the
values of a and b when p = 0), the original equation can be written
dp
dt = k(a o - p)(b o - p)
f (a o - p)(b
dp = fkdt
o - p)
Solving the differential equation thus resolves itself into a problem of
solving two integrals. The right-hand side is trivial and the left-hand
side may be solved by partial fractions, so that
In (a o - p) In (b o - p) k
- a o -b o
+ ao -b o
= t+1X
L I k-2
eo-x so-x-p x p
dp
dt = k+2x-L2(e O -x)p
!J
Fig. 4.3. Trapezium method for estimating the area under a curve. The shaded area
h
between x = Oand x = his 2(yo + Yl)' but this is the area of the complete trapezium and
thus significantly overestimates the area under the curve, which ought to exclude the
diagonally shaded segment. Similar inaccuracies result from treating the other strips as
trapezia, and the inset shows an example of significant underestimation. The alternative
method discussed in the text, 'Simpson's rule', is much more accurate and the parabolic
arcs that it assumes are hard to distinguish from the true curve, and for that reason are
not shown in the figure.
and
bh = Yl-Yo-ch 2 = 2Yl -~YO-ty2
Integrating the original quadratic, and substituting these values of a,
bh and ch 2, we have
88 / Integral Calculus
and so on. Notice that every odd-numbered Y appears only once,
whereas every even-numbered Y apart from Yo and Yn appears twice, so
the total area A is given by
h
A =3 [Yo + 4(Yl + Y3 + ... + Yn- 1)+ 2(Y2 + Y4 + ... + Yn - 2) + Yn]
This formula is known as Simpson's Rule. It is hardly more difficult to
apply than the trapezium method I considered first, and it is much
more accurate. For most curves that are not highly complex the
formula is adequately accurate if the number of strips is about ten,
and in most experimental examples inaccuracies in measuring the Y
values is a more important source of error than the formula.
4.7 Problems
(4.1) Integrate the following:
(a) f: 3xdx
(b) f2 dx
til
I X
(c) 3x 2 dx
(d) t22 x 3 dx
f (A + Bx)dx
C+Dx
(Note: this can most easily be done by separating it into two
integrals.)
(4.5) The Michaelis-Menten equation can be written in the following
form:
dp Vs
dt Km+s
in which p and s are the concentrations of product and substrate,
respectively, at time t, and V and Km are constants. Defining s = So
and p = 0 at t = 0, use the one-to-one stoichiometry of the reaction to
eliminate s and solve the resulting differential equation to get an
expression for Vt in terms of p.
(4.6) The equation considered in problem (4.5) takes no account of
the possibility of inhibition by the accumulating product. Show that
the following equation, in which Kp is a constant and the other
symbols are as in problem (4.5), can be integrated to provide an
equation of the same form as the solution to problem (4.5):
dp Vs
dt Km(l + pIKp) + s
What would be the form of a plot of a plot of tlln [so/(so - p)] against
pJln [so/(so - p)]?
(4.7) The relationship between the intensity oflight 1 and the distance
x through which it has passed in an absorbing solution can be
expressed as follows:
dl
- = -kcI
dx
where k is a constant and c is the molar concentration of absorbing
substance. Putting 1 = 10 at x = 0, integrate this expression to obtain
an expression for 1 in terms of x. If the molar absorbance A is defined
such that
log (loll) = Acx
what is the relationship between A and k?
90 I Integral Calculus
(4.8) In a spinning ultracentrifuge cell the outward flow rate of the
mass of solute m is given, ignoring diffusion, by
dm w 2 x(1- Vp)M ADe
-= '
dt RT
where t represents time, x the distance from the axis of rotation, M,
the relative molecular mass of the solute, A the cross-sectional area of
the cell, D the diffusion coefficient, e the concentration of solute in gil,
and the other symbols represent constants that are either known or
easily measurable. The effect of diffusion can be represented by
dm de
- = -AD-
dt dx
After spinning for a long period the system achieves equilibrium and
becomes time independent. Then the net flow rate, given by the sum of
the two expressions, is zero at all points. Obtain an expression relating
e and x at equilibrium and hence suggest a plot that would allow M, to
be conveniently measured.
(4.9) The following values of y at various values of x show
measurements of the height of a recorder trace in a chromatographic
separation. Estimate the areas under the two peaks by using
J J
Simpson's rule to evaluate ~ ydx and ~ 7 ydx. In each case compare
the results with four strips with those obtained with eight.
x y x y x y
4+x-4+2x = 8-2x-4+2x
i.e.
3x = 4
The operation of subtracting, for example, 4 from both sides is clearly
equivalent to 'bringing it from one side to the other of the equation
and changing its sign'. Although this is a convenient and proper way
of carrying out the operation one should realize that it is no more than
an example of applying the same operation to both sides of an
equation.
Continuing with the above example we may divide both sides of the
91
92 / Solving Equations
last equation by 3 to obtain the solution:
x = 4/3 = 1.33 approx.
Equations that do not seem to be linear at first sight can often be
rearranged into linear form by application of the same methods. For
example, the equation
~=3
5+2x
becomes linear after both sides of the equation are multiplied by
5+2x:
8x = 15 +6x
which can be solved as before to give x = 7.5.
The only difficulty that may arise in solving this sort of equation is
that it may turn out to be singular, which means that it contains less
information than it seems to contain. Consider, for example, the
following equation, which appears to be an ordinary linear equation:
5x+2-3(x+8) = 2x-22
If, however, we collect terms, we find
(5-3-2)x = -22-2+24
i.e. Ox = 0, which is true regardless of the value of x and consequently
tells us nothing about the value of x. Examples as simple as this seem
rather artificial and trivial, but singular equations can be a serious
problem because they can appear in the last stages of solving a more
complex equation or set of equations. This often happens in the
analysis of experiments that have been badly planned so that they do
not in fact provide the information they were intended to provide, and
I shall consider some more realistic examples later in this chapter.
5.4 Determinants
The methods I have described for solving simultaneous equations
expressed in numerical terms can also be applied to equations in
which the coefficients are not assigned numerical values. Consider the
following general pair of linear simultaneous equations:
A1x+B1y+C 1 = 0
A 2x+B 2y+C 2 = 0
Multiplying the first by A2 and the second by Al we have
AIA2X+A2BIy+A2CI =0
AIA2X+AIB2y+AIC2 =0
and after subtraction of the first equation from the second,
(A 1B 2 - A 2B 1)y + (A 1C 2 - A 2Cd =0
Hence
Determinants / 97
and, similarly,
er IA~~' I~ lA,
expressed in a particularly elegant way, as follows:
IB,X I B, I
B2 C2 A2 C2 A2 B2
The denominators of these expressions are called determinants. Any
determinant is written as a square array of numbers or algebraic
symbols between vertical lines and it has a numerical or algebraic
value that can be found by applying a general rule. Although one can
have any size of square I shall only consider 2 x 2 determinants, which
are not only particularly simple but are also the most important for
our purposes: the value of such a determinant is the product of the
top-left and bottom-right elements minus the product of the bottom-
left
or
=- =- = =
x/1O = - y/30 = -1/14
so x 10/14 5/7, y 30/14 15/7.
It often seems more natural to write each of the original set of
equations with the constant on the right-hand side rather than as the
last term on the left-hand side, i.e. (for the same example):
2x+3y = 5
4~- y=-5
If this is done the solution can be written in the same way as before
except that the sign of the constant is the opposite of what it would be
if the equations were written in standard form. For two simultaneous
equations this means that the last numerator is - 1 rather than 1:
x -y -1
Apart from the third term this is the same as the left-hand side of the
equation to be solved, so, subtracting the latter (which has a value of
zero) we have
(x + b12a)2 = b 2 /4a 2 - cia = (b 2 - 4ac)/4a 2
and hence
± J(b 2 -4ac)
x+ b12a =-~-2-a--
fix) 3
Or---4---~----+----+~~r----r--~r---4-----~
-I
Fig. 5.1. Graphical solution of the equation x 2 + In x = 7. The line is a plot of the
functionf(x) = x 2 + In x -7, which must be zero when the equation is satisfied. Thus x
= 2.47 is an approximate solution.
i=-Kj 0 i
Fig. 5.3. Dixon plot for determining a competitive inhibition constant: this is an
application of the method illustrated in Fig. 5.2.
106 / Solving Equations
Another and somewhat simpler application of the graphical
solution of simultaneous equations in enzyme kinetics is found in the
direct linear plot for determining the parameters Km and V in the
Michaelis-Menten equation. If the rates VI and V 2 at substrate
concentrations s I and S2' respectively, are given by the following pair
of equations:
VS I
VI = ---''--
Km+sl
VS 2
V2 =
K m+s 2
then Km and V can be evaluated graphically by rearranging the
equations to show the dependence of Von Km:
V = + (VI/sl)K m
VI
V = V 2 + (V 2 /s 2 )K m
Both of these define straight lines when V is plotted against Km. The
first may most easily be drawn as a straight line with intercepts - S Ion
the Km axis and VI on the V axis, and the second similarly (Fig. 5.4).
-5,
flXl
4 tangent drawn
at x= 3
o~-----------+------------~----~----~'-~~
2
-2
-4
tangent drawn
at x = I
-6
-8
'- flxl=x'+tnx-7
Fig. 5.5. Newton's method for solving the equation x 2 + In x -7 (see Fig. 5.1). The
starting guess x = Xo = 1 provides the tangent that crosses the x-axis at x = XI = 3.
This improved guess for the solution provides a second tangent that crosses the axis at
x = x 2 = 2.51. Further tangents calculated in the same way give, successively,
x = 2.469, 2.46904, 2.4690423 .... This method permits much greater accuracy than
one can hope to achieve with a graphical method.
Although the last two of these are indeed negligible, the value for G l -
certainly is not. This alone should indicate that it was wrong to ignore
it, but in addition we can calculate a total glutathione concentration of
0.0934M, which exceeds the correct value of 0.08M by a sufficient
margin to indicate that the initial approximation was not acceptable.
5.9 Problems
(5.1) Rearrange the following equations so that they express x in
terms of y:
(a) y = x + 3 (b) y = 2X2 + 5
(c) y = In5x (d) y = 8 exp (3x)
114 / Solving Equations
x+4
(e) Y = x-I (f) xy+4 = 3x
(b) 14.71
6.43
1.281;
5.11
(c) 1-1
3
-21;
-7
(d) I; ~I
Problems j 115
(e) 0 0 0;
136
(f) I-2.22
2.31 1.1 8 1;
0.47
(g) 1.7 6.4 3.1
2.3 8.1 1.2
4 2 1.7 6.4 3.1
=-[g_lu_c_o--=se---:6_-_ph_o---=s=-=ph_a_te-=]~[_A_D_P-=-] = 230
[glucose] [ATP]
dV R
dT p
dV = 0
dT
R V
(~)p p T
V
(~;)T = - ~ = P
R p
(~)v V T
There are two points of symbolism to note here. First, as these are not
ordinary derivatives, we cannot use the ordinary symbols d VldT etc.
for them. Instead of the ordinary d we use a so-called 'curly d', written
as a, for partial differentiation. (This is not a Greek delta, incidentally,
which would be (j and is used in mathematics to represent a small but
finite increment, as in Chapter 3.) Second, we can indicate what
variables are being held constant during the partial differentiation by
showing them as subscripts, as in the expressions above. These are
often obvious, however, and may be omitted when no doubt is likely.
In thermodynamics they are frequently included as an aid to clarity.
In speech we call an expression such as aVI aT 'partial d V by partial
dT', although the second 'partial' is often omitted.
v. =
VS.I (1 + e.)I
I Km+s j
where V and Km are (unknown) constants and (1 + e;) is a factor
representing the effect of experimental error. (We could also have
included experimental error by adding an error term to the basic
expression rather than by multiplying by an error factor. As my
Least-squares fitting of the Michaelis-Menten equation j 121
purpose in using this illustration is mathematical rather than
biochemical it is perhaps unnecessary to discuss why I think the factor
is preferable other than that it leads to much simpler mathematics.
However, it happens that it is also a more realistic way of representing
the kind of error that commonly occurs in enzyme kinetic
experiments.) In practice we would not know V and Km and would
therefore wish to estimate them from the observed values of Si and Vi;
we might prefer to measure them instead, but the unknown magni-
tudes of the error terms prevent this.
By rearranging the above equation we can readily express each e i as
follows:
= (Km/V)(vds;) + (vdV)-1
= ax i +bv i -l
Two points should be noted about these expressions: first, as all of the
summations are from i = 1 to n we can omit the limits from the
summation signs without risk of ambiguity; second, in the first and
second summations on the right-hand side a and b (respectively) are
factors of every term and so can be multiplied once after summing
rather than n times before. We can also partially differentiate with
respect to b in exactly the same way, treating a now as a constant:
as
ab = "[2v.
L.. I
(ax. + bv. -1)]
I I
= 2a~x.v. + 2b~V2 - 2~v.
I I I I
b ~X?~Vi-~XiVi~Xi
- ~V? ~X? - (~XiVY
m LV?/S;LV;-(LV?/SY
IIV = LV?/S; LVj - LV?/SjLV)Sj
LV?/S; LV; - (LV?/SJ2
and it is a simple matter to rearrange these into expressions for Vand
Rm:
v= LV?/S; LV; - (LV?/SY
LV?/S; LVj - LV?/Sj LV)Sj
R LV; LV)Sj - LV?/SjLV j
m - LV?/S;LVj-LV?/SjLV)Sj
These are now the 'best' values of V and Km in the sense that they
make the function S that we defined as our criterion of closeness of fit
as small as possible. Before leaving this result it is advisable to check it
for algebraic errors by the dimensional considerations we saw in
Chapter 1. As Vis a rate, with dimensions of (concentration) (time)-I,
its expression must also be a rate: the first sum in the numerator
contains terms of the form v2/s2, with dimensions of (concentration)2
(time) - 2 (concentration) - 2, or (time)- 2; the second contains terms of
the form v2 , with dimensions of (concentration)2 (time)-2; thus the
product of the first two sums has dimensions of (concentration)2
(time)-4. In the same way the squared sum in the second half of the
numerator also has the dimensions of (concentration)2 (time)-4 and
can therefore be legitimately subtracted from the first product. By the
same sort of analysis we find that the denominator also contains a
dimensionally acceptable subtraction and has dimensions of
(concentration) (time)- 3. Finally, the expression as a whole has the
dimensions of the numerator divided by those of the denominator, i.e.
(concentration) (time) - 1, as it must if it is to be a rate. Thus our
expression for V is dimensionally correct (though this does not
guarantee that it does not contain other kinds of error), and in the
same way we can show that the expression for Rm is a concentration
and is also dimensionally correct.
In this discussion I have treated (concentration) as a fundamental
dimension: this is not strictly correct, as a concentration is a derived
quantity with dimensions of (amount of substance) (length)-3.
124 / Partial Differentiation
Properly therefore I ought to have written (amount of
substance) (length)- 3 wherever I wrote (concentration), but this
would not have made the discussion any easier to follow and indeed
might have encouraged the kinds of mistakes we were setting out to
avoid and recognize. For many purposes there is no harm and a
definite advantage in discussing dimensions in terms of composite
quantities such as concentrations and even rates.
6.4 Problems
(6.1) For the function z = (x 2 + y2)1/2;
(a) determine (:: )y;
(b) determine G: )z;
(c) determine (~~ )x;
(d) show that (::)y (~:)z (~~)x = -1;
a az a az
(e) show that ayax = ax ay"
(6.2) For the function pV = RT (where R is a constant), show that
(Km+So-P)d P = fVdt.
f
So.-P
The left-hand side can readily be integrated by substituting
A = Km + so' B = - 1,x = p, C = so, D = - 1 into the solution
to problem (4.4). It is, however, much simpler to note that So - P
appears in both numerator and denominator, so that we can
write
f:~d: + fd P = f Vdt
Briggs, H., 29
Absorbance, molar, 89, 115
Buffer capacity, 66-68
Accuracy (contrasted with
precision), 21 Calculators, disregard of priority
Acetaldehydelethanol couple, rules, 5
40-42 Calculus
Acetic acidlacetaldehyde couple, differential, 48-71, 117-125
43-45 integral, 72-89
Adenylate kinase equilibrium, Chain rule, 58
101-102 Characteristic, 29
Algebra, scope of, 1-2 Common logarithms, 28-30
'Algebraic notation' (as used in Competitive inhibition, 103-105
electronic calculators), 5 Complex number, 101
Amount-of-substance Computer languages, 3, 5
concentration, 9-10 Concentration, 9-10
Analytic geometry, see co-ordinate Constant, 8-9
geometry, 48-50 Constant of integration, 74, 77, 83
Antilogarithms, 28 Co-ordinate geometry, 48-50
Approximate methods, 109-113 Cubic equation, 110
Area under a curve, relationship to Curvature, measured by second
integration, 72-74, 85-88, 90 derivative, 59
Arithmetic, scope of, 1-2 Curve
Average, different kinds of, 22-23 area under, 72-74, 85-88, 90
Avogadro's number, 32 slope of, 50
Curve sketching, 68-70
Bacterial growth and logarithms to
Cytochrome c couple, 41-42
base 2, 33-34
Bar notation, 29-30 Decadic logarithms, see common
Bell-shaped curve, 57-58 logarithms, 28-30
Boltzmann principle, 32, 47 Decay, exponential, 34-35
Boyle's law, 117-120 Decimal places, 21
Brackets, 4-5 Decimal point, 3
133
134 / Basic Mathematics for Biochemists