Almeida2002 PDF
Almeida2002 PDF
Received 26 January 2002; received in revised form 11 April 2002; accepted 11 April 2002
Abstract
When the assumption of homoscedasticity is not met for analytical data, a simple and effective way to counteract the
greater influence of the greater concentrations on the fitted regression line is to use weighted least squares linear regression
(WLSLR). The purpose of the present paper is to stress the relevance of weighting schemes for linear regression analysis and
to show how this approach can be useful in the bioanalytical field. The steps to be taken in the study of the linear calibration
approach are described. The application of weighting schemes was shown by using a high-performance liquid chromatog-
raphy method for the determination of lamotrigine in biological fluids as a practical example. By using the WLSLR, the
accuracy of the analytical method was improved at the lower end of the calibration curve. Bioanalytical methods data
analysis was improved by using the WLSLR procedure. 2002 Published by Elsevier Science B.V.
practising bioanalysts. The steps to be taken are of variance as a function of concentration [1,2,4–
described and illustrated with a data set obtained 6,10].
during the validation process of a high-performance In order to counteract the greater influence of the
liquid chromatographic (HPLC) method [8]. greater concentrations on the fitted regression line,
the weighted least squares linear regression is used.
The expression to be minimised now takes the
following form [1]:
2. Simple and weighted linear regression
models—background ( y observed,i 2 y predicted,i )2
SS 5 O ]]]]]]]
s 2i
(2)
The objective of a regression analysis is to find a
deterministic model which allows a prediction of the
where s 2i is the variance of the standard point.
values assumed by the dependent variable ( y) when
Taking the objective of the WLSLR into consid-
independent variables (x n ) are known or fixed. That
eration, the most appropriate weighting factor, w i , is
model defines the kind of relationship between
the inverse of the variance of the standard point:
variables. Since the experimental values hardly fit
the mathematical model, the methodology of mini- 1
w i 5 ]2 (3)
mising the sum of squares (SS) of the deviations si
between the data and the assumed model should
yield the best estimate of the model parameters. This However, this weight is usually impractical, and
is called the ‘‘method of least squares’’ and the other empirical weights based on x-variable (con-
expression to be minimised is: centration) or y-variable (response) may provide a
simple approximation of variance [1,2].
SS 5 O( y observed,i 2 y predicted,i )2 (1)
3.2. Choice of the weighting factor estimated values for the a and b parameters of the
weighted regression equation can be obtained by the
In the light of the evidence of the heteroscedastic following modified formulas:
situation, the following step should be the choice of
the weighting factor, w i . As it is not suitable to O O O O
wi ? wi xi yi 2 wi xi ? wi yi
calculate the inverse of variance in laboratory
routine, taking into account the fact that it requires
O O O
b 5 ]]]]]]]]]
w i ? w i x i2 2s w i x id
2 (8)
such as 1 /x 1 / 2 , 1 /x, 1 /x 2 , 1 /y 1 / 2 , 1 /y and 1 /y 2 where (x i , y i ) is the ith data pair of n total data pairs
should be studied. and w i is the weighting factor chosen.
The best weighting factor is chosen according to a The degree of dependence established between the
percentage relative error (%RE), which compares the two variables, expressed by the correlation coeffi-
regressed concentration (Cfound ) computed from the cient (r-value), can be obtained by the following
regression equation obtained for each w i , with the modified formula:
nominal standard concentration (Cnom ):
Cfound 2 Cnom
O O O O
wi ? wi xi yi 2 wi xi ? wi yi
r 5 ]]]]]]]]]]]]]
%RE 5 ]]]] 3 100
Cnom
(5) O O O
]]]]]] 2
O O
]]]]]]2
œ wi ? wi x i2 2s wi xid ?œ wi ? wi y i2 2s wi yid O
The %RE, evaluated by plotting %RE versus (10)
concentration as well as by calculating %RE sum,
defined as the sum of absolute %RE values, is a A generic diagram of the process described is
useful and sensitive indicator of goodness of fit in represented in Fig. 1.
the evaluation of the effectiveness of a weighting
factor for WLSLR [1].
The best w i will be that which gives rise to a 4. Practice
narrow horizontal band of randomly distributed %RE
around the concentration axis and presents the least To exemplify the procedure described, we selected
sum of the %RE across the whole concentration the intra-day assay data set generated during the
range. validation process of a recently developed HPLC
method for the determination of lamotrigine in
3.3. Weighted straight line equation plasma [8]. Regression parameters were obtained by
introducing the respective formulas on a Microsoft
The model parameters (a and b) of the weighted Excel worksheet.
straight line equation can now be estimated. Conver-
sion of the relations for unweighted least squares into 4.1. Test of homoscedasticity
their weighted counterparts can easily be done by
adding a term w i to any sum and changing any term The plot of residuals versus concentration obtained
n into ow i [1,15]. Knowing that estimated values for for the intra-day assay chromatographic data is
the a and b parameters for unweighted least squares shown in Fig. 2. The residual plot clearly showed
may be obtained by the following formulas: that error was not randomly distributed around the
O O O
n ? xi yi 2 xi ? yi
concentration axis. The F-test (Table 1) also re-
vealed a significant difference between the variances,
O O
b 5 ]]]]]]]
n ? x i 2 s x id
2 2 (6)
when the experimental F-value (Fexp 51.45310 5 )
was compared to the tabled one (Ftab 515.98).
Ox ? O y 2 Ox ? Ox y
2
i i i i i There was evidence that variances were signifi-
n ? Ox 2sOx d
a 5 ]]]]]]]] 2 2 (7) cantly different, thus homoscedasticity was not met.
i i
218 A.M. Almeida et al. / J. Chromatogr. B 774 (2002) 215–222
Fig. 2. Residuals plotted against lamotrigine concentrations for the validation intra-day assay.
A.M. Almeida et al. / J. Chromatogr. B 774 (2002) 215–222 219
Table 1
Test of homogeneity of variances (s 2 ): F-test
Standard A A I.S. Response s2
(mg / l) (A /A I.S. )
0.1 75381 3373583 0.022 5.25310 27
62834 2744443 0.023
68050 2863618 0.024
73212 3190782 0.023
75930 3144582 0.024
15.0 12209191 3244845 3.763 7.60310 22
12953689 2911392 4.449
11682576 2903904 4.023
10940196 2852877 3.835
11147470 2672128 4.172
2 22
s 15.0 7.60 3 10
Fexp 5 ] 5 ]]] 5 1.45 3 10 15
s 20.1 5.25 3 10 27
Ftab (4, 4, 0.99) 5 15.98
A, lamotrigine peak area; A I.S. , internal standard peak area.
The %RE plots for unweighted (model 1) and Unlike pharmaceutical analysis, the concentration
weighted (models 2–7) regressions of the lamot- range in bioanalytical methods is usually dynamic
rigine intra-day assay data across the whole con- and broad, presenting three or more orders of
centration range are shown in Fig. 3. Model 1 clearly magnitude, in order to monitor concentrations effec-
underestimated the concentrations in the lower range tively [16]. When the range in data values is large, it
of the calibration curve, near the limit of quantifica- might be expected that the variance of each data
tion (LOQ). Models 4 and 7 presented the best %RE point might be quite different. In the present case,
distribution scatter at the lower end of the calibration the concentration data ranged between 0.1 and 15.0
curve. mg / l. Therefore, the test of homoscedasticity was
The regression parameters of the calibration curve carried out.
generated for each weighting factor and the respec- Residual plots can be used to evaluate the need for
tive sums of the relative errors are summarised in weighting when unweighted LSLR is applied. If the
Table 2. The weighting factor 1 /x 2 (model 7) data adequately fit the linear model, then the re-
produced the least sum for this data set providing the siduals should be randomly distributed in a horizon-
most adequate approximation of variance. Thus, the tal band centred on the concentration axis [2]. In this
1 /x 2 weighting factor was chosen. study, the residual plots for unweighted LSLR
clearly showed that the residuals were not randomly
distributed around the concentration axis. Instead, an
4.3. Weighted straight line equation increase in variance as a function of concentration
was observed. This need of weighting LSLR was
For the intra-day assay data the calibration curve confirmed with the results of the F-test.
obtained with the weighting factor 1 /x 2 was y 5 The best weighting factor was chosen taking into
0.2804x 2 0.0049, with r50.999. The accuracy of account either the plots or the sums of the %RE
the data, expressed by bias value, was evaluated calculated for each weighting factor. In the present
across the whole concentration range using weighted study, the 1 /x 2 weighting factor produced the small-
(model 7) and unweighted (model 1) linear regres- est %RE sum, and the most random distribution
sion. The results are shown in Table 3. around the x-axis at the lower end of the calibration
220 A.M. Almeida et al. / J. Chromatogr. B 774 (2002) 215–222
Fig. 3. Percentage of relative error (%RE) versus concentration obtained for model 1 (w i 5 1), model 2 (w i 5 1 /y 1 / 2 ), model 3 (w i 5 1 /y),
model 4 (w i 5 1 /y 2 ), model 5 (w i 5 1 /x 1 / 2 ), model 6 (w i 5 1 /x) and model 7 (w i 5 1 /x 2 ).
A.M. Almeida et al. / J. Chromatogr. B 774 (2002) 215–222 221
Table 3
Comparison of the accuracy obtained by using unweighted (w i 5 1) and weighted (w i 5 1 /x 2 ) linear regression in lamotrigine assay
validation a
Nominal % Bias
concentration
Model 1 Model 7
(mg / l)
Unweighted (w i 5 1) Weighted (w i 5 1 /x 2 )
y 5 0.2713x 2 0.0281 y 5 0.2804x 2 0.0049
0.1 2117.99 10.28
0.5 223.38 22.33
2.5 11.31 12.73
5.0 14.50 13.46
10.0 11.64 20.48
15.0 21.21 23.63
a
Values represent %bias obtained from five replicates; %bias 5 (Cmean 2 Cnominal ) /Cnominal 3 100.
222 A.M. Almeida et al. / J. Chromatogr. B 774 (2002) 215–222
the justification required by the regulators, as it [10] J. Pateman, Standard curve, in: H.H. Blume, K.K. Midha
(Eds.), Bioavailability, Bioequivalence and Pharmacokinetic
shows when, why and how to use weighting schemes
Studies, Medpharm Scientific, Stuttgart, 1995, p. 399.
for practising bioanalysts. [11] Analytical Methods Committee, Analyst 119 (1994) 2363.
[12] M.J. Cardone, S.A. Willavize, M.E. Lacy, Pharm. Res. 7
(1990) 154.
Acknowledgements [13] C. Hartmann, W. Penninckx, Y. Vander Heyden, P. Van-
keerberghen, D.L. Massart, R.D. Mcdowall, Experience with
chromatographic methods—Europe, in: H.H. Blume, K.K.
A.M. Almeida was supported by PRAXIS XXI Midha (Eds.), Bioavailability, Bioequivalence and Phar-
BD/ 16288 / 98 and M.M. Castel-Branco by PRAXIS macokinetic Studies, Medpharm Scientific, Stuttgart, 1995,
XXI BD/ 18351 / 98. p. 331.
[14] Water Quality—Calibration and Evaluation of Analytical
Methods and Estimation of Performance Characteristics—
Part 1: Statistical Evaluation of the Linear Calibration
References Function, International Standard—ISO 8466-1:1990 (E),
ISO, Geneve, 1990, p. 1.
[1] E.L. Johnson, D.L. Reynolds, D.S. Wright, L.A. Pachla, J. [15] R. Levie, J. Chem. Educ. 63 (1986) 10.
Chromatogr. Sci. 26 (1988) 372. [16] A.R. Buick, M.V. Doig, S.C. Jeal, G.S. Land, R.D. Mcdowall,
[2] H.T. Karnes, G. Shiu, V.P. Shah, Pharm. Res. 8 (1991) 421. J. Pharm. Biomed. Anal. 8 (1990) 629.
[3] G.K. Szabo, H.K. Browne, A. Ajami, E.G. Josephs, J. Clin. [17] J.R. Lang, S. Bolton, J. Pharm. Biomed. Anal. 9 (1991) 357.
Pharmacol. 34 (1994) 242. [18] J. Wieling, G. Hendriks, W.J. Tamminga, J. Hempenius, C.K.
[4] J.S. Garden, D.G. Mitchell, W.N. Mills, Anal. Chem. 52 Mensink, B. Oosterhuis, J.H.G. Jonkman, J. Chromatogr. A
(1980) 2310. 730 (1996) 381.
[5] M. Thompson, Analyst 113 (1988) 1579. [19] N.V. Nagaraja, J.K. Paliwal, R.C. Gupta, J. Pharm. Biomed.
[6] J.N. Miller, Analyst 116 (1991) 3. Anal. 20 (1999) 433.
[7] M. Mulholland, D.B. Hibbert, J. Chromatogr. A 762 (1997) [20] V.P. Shah, K.K. Midha, S. Dighe, I.Y. Mcgilveray, J.P.
73. Skelly, A. Yacobi, T. Layloff, C.T. Viswanathan, C.E. Cook,
[8] M.M. Castel-Branco, A.M. Almeida, A.C. Falcao, ˜ T.A. R.D. Mcdowall, K.A. Pittman, S. Spector, Eur. J. Drug
Macedo, M.M. Caramona, F.G. Lopez, J. Chromatogr. B 755 Metab. Pharmacokinet. 16 (1991) 249.
(2001) 119. [21] Guidance for Industry. Bioanalytical Methods Validation,
[9] W.W. Daniel (Ed.), Biostatistics: a Foundation for Analysis in United States Food and Drug Administration, Washington,
the Health Sciences, John Wiley, 1987, p. 367. DC, 2001, p. 6.