A Simple Method For HPLC Retention Time Prediction: Linear Calibration Using Two Reference Substances
A Simple Method For HPLC Retention Time Prediction: Linear Calibration Using Two Reference Substances
Abstract
Background: Analysis of related substances in pharmaceutical chemicals and multi-components in traditional Chi‑
nese medicines needs bulk of reference substances to identify the chromatographic peaks accurately. But the refer‑
ence substances are costly. Thus, the relative retention (RR) method has been widely adopted in pharmacopoeias and
literatures for characterizing HPLC behaviors of those reference substances unavailable. The problem is it is difficult
to reproduce the RR on different columns due to the error between measured retention time ( tR) and predicted tR in
some cases. Therefore, it is useful to develop an alternative and simple method for prediction of t R accurately.
Methods: In the present study, based on the thermodynamic theory of HPLC, a method named linear calibration
using two reference substances (LCTRS) was proposed. The method includes three steps, procedure of two points
prediction, procedure of validation by multiple points regression and sequential matching. The t R of compounds on a
HPLC column can be calculated by standard retention time and linear relationship.
Results: The method was validated in two medicines on 30 columns.
Conclusion: It was demonstrated that, LCTRS method is simple, but more accurate and more robust on different
HPLC columns than RR method. Hence quality standards using LCTRS method are easy to reproduce in different labo‑
ratories with lower cost of reference substances.
Keywords: RP-HPLC, Retention time, Relative retention, Linear calibration using two reference substances, Multi-
component analysis, Traditional Chinese medicines
© The Author(s) 2017. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License
(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium,
provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license,
and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/
publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Sun et al. Chin Med (2017) 12:16 Page 2 of 12
same type of columns was used to repeat the analyti- HPLC (UV–Vis detector) were used. Matlab software
cal methods, the differences in the performance and the was provided by Math Works Inc. USA. 30 C 18 columns
separation effects were still large. And then the methods (shown in Table 1), from 13 manufacturers, included A,
for selecting columns with equivalent selectivity, such as B, and EP types were used. And most columns belong to
the USP approach [16], the PQRI approach [17, 18] and type B according to the previous study [11–15]. Accord-
Katholieke Universiteit Leuven column classification sys- ing to the PQRI approach [17, 18] and using the data from
tem [19–21] were proposed. Take PQRI approach [17, 18] the USP website (http://www.usp.org/USPNF/columns.
as an example, hydrophobicity (H), steric interaction (S), html), the similarity of columns were calculated using
hydrogen-bond acidity (A), hydrogen-bond basicity (B) col1 as the reference column. The similarity (0–13.18)
and ion-exchange capacity (C), were used to describe the showed that the differences among the columns were
performance of the column. And the similarity between a large, which indicated that the selected columns are in a
column and the reference column was calculated by these wide range and have good representative trait.
five parameters. When the similarity was less than three, Reference substances of psoralen, isopsoralen, Chon-
the two columns were regarded to be equivalent. Using glou saponin I, Chonglou saponin II, Chonglou saponin
the equivalent column, the reproducibility of separation VI, Chonglou saponin VII, ethinylestradiol, and herbal
and RR could be improved to some extent. However, in reference substances including Psoraleae Fructus (Pso-
addition to column, many other factors also have great raleae) and Paridis Rhizome (Paridis), were supplied
influences on the chromatograms, such as the dead vol- by the National Institutes for Food and Drug Control,
ume of chromatographic system, the different structure China. Methanol, acetonitrile, and phosphoric acid were
of analytes, the complexity of the chromatographic con- HPLC graded and supplied by the Fischer Company,
ditions, and so on. Therefore, it is necessary to develop USA. Ammonium nitrate (analytical grade) was sup-
a method that takes all aforementioned factors into plied by Beijing Chemical Works. Water was prepared by
account to reduce the prediction error of the tR. Milli-Q system, Millipore Company, USA.
According to the thermodynamic theory of liquid chro-
matography, there is a linear relationship between the t R of Preparation of sample solution
the compounds on two different HPLC systems (including Psoraleae [26]: weigh 0.5 g of the powder and place it in a
chromatographs and columns) [22]. For better understand- 50-mL stopper conical flask, then add 25 mL of ethanol.
ing, the pdf of reference 22 (in Chinese, Additional file 1) Sonicate the mixture for 30 min and centrifuge for 5 min.
and the English version of reference 22 (only the section Filter the supernatant through a 0.45-μm PTFE filter.
of theory was translated, Additional file 2) are provided. Paridis [27]: weigh 0.5 g of the powder to a stopper
Combined with the above principle and previous studies conical flask, add 25 mL of ethanol, heat under reflux on
[23–25], a novel method using two reference substances for a water bath for 30 min, cool and filter the supernatant
predicting HPLC tR has been proposed (linear calibration through a 0.45-μm PTFE filter.
by two reference substances, LCTRS). The StR (arithmetic
average of tR for the same compound on different HPLC Chromatographic conditions
system under the same chromatographic conditions) is Psoraleae [26]: mobile phase A was water and mobile phase
used as the reference value, and the linear regression is used B was methanol. The elution procedure was shown below:
as the basic algorithm for tR prediction. In this study, the 0–20 min, 50%B→70%B; 20–45 min, 70%B→85%B;
method was validated in two medicines on 30 C18 columns. 45–50 min, 85%B→90%B; 50–60 min, 90%B. Detection
Compared with the RR method, LCTRS method is proved wavelength was 308 nm. Paridis [27]: mobile phase A was
to be more accurate, and more robust on different HPLC water and mobile phase B was acetonitrile. The elution
columns. Hence, it provides a good prospective application procedure was shown below: 0–40 min, 30%B→60%B;
in quantification of multi-components in TCMs as well as 40–50 min, 60%B→30%B; 50–60 min, 30%B. Detection
related substances in pharmaceutical chemicals. wavelength was 203 nm. Column temperatures were both
set at 30 °C and flow rate was 1.0 mL/min.
Methods
The Minimum Standards of Reporting Checklist contains Results
details of the experimental design, and statistics, and HPLC chromatogram of samples
resources used in this study (Additional file 3). The typical chromatograms of Psoraleae and Paridis were
shown in Fig. 1. The peaks were mainly identified by the
Instruments and reagents reference substances. For those peaks without reference
Waters e2695 HPLC (2998PDA detector), Agilent 1260 substances, UV–Vis spectrum and mass spectrum were
HPLC (DAD detector), and Shimadzu LC-2010A HT used for identification.
Sun et al. Chin Med (2017) 12:16 Page 3 of 12
Standard retention time (StR) that the advantages of using StR was better than tR of any
Under the same chromatographic conditions, measured single column. In this paper, the deviation (ΔtR) of tRmea
retention time (tRmea) of the four saponins in Paridis on and tRpre (formula 2) was used to evaluate the merits and
different chromatographic systems (which includes HPLC defects of RR method and LCTRS method.
instruments and columns, hereinafter referred to as col-
umns due to the differences of tR mainly caused by col-
n
StR = tRi /n (n ≥ 1) (1)
umns) were shown in Table 1. The arithmetic average of
i=1
tR for the same compound on different columns is called
StR, formula (1). Just like RR, S
tR is the reference value for
calculating the predicted retention time (tRpre) of analyte
tR = |tR mea − tR pre| (2)
in the samples. Theoretically, under the same chromato- Linear principle of LCTRS
graphic condition, the RR calculated by different columns According to the chromatographic thermodynamic the-
is constants, but StR is not. It will be discussed in Sec- ory, Wang et al. proved that there was a linear relation-
tion "Minimum number of columns for StR calculation" ship between the tR of the same compounds on different
Sun et al. Chin Med (2017) 12:16 Page 4 of 12
HPLC system (mainly considered as columns) under the on 30 columns was used to fit multiple point linear equa-
same chromatographic conditions [22], as expressed in tion. The averages of ΔtR (average ± standard deviation)
formula (3) and Fig. 2a, b. were calculated, as shown in Fig. 3. For both medicines,
the prediction deviation was reduced with increasing
tR coli = a × tR colj + b (3) number of columns. However, the prediction accuracy
will not be significantly improved when the number of
Since formulas (1) and (3) are both linear, thus there columns reaches five, which is considered as a low-cost
is a linear relationship between tR and StR for each and reasonable limit. It is recommended to choose five–
compound, as shown in formula (4) and Fig. 2c, d. It is fifteen columns for StR calculation.
noteworthy that the correlation coefficient of the linear Even for the columns with same type of packing mate-
regression is higher than that shown in Fig. 2a, b. rial, there are still some differences among the column
stationary phase, packing techniques and errors in the
tR coli = a × StR + b (4) process of chromatographic analysis. Those differences
Minimum number of columns for StR calculation will cause deviation of tR. The physical explanation of
Theoretically, tR on any column can be used as reference StR calculation was to evenly mix and refill the station-
value for linear fitting. But the ΔtR calculated with ran- ary phase of the columns selected. Because of reducing
dom column were instable. Thus, the reasonable number the random and system errors, the prediction result was
of columns for S tR calculation was thoroughly investi- accurate and robust.
gated by random sampling. StR was calculated based on
1, 5, 10, 15, 20, 25, and 30 columns combined with non- Procedure of two points prediction
replicate random sampling times of 30, 100, 100, 100, For RR method, only one compound was chosen as refer-
100, 100, and 1, respectively. The value of S
tR with tRmea ence compound (reference substance required), and RR
Sun et al. Chin Med (2017) 12:16 Page 5 of 12
Fig. 2 Linear fitting results of Psoraleae and Paridis, code No. is the same as that in Fig. 1
of all other compounds were used as reference value for saponin VII and Chonglou saponin I) and sample solu-
calculating tRpre. For LCTRS method, two compounds tion were performed on a C 18 column (col4: BDS Hypersil
were chosen as reference compounds (reference sub- C18). The tRmea (21.014 and 35.170 min) of two reference
stances required), and StR of all other compounds were compounds in the sample solution were obtained by the
used as reference value for calculating tRpre. The refer- reference substances solution (Fig. 4a). Then two points,
ence compounds, the value of RR and StR were shown in Chonglou saponin VII (19.803, 21.014) and Chonglou
Tables 2 and 3. saponin I (33.035, 35.170), could be determined in the
Take Paridis as an example. First of all, reference sub- coordinate using StR as abscissa and tRmea as ordinate.
stances solution of two reference compounds (Chonglou Based on the two points, the following linear equation
Sun et al. Chin Med (2017) 12:16 Page 6 of 12
Psoraleae (n = 23) 0.226 0.247 0.615 0.657 0.794 0.808 0.878 1.000a 1.061 1.103 1.152
a
Paridis (n = 30) 0.652 0.729 1.000 1.090
a
Reference compound
Psoraleae (n = 23) 9.271 10.122a 25.143 26.854 32.437 32.951 35.843 40.793a 43.254 44.950 46.937
a a
Paridis (n = 30) 19.803 22.156 30.319 33.035
a
Reference compound
was given: y = 1.0698x − 0.1719 (Fig. 4b). Taking S tR Procedure of multiple points regression
of analytes (Chonglou saponin VI and Chonglou sapo- After assignment of the peaks of analytes in the sample
nin II) into equation, the tRpre of Chonglou saponin VI solution by prediction of two points regression, the tRmea
(23.481 min) and Chonglou saponin II (32.263 min) were of those peaks should be validated by multiple points
attained, respectively. Finally, in the chromatogram of the regression. In this procedure, tRmea and StR of refer-
sample solution, the corresponding peaks of Chonglou ence compounds and analytes were used to fit a multiple
saponin VI and Chonglou saponin II can be found within points linear regression: Y = 1.1038x − 1.1075 (Fig. 4d).
the range of t Rpre ± tRW (tRW is abbreviation of t R win- Taking StR of analytes (Chonglou saponin VI and Chon-
dow, in this case is 0.6 min), as shown in Fig. 4c. It can be glou saponin II) into new equation, the new tRpre of
seen that ΔtR of analytes calculated by prediction of two Chonglou saponin VI (23.297 min) and Chonglou sapo-
points were 0.583 min and 0.416 min (The tRmea of ana- nin II (32.358 min) were calculated. If ΔtR of all analytes
lytes were 22.898 min and 32.679 min). were less than the given tRL (tRL: tR limit, in this case
Sun et al. Chin Med (2017) 12:16 Page 7 of 12
Fig. 4 Flow chart of LCTRS (Paridis, code No. is the same as that in Fig. 1)
Sun et al. Chin Med (2017) 12:16 Page 8 of 12
is 0.5 min), the prediction was success, otherwise fail- treated as peak series for sequential matching. That is, the
ure (Fig. 4e). In this case, ΔtR were 0.399 and 0.321 min, earlier tRpre will be matched to the peak with the earlier
respectively. The step of validation by multiple points tRmea. Although ΔtR of peak #6 increased to 1.036 min,
was based on the principle of stepwise linear regression, the match results were correct, as shown in Fig. 5b. This
which can further improve the prediction accuracy. rule can be further applied to multiple-peak series, which
The purpose of setting that the tRW is larger than has a close tR.
tRL is to increase the amount of suitable columns and
to improve the accuracy of prediction. Generally, the Comparison between LCTRS method and RR method
recommended ranges of tRW and tRL are 0.8–2.0 and The comparison among unadjusted RR method, adjusted
0.5–1.5 min, respectively. If necessary, the values can be RR method (dead time was measured by ammonium
adjusted in accordance with different samples under dif- nitrate as probe compound), prediction by two points,
ferent chromatographic conditions. If the ΔtR of some and validation by multiple points was summarized in
compounds are large, their tRW and tRL can be set Tables 4 and 5. The results showed that the unadjusted
individually. RR method and adjusted RR method were similar, their
prediction accuracy were bigger and suitable for less pos-
Sequential matching rule itive columns. But the prediction deviation was reduced
If the tR of two peaks are too close, e.g. less than 2 min, and the number of positive columns was increased by
there would be a mistake for peak matching by the least LCTRS method. The best was validation by multiple
ΔtR rule. Take Psoraleae for example, as shown in Fig. 5a, points which was based on the prediction by two points.
peak #6 was assigned to peak A in the sample solution on
col6 (Kromasil C18) with a small ΔtR of 0.515 min. How- Exclusion of column and compound by linear fitting
ever, peak #5 was not found and peak B in the sample Nonlinear shift of tR for a compound on different col-
solution was not matched. When t RW was set as 1.2 min, umns could be caused either by different column pack-
tRmea of peak A was within the window of t Rpre of peak ing materials and use of other packing techniques, or by
#5. tRmea of the peaks A and B would both fall into the the different compound structure. In order to exclude the
window of tRpre of peak #6. Because of the existence of columns and compounds with relatively large nonlinear
one common peak (peak A), peaks #5 and 6 should be shift, linear fitting of tRmea and StR were performed. The
Fig. 5 Advantage of sequential matching (Psoraleae, code No. is the same as that in Fig. 1)
Sun et al. Chin Med (2017) 12:16 Page 9 of 12
following rules were used to identify the outlier column requirements (the average of correlation coefficient was
and compound. (1) In a regression scatter plot, the com- 0.9989). The outlier columns were col2, 8, 12 (Fig. 6a), 16,
pounds obviously deviated from a regression line (the 19, 20 and 30. For Paridis: no obvious nonlinear devia-
correlation coefficient is usually less than 0.99). (2) ΔtR tion was observed of four saponins. All 30 columns met
was usually larger than 1–2 min. The excluded columns the requirement with average correlation coefficient of
and compounds would not be used for StR calculating. 0.9993.
For Psoraleae: no obvious nonlinear deviation was In order to simulate tR of compound with large
observed of all 11 compounds. 23 columns met the structural difference, reference substances solution of
Fig. 6 Outlier column (a) and Outlier compounds (b), code No. is the same as that in Fig. 1
Sun et al. Chin Med (2017) 12:16 Page 10 of 12
ethinylestradiol mixing with four Chonglou saponins ends, otherwise they are in the middle or near the same
were used to measure tR of those five compounds on 30 end]. The coverage corresponding to the smallest ΔtR
columns. Nonlinear shift of ethinylestradiol was observed was 80–100%.The results of Psoraleae (Fig. 7a) showed
on col1 (Fig. 6b), 2–6, 8, 11, 15–18, 26–28 and 30. It the advantage of choosing reference compounds with
appears that the HPLC retention behaviors of ethinyle- smaller linear deviation, when the coverages of tR were
stradiol and four Chonglou saponins were significantly similar. Therefore, the optimized reference compounds
different on this chromatographic condition. It further for Psoraleae were peak #2 and peak #8, rather than
indicated that the classification and similarity evaluation peak #1 and peak #11 which had a maximum coverage
of columns should be based on the characteristics of col- of tR but with more deviation from linearity. The selec-
umns as well as analytes. tion rule decreases the randomness of choosing reference
If the outlier compounds cannot be excluded. The fol- compounds and the amount of calculation (or the ΔtR of
lowing approaches could be used: (1) specify one or more all possible reference compound pairs will be calculated
suitable columns; (2) provide reference substances for every time). The accurate and simple selection proce-
those compounds; (3) use UV–Vis spectrum and/or mass dures were as follows. Firstly, Select two reference com-
spectrum for assistant peak identification. pound pairs with large tR coverage (80–100%). Secondly,
exclude compounds with large linear deviation based on
Selection of two reference compounds the linear fitting results. Thirdly, calculate the ΔtR of the
Ideally there should be no difference in selecting any of rest reference compound pairs and select reference com-
the two compounds as reference compounds. However, pound pairs with the smallest ΔtR.
because of the difference of HPLC instruments, columns, tR2 − tR1
compounds structure, complexity of elution condition, Coverage of tR = (5)
tRlast − tRfirst
and accidental error of analysis, different selection of ref-
erence compound pairs will make differences. In order to tR2 is tR (or StR) of second reference compound; tR1 is tR
find out the rule for reference compounds selection, each (or StR) of first reference compound; t Rlast is t R (or S tR) of
combination of possible reference compound pairs for last compound; t Rfirst is tR (or StR) of first compound.
the two medicines was studied. The average of ΔtR cor- In summary, the establishment procedures of LCTRS
responding to each reference compound pair were cal- were as follows. Firstly, select five–fifteen different
culated and shown in Fig. 7. It can be seen that, for the brands of C18 columns, and record the HPLC chroma-
two medicines, the ΔtR of prediction by two points step tograms of reference substances and sample on all col-
would be decreased with increasing coverage of tR [as umns. Secondly, calculate initial S tR by using all columns,
shown in formula (5). The coverage of t R is a reflex of the and perform linear fitting of tR on each column with StR.
relative position of the two reference compounds. The Exclude outlier columns and compounds, and recalculate
first compound is at one end (with smaller tR), the last final StR using remaining columns. Finally, select two ref-
compound is at the other end (with bigger tR). If the cov- erence compounds with large tR coverage and low linear
erage is high, the two reference compounds are near both deviation.