Accepted
the cancer glands in prostate cancer. Sharma et al. (2015) segmented nuclei and classified them with AdaBoost based on intensity and morphological features of the nuclei.

Kong et al. (2009) classified neuroblastoma into three categories: undifferentiated, poorly-differentiated, and differentiating. They segmented the images at each resolution level into cellular, neuropil, and background elements and classified neuroblastoma histological images by integrating classifiers such as Linear Discriminant Analysis (LDA) (Lehmann 1998), Support Vector Machine (SVM) (Cortes & Vapnik 1995), and k-Nearest Neighbor (k-NN) (Cover & Hart 1967). Tafavogh et al. (2014) proposed a four-stage algorithm to classify neuroblastoma tumour images into undifferentiated and poorly-differentiated using the Otsu segmentation technique (Otsu 1979). The whole slide image was partitioned. Intensity variation was reduced using an image enhancement technique, regions of interest were segmented by thresholding techniques, and histological images were classified using rules based on the Shimada scheme.

In some applications, feature extraction has been used to classify histological images. All of the existing methods extracted global features, which are sensitive to intra-class variation. Tabesh et al. (2007) classified prostate tissue into tumour and non-tumour based on color, texture, and morphologic features. Qureshi et al. (2008) classified meningioma using the wavelet packet transform and Local Binary Patterns (Ojala et al. 2002). Zhang et al. (2013) used a combination of curvelet transform, gray level co-occurrence matrix, and the Completed Local Binary Patterns (CLBP) as features to stratify breast cancer tumours from histological images. Farjam et al. (2007) used texture features to identify the glandular elements within images of prostate tissue. They applied k-means to cluster the image components. Spanhol et al. (2016) applied a completed modeling of the Local Binary Pattern (LBP), based on three components extracted from the 8-neighborhood: center pixel, sign, and magnitude. The center pixel is coded into a binary bit after global thresholding. The difference signs and magnitudes are coded in binary format so that they can be combined to form the final Completed Local Binary Pattern histograms. Spanhol et al. (2016) used the combination of Completed Local Binary Patterns with k-NN and SVM to classify a large dataset of breast cancer histological images into benign and malignant classes. Classification of neuroblastoma histological images remains challenging due to the intra-class variation.

3 Dataset

There is a lack of large and publicly available image datasets for analysis of neuroblastic tumours, which significantly hinders development and validation of methods. Therefore, we gathered a dataset of images from neuroblastic tumours. Tumour access is compliant with local policy, national legislation, and ethical mandates to use human tissue in research. All patient-specific details were removed and a de-identified dataset was used for this research. The initial dataset consisted of images of tissue microarrays (TMA) of neuroblastic tumours, scanned by the Aperio ScanScope system. Each slide was composed of 20 to 40 1.2 mm cores of neuroblastic tumour, stained with haematoxylin and eosin (H&E) and cut
Figure 2: Quantitative actual size of tissue spots and cropped images
poorly-differentiated 571 77
differentiating 187 12
undifferentiated 155 10
ganglioneuroma 84 18
ganglioneuroblastoma 46 8
at 3µm. In this method, the contrast between different cells which have different colors is increased. Staining with H&E allows observation of histological structures. TMA images were in svs format with resolution 0.2µm; images were viewed and extracted using ImageScope software (ImageScope 2016). Tissue cores were classified by experts into five different categories: poorly-differentiated, differentiating, undifferentiated, ganglioneuroma, and ganglioneuroblastoma, according to the Shimada classification system. Representative images in the categories are shown in Figure 1.

Areas best representative of each category, and devoid of artefacts, were selected from each tissue core by an expert histopathologist. At 40× magnification, the cropped image size was 300×300 pixels with a real size of 80 × 80µm, which is approximately one third of the area of an optical microscope high power field of view. Figure 2 shows the quantitative actual size of tissue spots and cropped images.

This size was chosen as a compromise between being large enough to capture diagnostic features of each category and small enough to limit computational cost. Numbers of images in our dataset are given in Table 1. It is much larger in terms of patients and images than the datasets used by Tafavogh et al. (2014) and Kong et al. (2009). Moreover, the intra-class variation of neuroblastoma cells in the gathered dataset is very high, which means that cells in neuroblastoma histological images from the same patient and the same class can have different shapes. An example of the high intra-class variation of differentiating neuroblastoma is shown in Figure 3. As can be seen, both images are of the differentiating type but their cells have different shapes.

4 Patch Completed Local Binary Pattern (PCLBP)

Before describing our PCLBP algorithm, we first describe the algorithm it is based on. Completed Local Binary Pattern (CLBP) is one of the latest variants of Local Binary Pattern (LBP) (Ojala et al. 2002). The LBP operator computes the distribution of binary patterns in the circular neighborhood characterized by a radius R and a number of neighbors P. The idea is to threshold the P neighboring pixels against the central pixel. If the intensity of a neighbor pixel is greater than or equal to that of the central pixel, the value 1 is assigned, otherwise 0. Therefore, a binary pattern is obtained from the neighborhood. The LBP function at pixel (X, Y) is (from Ojala et al. 2002)

\[ \mathrm{LBP}(f(X, Y)) = \sum_{i=0}^{P-1} 2^{i} \cdot u\big(f(X_i, Y_i) - f(X, Y)\big) \tag{1} \]

where f(X_i, Y_i) and f(X, Y) are the grey levels of pixels (X_i, Y_i) and (X, Y), and u(·) is the unit step function.

The CLBP is a completed modeling of LBP (Guo et al. 2010) which is based on three components extracted from the local region: center pixel, sign, and magnitude. The center pixel is coded by a binary code after thresholding, with the threshold set as the average grey level of the whole image. For computing the sign and magnitude, a neighborhood of radius R and number of neighbors P is considered. Signs and magnitudes are computed and coded by a specific operator into the binary format so that they can be combined to form the final CLBP histograms (Guo
Figure 3: An example of high intra-class variation of differentiating neuroblastoma
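As an illustration of Equation (1) and of the sign, magnitude, and center components described for CLBP, the following is a minimal sketch rather than the authors' implementation. It assumes P = 8 and R = 1 on the square 8-neighborhood without interpolation, and it thresholds the magnitudes against their mean, which follows the CLBP formulation of Guo et al. (2010) and is not stated in the text above. The function names and the toy image are hypothetical.

```python
# Offsets (dy, dx) of the P = 8 neighbours at R = 1.
# Assumption: square 8-neighbourhood, no sub-pixel interpolation.
OFFSETS = [(0, 1), (-1, 1), (-1, 0), (-1, -1),
           (0, -1), (1, -1), (1, 0), (1, 1)]


def unit_step(v):
    """u(v) in Equation (1): 1 if v >= 0, otherwise 0."""
    return 1 if v >= 0 else 0


def lbp_code(img, y, x):
    """Equation (1): sum over i of 2^i * u(f(X_i, Y_i) - f(X, Y))."""
    centre = img[y][x]
    return sum((1 << i) * unit_step(img[y + dy][x + dx] - centre)
               for i, (dy, dx) in enumerate(OFFSETS))


def sign_magnitude(img, y, x):
    """Split each neighbour-minus-centre difference into sign bit and magnitude."""
    centre = img[y][x]
    diffs = [img[y + dy][x + dx] - centre for dy, dx in OFFSETS]
    return [unit_step(d) for d in diffs], [abs(d) for d in diffs]


def clbp_codes(img, y, x, mean_magnitude, mean_grey):
    """Return (sign code, magnitude code, centre bit) for one pixel.

    mean_magnitude: threshold for the magnitudes (assumption, after Guo et al.).
    mean_grey: average grey level of the whole image (as stated in the text).
    """
    signs, mags = sign_magnitude(img, y, x)
    s_code = sum(s << i for i, s in enumerate(signs))
    m_code = sum(unit_step(m - mean_magnitude) << i for i, m in enumerate(mags))
    c_bit = unit_step(img[y][x] - mean_grey)
    return s_code, m_code, c_bit


# Toy 3x3 grey-level image with a single interior pixel at (1, 1).
img = [[5, 9, 1],
       [4, 6, 7],
       [2, 6, 3]]
_, mags = sign_magnitude(img, 1, 1)
mean_mag = sum(mags) / len(mags)                   # mean over this one pixel only
mean_grey = sum(sum(row) for row in img) / 9
codes = clbp_codes(img, 1, 1, mean_mag, mean_grey)  # (69, 166, 1)
```

The sign code is identical to the LBP code of Equation (1); the magnitude code and the centre bit carry the extra information. In a full pipeline the codes of all pixels would be accumulated into histograms, which this sketch omits.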
Figure 9: Accuracy of k-NN classifier versus patch width (W ) and k in parameter tuning of k-NN classifier
The database is divided randomly into two subsets: parameter-tuning (211 images) and validation (832 images) datasets. We select the optimum values for free parameters using the training dataset and fix them for the validation. Then, we evaluate the system using the validation dataset and selected parameters.

accuracies using different kernels. As can be seen, the best result is achieved using the RBF kernel, so we selected it for the next experiments. The RBF parameter γ was determined empirically through experiments, with the best value being 1/256 (256 is the number of different intensities in the images).
Table 3: Weighted average precision, recall, and F-measure obtained by our system and Spanhol’s system.
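For reference, weighted averages of the kind reported in Table 3 can be computed as in the sketch below, where each class's precision, recall, and F-measure is weighted by its share of the true labels. This is a minimal illustration under the assumption that the F-measure is the balanced F1 score; the function name, labels, and toy predictions are hypothetical and are not results from the paper.

```python
from collections import Counter


def weighted_scores(y_true, y_pred):
    """Return support-weighted (precision, recall, F1) over all true classes."""
    support = Counter(y_true)  # number of true examples per class
    total = len(y_true)
    w_precision = w_recall = w_f1 = 0.0
    for label, count in support.items():
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = count - tp
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        weight = count / total  # weight by number of examples in the class
        w_precision += weight * precision
        w_recall += weight * recall
        w_f1 += weight * f1
    return w_precision, w_recall, w_f1


# Toy labels: pd = poorly-differentiated, d = differentiating, u = undifferentiated.
y_true = ["pd", "pd", "pd", "d", "d", "u"]
y_pred = ["pd", "pd", "d", "d", "d", "pd"]
precision, recall, f1 = weighted_scores(y_true, y_pred)
```

Note that the support-weighted recall equals the overall accuracy (4 of 6 correct in the toy example), which is one reason the weighted scores are reported alongside each other.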
images) sets. We train the algorithm using the training set (with the parameter values selected in Section 5.1) and test using the validation set. We repeat this procedure 10 times and report the average accuracy. Algorithm performance for k-NN and SVM is reported with the average F-measure, recall, and precision (Powers 2011), weighted by the number of examples in each of the five classes. The distribution of the computed F-measures for the two classifiers and feature extraction approaches over the ten trials is presented in Figure 10 and shows that SVM works better than k-NN. The t-test with P value = 0.03 and α = 0.05 (significance level) shows that the combination of our algorithm with the SVM classifier significantly improves the accuracy of classification in comparison with the k-NN classifier. Table 3 indicates that our algorithm obtains approximately 5% higher accuracy when it is combined with the SVM classifier compared to the k-NN classifier. We also test the CLBP algorithm (Spanhol et al. 2016) on the test images as a benchmark, again reporting the weighted average of precision, recall, and F-measure. Table 3 reports the weighted average precision, recall, and F-measure of our system and Spanhol’s system. The weighted average precision, recall, and F-measure of our system are better than those of Spanhol’s system.

6 Discussion

The proposed algorithm is a new feature extraction method to classify neuroblastoma histological images into five different groups. Although a large number of methods have been proposed in the literature, our system has multiple advantages over these systems:

1. There is no feature-based method to classify histological images into more than two categories. Existing methods address the more straightforward binary classification problem. However, the proposed method can classify neuroblastoma tumour images into five different categories.

2. Neuroblastoma has a complex texture with a great deal of complicated features compared to other types of cancer such as breast cancer. It is the first time that neuroblastoma histological images are classified into five different categories using a feature extraction method.

3. The proposed method extracts features within small patches which are not easily detected by human eyes.

7 Conclusion

We proposed a new Patched Completed Local Binary Pattern (PCLBP) to classify neuroblastic tumours into five different categories using extracted feature vectors from histological images. The algorithm built the feature vector by extraction of SBP and MBP within local patches. The advantage of the proposed method is extraction of local features which are more robust to intra-class variation compared to global feature extraction. The evaluation was conducted on a gathered dataset with 1043 cropped images from samples of five different categories. We compared the results obtained by our system with the state-of-the-art. Results indicate that the proposed method has improved the average weighted F-measure for k-NN and SVM by 1.89% and 0.81%, respectively, compared to the benchmark.

References

Boser, B., Guyon, I. & Vapnik, V. (1992), A Training Algorithm for Optimal Margin Classifiers, in ‘Proceedings of the Fifth Annual Workshop on Computational Learning Theory’, pp. 144–152.

Boucheron, L. (2008), Object- and Spatial-Level Quantitative Analysis of Multispectral Histopathology
Images for Detection and Characterization of Cancer, Doctoral Dissertation in University of California at Santa Barbara.

Chang, C. & Lin, C. (2011), ‘LIBSVM: A Library for Support Vector Machines’, ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 1–27. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.

Cortes, C. & Vapnik, V. (1995), ‘Support-Vector Networks’, Machine Learning 20(3), 273–297.

Cosatto, E., Miller, M., Graf, H. & Meyer, J. (2008), Grading Nuclear Pleomorphism on Histological Micrographs, in ‘International Conference on Pattern Recognition’, pp. 1–4.

Cover, T. & Hart, P. (1967), ‘Nearest Neighbor Pattern Classification’, IEEE Transactions on Information Theory 13(1), 21–27.

Farjam, R., Soltanian-Zadeh, H., Jafari-Khouzani, K. & Zoroofi, R. (2007), ‘An Image Analysis Approach for Automatic Malignancy Determination of Prostate Pathological Images’, Cytometry Part B: Clinical Cytometry 72(4), 227–240.

Guo, Z., Zhang, L. & Zhang, D. (2010), ‘A Completed Modeling of Local Binary Pattern Operator for Texture Classification’, IEEE Transactions on Image Processing 19(6), 1657–1663.

Hipp, J., Flotte, T., Monaco, J., Cheng, J., Madabhushi, A., Yagi, Y., Rodriguez-Canales, J., Emmert-Buck, M., Dugan, M., Hewitt, S., Toner, M., Tompkins, R., Lucas, D., Gilbertson, J. & Balis, U. (2011), ‘Computer-Aided Diagnostic Tools Aim to Empower Rather than Replace Pathologists: Lessons Learned from Computational Chess’, Journal of Pathology Informatics 2(1), 25–25.

ImageScope (2016).
URL: http://www.leicabiosystems.com/digital-pathology/digital-pathology-management/imagescope/

Kong, J., Sertel, O., Shimada, H., Boyer, K., Saltz, J. & Gurcan, M. (2009), ‘Computer-Aided Evaluation of Neuroblastoma on Whole-Slide Histology Images: Classifying Grade of Neuroblastic Differentiation’, Pattern Recognition 42(6), 1080–1092.

Lee, K. & Street, W. (2003), ‘An Adaptive Resource-Allocating Network for Automated Detection, Segmentation, and Classification of Breast Cancer Nuclei Topic Area: Image Processing and Recognition’, IEEE Transactions on Neural Networks 14(3), 680–687.

Lehmann, E. (1998), Theory of Point Estimation, Springer.

Mohammed, E., Mohamed, M., Naugler, C. & Far, B. (2013), Chronic Lymphocytic Leukemia Cell Segmentation from Microscopic Blood Images using Watershed Algorithm and Optimal Thresholding, in ‘26th IEEE Canadian Conference on Electrical and Computer Engineering’, pp. 1–5.

Nguyen, K., Jain, A. & Sabata, B. (2011), ‘Prostate Cancer Detection: Fusion of Cytological and Textural Features’, Journal of Pathology Informatics 2:S3.

Ojala, T., Pietikainen, M. & Maenpaa, T. (2002), ‘Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns’, IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7), 971–987.

Otsu, N. (1979), ‘A Threshold Selection Method from Gray-Level Histograms’, IEEE Transactions on Systems, Man, and Cybernetics 9(1), 62–66.

Park, J., Caron, H. & Eggert, A. (2008), ‘Neuroblastoma: Biology, Prognosis, and Treatment’, Pediatric Clinics of North America 55(1), 97–120.

Powers, D. (2011), ‘Evaluation: from precision, recall and f-measure to roc, informedness, markedness, and correlation’, Journal of Machine Learning Technologies 2(1), 37–63.

Qureshi, H., Sertel, O., Rajpoot, N., Wilson, R. & Gurcan, M. (2008), Adaptive Discriminant Wavelet Packet Transform and Local Binary Patterns for Meningioma Subtype Classification, in ‘Proceedings of the 11th International Conference on Medical Image Computing and Computer-Assisted Intervention, Part II’, pp. 196–204.

Sharma, H., Zerbe, N., Heim, D., Wienert, S., Behrens, H., Hellwich, O. & Hufnagl, P. (2015), A Multi-Resolution Approach for Combining Visual Information using Nuclei Segmentation and Classification in Histopathological Images, in ‘10th International Conference on Computer Vision Theory and Applications’, pp. 37–46.

Shimada, H., Ambros, I., Dehner, L., Hata, J., Joshi, V., Roald, B., Stram, D., Gerbing, R., Lukens, J., Matthay, K. & Castleberry, R. (1999), ‘The International Neuroblastoma Pathology Classification (the Shimada System)’, Cancer 86(2), 364–372.

Spanhol, F., Oliveira, L., Caroline, P. & Laurent, H. (2016), ‘A Dataset for Breast Cancer Histopathological Image Classification’, IEEE Transactions on Biomedical Engineering 63(7), 1455–1462.

Tabesh, A., Teverovskiy, M., Pang, H., Kumar, V., Verbel, D., Kotsianti, A. & Saidi, O. (2007), ‘Multifeature Prostate Cancer Diagnosis and Gleason Grading of Histological Images’, IEEE Transactions on Medical Imaging 26(10), 1366–1378.

Tafavogh, S., Meng, Q., Catchpoole, D. & Kennedy, P. (2014), Automated quantitative and qualitative analysis of the whole slide images of neuroblastoma tumour for making a prognosis decision, in ‘Proceedings of the IASTED 11th International Conference on Biomedical Engineering’, pp. 244–251.

Veta, M., Pluim, J., van Diest, P. & Viergever, M. (2014), ‘Breast Cancer Histopathology Image Analysis: A Review’, IEEE Transactions on Biomedical Engineering 61(5), 1400–1411.

Yu, K., Zhang, C., Berry, G., Altman, R., Re, C., Rubin, D. & Snyder, M. (2016), ‘Predicting Non-Small Cell Lung Cancer Prognosis by Fully Automated Microscopic Pathology Image Features’, Nature Communications 7, 1–10.

Zhang, Y., Zhang, B. & Lu, W. (2013), ‘Breast Cancer Histological Image Classification with Multiple Features and Random Subspace Classifier Ensemble’, Studies in Computational Intelligence 450, 27–42.