Multi Class
Multi Class
1 Introduction
Segmentation of healthy and pathologic brain tissues from magnetic resonance
images (MRI), including their subregions, is important in cancer treatment plan-
ning as well as for cancer research. In current clinical practice, the analysis of
brain tumor images is mostly done manually. Apart from being time-consuming,
this has the additional drawback of significant intra- and interrater variability,
which was reported to be around 20%±15% and 28%±12% respectively accord-
ing to Mazzara et al. [8].
Nowadays, in clinical practice, usually four different MRI modalities are used
to delineate the tumor and its subregions from MRI images of the head. These
?
Funding by the European Union within the framework of the ContraCancrum project
(FP7 - IST-223979) is gratefully acknowledged.
2
2 Methods
2.1 Feature Extraction
Features are extracted from the multispectral imaging data. The most prominent
features for distinguishing pathologic and healthy tissues, as well as all their
subregions, are the image intensities in the different modalities. Additionally, we
use the first order texture features (mean, variance, skewness, kurtosis, energy,
3
entropy) according to [10]. First-order textures can be computed fast and easily
from small patches around each voxel in all four modalities. This yields a 28-
dimensional feature vector x, which consists of the voxel-wise concatenation of
the multimodal intensities I and multimodal textures T at each voxel i as shown
in equation (1).
x(i) = [IT1 (i), IT1c (i), IT2 (i), IT2f (i), TT1 (i), TT1c (i), TT2 (i), TT2f (i)] (1)
2.2 Classification
Classification is done using a soft-margin SVM classifier [9]. SVMs are discrimi-
native classifiers, originating from machine learning. They require a training step
to find a separating hyperplane for the data in the feature space. SVMs solve
the optimization problem
l
1 T X
min w w+C ξi (2)
w,b,ξ 2 i=1
subject to yi (wT φ(xi ) + b) ≥ 1 − ξi , with ξi ≥ 0 (3)
where (xi , yi ) are the instance-label pairs of the dataset and w is the normal
vector of the separating hyperplane. C is a penalty parameter for the error term
and b is the offset of the hyperplane. The appealing property of SVMs is that
they offer the possibility to use a kernel function K(xi , xj ) = φ(xi )T φ(xj ) for
transforming the data into a higher-dimensional feature space, where the data
can be linearly separated efficiently with a maximum margin. Slack variables ξi
are used for soft-margin classification. Parameter selection for the SVM classi-
fier with a radial basis function (RBF) kernel is done using grid-based cross-
validation on the training data. In order to extend the inherently binary SVM
classifier to a multiclass problem, we use a one-against-one voting strategy. Our
SVM classification is based on the LibSVM implementation described in [2].
2.3 Regularization
Regularization is done in two different stages using a CRF method. Conditional
Random Fields are an extension of Markov Random Fields (MRF). Both offer the
possibility to formulate the regularization as an energy minimization problem.
To this end we use a second-order CRF with two energy terms.
X X
E= V (yi ) + w(yi , yj , xi , xj )Dpq (yi , yj ) (4)
i ij
The first term in equation (4) denotes the data energy, which is only depen-
dent on the data at the current point, whereas the second term constitutes the
smoothness energy, which takes the neighborhood information into account. For
4
the case at hand, V is the unary potential function, w describes the neighborhood
relationships and Dpq is a label distance function for the pairwise potentials. The
unary potentials can be calculated directly from the voxel-wise output produced
by the SVM classifier. The smoothness energy is computed depending on the
neighboring voxels.
We compute the unary potentials V (yi ) from the label output yi of the SVM
classifier. In equation (5), c1 is a constant which allows us to adjust the weights of
unary and pairwise potentials. ỹi is the newly assigned label after regularization
and δ is the Kronecker δ function.
CRF problems. The most important reason for us to use this optimization ap-
proach is its great computational efficiency, which renders it much faster than
conventional graph-cuts optimization techniques.
3 Results
3.1 Image Data
We evaluated our algorithm on images of 10 patients from the ContraCancrum
brain tumor database [7]. Each patient dataset consists of a T1 , T1c , T2 and T2f
image. The images were resampled to an in-slice resolution of 1mm (210x210 to
260x260 voxels), the inter-slice spacing was between 3mm and 6mm depending
on the dataset (19 to 25 slices). Figure 1 shows an axial slice of one patient with
the four MRI modalities we used.
Fig. 1. Axial slice of one patient. The four MRI modalities under study are shown,
from left to right: the T1 , T1c , T2 and T2f image.
3.2 Evaluation
For quantitative evaluation of the results we use the Dice similarity coefficient.
The ground truth was defined by a manual segmentation. However, the manual
1
available from:
http://www.istb.unibe.ch/content/surgical technologies/medical image analysis/software/
6
segmentation was only available for the tumor tissues, not for the healthy tis-
sues. Therefore only the accuracy of the tumor segmentation could be evaluated
quantitatively. The accuracy of the segmentation of healthy tissues could only
be rated qualitatively by visual inspection.
We subdivided our results into intra- and interpatient case. For the intrap-
atient case, the classifier was trained on small subregions of the relevant areas
in the same patient, while in the interpatient case we performed leave-one-out
cross-validation. Additionally, in both cases, we compared our proposed hierar-
chical, regularized approach with a non-hierarchical classification, that does not
comprise the proposed two-step CRF-regularization.
Figure 2 presents the segmentation result in case of interpatient training us-
ing leave-one-out cross-validation for the same slice as shown in figure 1. The
gross tumor volume (GTV), comprising all tumor tissues, is well delineated,
although the segmentation appears noisy if no regularization is applied. Classi-
fying the tumor subregions appears to be more challenging, but our proposed
approach performs still better than the method without hierarchy and regu-
larization. From a visual inspection, the segmentation of healthy tissues (CSF,
GM, WM) appears to be reasonable in all cases, but again the result has many
outliers if no regularization is applied.
Fig. 2. Results for one axial slice of the interpatient leave-one-out case. First row: coarse
classification into tumor and healthy tissues, second row: fine classification into necrotic,
active, edema part and CSF, GM, WM respectively. From left to right: manual seg-
mentation, hierarchical SVM-classification with CRF-regularization, non-hierarchical
SVM-classification without regularization.
one-out case when the GTV is considered. Dice coefficients for the individual
tumor subregions are lower. However, the results show once again that the Dice
coefficient is clearly worse when no hierarchical regularization is applied.
Table 1. Dice similarity coefficient (mean and standard deviation) for the ten datasets
under study. Gross tumor volume (GTV) comprises the complete tumor region, includ-
ing active, necrotic and edema part. Results are shown for the intra- and interpatient
(leave-one-out cross-validation) case, with and without regularization.
segmentation results for healthy tissues as well as for tumor tissues appear much
noisier when no CRF regularization is applied and inconsistencies of adjacent
slices can be observed.
In contrast to other methods, our proposed segmentation approach yields the
tumor and healthy region, as well as all their subregions. It adds powerful spatial
regularization to iid classification and runs fast on clinically relevant data.
References
1. Boykov, Y., Jolly, M.: Interactive graph cuts for optimal boundary & region seg-
mentation of objects in ND images. In: Proc. ICCV. pp. 105–112. IEEE (2002)
2. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001),
http://www.csie.ntu.edu.tw/~cjlin/libsvm
3. Corso, J., Sharon, E., Dube, S., El-Saden, S., Sinha, U., Yuille, A.: Efficient multi-
level brain tumor segmentation with integrated bayesian model classification. IEEE
T Med Imaging 27(5), 629–640 (2008)
4. Ibanez, L., Schroeder, W., Ng, L., Cates, J.: The ITK Software Guide. Kitware,
http://www.itk.org/ItkSoftwareGuide.pdf, 2nd edn. (2005)
5. Komodakis, N., Tziritas, G., Paragios, N.: Performance vs computational efficiency
for optimizing single and dynamic MRFs: Setting the state of the art with primal-
dual strategies. Comput Vis Image Und 112(1), 14–29 (2008)
6. Lee, C., Wang, S., Murtha, A., Brown, M., Greiner, R.: Segmenting Brain Tumors
using Pseudo–Conditional Random Fields. In: Metaxas, D. Axel, L., Fichtinger, G.,
Szekely, G. (eds.) MICCAI 2008. LNCS, vol. 5242, pp. 359–366. Springer (2008)
7. Marias, K., Dionysiou, D., Sakkalis, V., Graf, N., Bohle, R.M., Coveney, P.V., Wan,
S., Folarin, A., Buechler, P., Reyes, M., Clapworthy, G., Liu, E., Sabczynski, J.,
Bily, T., Roniotis, A., Tsiknakis, M., Giatili, S., Veith, C., Messe, E., Stenzhorn,
H., Kim, Y.j., Zasada, S., Haidar, A.N., Bauer, S., Wang, T., Zhao, Y., Karasek,
M., Grewer, R., Franz, A., Stamatakos, G.: Clinically driven design of multi-scale
cancer models : the ContraCancrum project paradigm. J Roy Soc Interface Focus
1(3), 450–461 (2011)
8. Mazzara, G., Velthuizen, R., Pearlman, J., Greenberg, H., Wagner, H.: Brain tumor
target volume determination for radiation treatment planning through automated
MRI segmentation. Int. J. Radiat. Oncol., Biol. Phys. 59(1), 300–312 (2004)
9. Schoelkopf, B., Smola, A.: Learning with kernels: support vector machines, regu-
larization, optimization, and beyond. MIT Press (2002)
10. Tuceryan, M., Jain, A.: Texture analysis, pp. 207–248. World Scientific Publishing,
2nd edn. (1998)
11. Verma, R., Zacharaki, E., Ou, Y., Cai, H., Chawla, S., Lee, S., Melhem, E., Wolf,
R., Davatzikos, C.: Multiparametric Tissue Characterization of Brain Neoplasms
and Their Recurrence Using Pattern Classification of MR Images. Acad. Radiol.
15(8), 966–977 (2008)
12. Wels, M., Carneiro, G., Aplas, A., Huber, M., Hornegger, J., Comaniciu, D.: A Dis-
criminative Model-Constrained Graph Cuts Approach to Fully Automated Pedi-
atric Brain Tumor Segmentation in 3-D MRI. In: Metaxas, D. Axel, L., Fichtinger,
G., Szekely, G. (eds.) MICCAI 2008. LNCS, vol. 5242, pp. 67–75. Springer (2008)