Abstract
Thoracic image registration forms the basis for many applications as for example respiratory motion estimation and physiological investigations of the lung. Although clear motion patterns are shared among different subjects, such as the diaphragm moving in superior and inferior direction, in current image registration methods such basic prior knowledge is not considered. In this paper, we propose a novel approach for integrating a statistical motion model (SMM) into a parametric non-rigid registration framework. We formulate the SMM as a reproducing kernel and integrate it into a kernel machine for image registration. Since empirical samples are rare and statistical models built from small sample size are usually over-restrictive we localize the SMM by damping spatial long-range correlations and reduce the model bias by adding generic transformations to the SMM. As an example, we show our methods applicability on the example of the Dirlab 4DCT lung images where we build leave-one-out models for estimating the respiratory motion.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Thoracic motion estimation is central for the analysis of respiratory dynamics or the physiology of abdominal organs as for example the lung. It is usually performed by non-rigid registration of images captured at different time points e.g. at an inhalation and an exhalation state. A main challenge which arises in this scenario are organs which slide along each other causing discontinuous changes in correspondence. At sliding organ boundaries, therefore, a high degree of freedom is required to express discontinuities in the spatial mapping. However, this is opposed to within organ regions where smooth deformations are presumed, which are usually achieved by reducing the degrees of freedom of the admissible transformations.
In this paper, we integrate a low-dimensional statistical motion model (SMM) as transformation model into the registration which already accounts for the discontinuous correspondence changes. The idea is that the SMM is built out of empirical motion fields, from exhalation to inhalation state, which are derived in a controlled semi-automatic setup where for example landmarks and image masks are applied in order to deal with discontinuities. The SMM is brought into correspondence with the subject of interest where no landmarks or masks are available. Thus, the learned motion patterns containing the characteristic discontinuities at sliding organ boundaries can be transferred to the subject of interest to finally perform the registration.
Discontinuity preserving registration approaches have gained increasing attention in literature starting from semi-automatic approaches [13] where moving organs are segmented and separately registered, to approaches with image-dependent inhomogeneous smoothness priors [5, 9] or approaches with sparse regularizers [14, 15], and motion segmentation approaches [12]. None of the approaches considers statistical knowledge about the respiratory motion.
In [3, 7, 11], PCA-based motion models are proposed for mean-motion based diagnosis and model-based shape prediction. In such models, each transformation lies within the linear span of the empirical motion patterns. In [8], localized and bias reduced statistical models were introduced with the focus on inter-subject registration. However, these richer models need to be approximated by an orthogonal basis in order to be fitted to the images. As the eigenvalues slowly decrease when modeling local deformations such an approximation becomes infeasible and the number of basis functions to store exceeds standard memory capacities.
The contribution of this paper is the integration of an SMM as reproducing kernel into image registration. In the registration, only correlations between image points are considered which allows to localize the SMM and to reduce an over-restrictive model bias without the need of a model basis approximation.
2 Background
In this section, we recap the kernel-framework for image registration which was elaborated in [5, 6] and borrow the notation used therein. Given a reference and target image which map the d-dimensional input domain
to intensity values, and given a spatial mapping
which transforms the reference coordinate system, image registration is performed by optimizing
where \(\mathcal {L}\) is a loss-function which quantifies the matching between the transformed reference and the target image, \(\mathcal {R}\) is a regularization term which enforces additional criteria on u and \(\eta \) is a trade-off parameter. As transformation model a reproducing kernel Hilbert space (RKHS) is defined

where is a reproducing kernel and \(\Vert \cdot \Vert _\mathcal {H}\) is the RKHS norm. For more details about kernel methods we refer to [4]. In [5], the existence of a finite dimensional solution to Eq. 1 was shown applying a regularization term operating solely on the finite many parameters \(c:=\{c_i\}_{i=1}^N\)
for N pair-wise distinct sampled domain points \(x_i\) and a regularizer comprising a strictly increasing function and a function
which is weakly semi-continuous and bounded from below. Examples are the non-informative regularizer \(\mathcal {R}_2\) or the homogeneity favoring radial differences regularizer \(\mathcal {R}_{rd}\)
3 Method
In the following, we distinguish between correspondence fields which match images
of different subjects and motion fields
which match exhalations and inhalation images
of the same subject. We first formulate a model of motion fields and afterwards we need the correspondence fields for building the SMM (see Sect. 3.2).
3.1 Statistical Motion Model
Suppose we are given some sample transformations \(F:=\{f_i\}_{i=1}^n\) which are in correspondence and known to be useful for the registration of exhalation and inhalation images. Based on the central limit theorem, we model F by assuming a Gaussian process over the transformations \(f_i\). We estimate the mean function and the matrix-valued covariance function
We adjust the transformation model as follows
Thus, the transformation model for the motion estimation yields transformations f which are linear combinations of the sample transformations at a point x.
Note that the complexity of Eq. 3 is \(\mathcal {O}(N^2)\) kernel evaluations which makes the optimization problem computationally intensive for 3d medical images. In addition, the evaluation of \(k_F\) requires a sum over all samples \(f_i\).
Dimensionality Reduction. To reduce the sum in \(k_F\) we rewrite the kernel in its Mercer’s expansion
where \(\lambda _i\ge \lambda _{i+1}\ge 0\) and \(i>n\Leftrightarrow \lambda _i=0\). The basis functions \(\phi _i\) are orthonormal. We approximate the kernel in Eq. 7 by truncating the sum
where \(\psi _i=\sqrt{\lambda _i}\phi _i\) and \(p = \max \{i\vert \lambda _i>\theta \}\). In Eq. 7, \(\lambda _i\) and \(\phi _i\) are the eigenvalue/eigenfunction pairs of the Hilbert-Schmidt integral operator of \(k_F\). Thus, the basis functions \(\psi _i\) are the principal modes of variation of the sample F. The amount of variation kept by considering p basis functions is therefore maximal when using the first p orthogonal functions \(\psi _i\).
Locality. The SMM kernel \(k_\mathcal {M}\) has infinite support. That means, for each x, y pair, \(k_\mathcal {M}\) yields a possibly non-zero value. In the following, we damp the correlation between two points with respect to the Euclidean distance between them in order to reduce the support range. Using the Wendland kernel [6]
with \(a_+=\max (0,a)\) and \(\sigma >0\) which is a compactly supported kernel we derive
with the d-dimensional identity matrix \(\mathbf {I}\) and scaling parameters \(\sigma _\mathcal {M}>0,\sigma _\omega >0,\sigma _s\ge 0\). The effect of this manipulation (Eq. 10) to the SMM \(k_\mathcal {M}\) is two-fold. First, the quadratic complexity can be overcome since k is now compact with a support \(\sigma \), and second the model is enhanced in a way that f is no longer in the strict linear span of the samples. Nonetheless, it is locally a linear combination of the samples (when setting \(\sigma _s=0\)).
With a small sample size n, even a localized model tend to be over-restrictive. In order to reduce this restrictive model bias, we add a Wendland kernel in Eq. 10 where the scale can be controlled with \(\sigma _s\).
Scaling. If we zero-out correlation values \(k_\mathcal {M}(x,y)\) the remaining scale of the transformation f is damped as well. Therefore, the scaling factors \(\sigma _\mathcal {M},\sigma _\omega \) have to be chosen appropriately
where \(\Vert \cdot \Vert _F\) is the Frobenius norm. The scale \(\sigma _\mathcal {M}\) is a heuristic estimate of the expected scale of the transformation. The scale of the Wendland kernel \(\sigma _\omega \) is chosen such that it integrates to one within its support. The Wendland kernel thus acts as a weighted average of \(k_\mathcal {M}\).
3.2 Model Construction
The goal in this paper is to finally guide the motion estimation for a subject of interest \(S_j\) with an SMM built from motion fields of other subjects \(S_{i}\) with \(i\ne j\). The motion fields \(f_i\) have to be in correspondence with \(S_j\) in order to be comparable and thus for actually building the SMM. In Fig. 1, the relation between the different subjects is illustrated.
Let an exhalation and inhalation image \(I^E,I^I\) be given for each subject. Furthermore, let the sample motion fields \(f_i\) be derived in a controlled setup. That means, they can be semi-automatically derived by registration of \(I^E_i\) and \(I^I_i\) including manual ground truth landmarks and image masks etc. The correspondence to the subject \(S_j\) is now derived by registration of the exhalation images \(I^E_i\) to the exhalation image \(I^E_j\) yielding the correspondence fields \(u_i\). Having given the correspondence fields \(u_i\), the motion fields \(f_i\) can be warped to the coordinate system of \(S_j\). Note that for a motion field warp the inverse of the correspondence field is needed (see Fig. 2). In our case, we approximate the inverse correspondence field with the fixed-point iteration proposed in [2].
4 Experiments
We tested our method on the DirlabFootnote 1 data set [1] comprising 10 subjects with an inhalation/exhalation 3d CT image of the thorax each. For evaluation, 300 ground truth landmarks are provided. We use the leave-one-out setup shown in Fig. 1. The exhalation images \(I^E_i\) are first brought into correspondence with \(I^E_j\) in three steps. First, the rib cages are threshold segmented at 1150 HU of smoothed versions of \(I^E\) and rigidly registered using the dice coefficient as image metric. Second, the rib cage segmentations are dilated and non-rigidly pre-registered using Eq. 3 applying again the dice metric, no regularization and a Wendland kernel \(k_W\). Finally, the images are non-rigidly registered using Eq. 3 applying the normalized cross-correlation (NCC) metric and the regularizer \(\mathcal {R}_{rd}\), again with \(k_W\). In this step, we cropped the images to a region of interest and used threshold segmented body masks to exclude the background.
The sample motion fields \(f_i\) are derived on three scale levels again using Eq. 3 with the NCC metric, the \(\mathcal {R}_{rd}\) as regularizer and \(k_W\). Additionally, a landmark cost-term was added in order to guide the registration with the 300 landmarks. Semi-automatically derived lung masks are used to consider only lung regions in the image metric.
The semi-automatically derived \(f_i\) are warped by the fully automatically derived \(u_i\) in order to build the SMM. Finally, the exhalation/inhalation images \(I^E_j,I^I_j\) are non-rigidly registered using Eq. 3, applying the localized and bias reduced kernel k of Eq. 10 and the non-informative regularizer \(\mathcal {R}_2\). Again, three scale levels where used, where k is applied only on the first level. On the remaining levels \(k_W\) is used. We empirically set \(\eta =\{{1}\mathrm {e}{-7},{1}\mathrm {e}{-6},{1}\mathrm {e}{-6}\}\), \(\sigma =\{100,80,40\}\) and \(\sigma _S={2}\mathrm {e}{-3}\) and used the same values for all cases. The orthogonal basis \(\psi _i(x)\) is numerically derived using the Singular Value Decomposition of the sample data matrix A where \(a_{ij}=f_j(x_i)\). For optimizing Eq. 3, we perform averaged stochastic gradient descent [10] on the analytically derived derivative.
In Fig. 3, an example of a mean transformation, an SMM registration (only first level) and a final registration result are shown. A clear discontinuous change in the motion field can be identified between the thoracic cavity and the lung. In Table 1, quantitative measures are provided. This experiment shows that our method achieves reasonable registration results which are on average 0.5 mm close to the intra-observer error (IOE). Since the Maxwell-Boltzmann (MB) distribution is more appropriate to model TREs, we additionally provide the expected TRE and variance of a fitted MB distribution. A complete comparison with the Dirlab benchmark considering the full landmark sets remains.
5 Conclusion
We presented a method for modeling statistical knowledge about motion patterns which can be integrated into image registration in order to estimate thoracic motion. In contrast to standard linear motion models our model is formulated as a reproducing kernel and integrated in the kernel framework for image registration. This allows to apply localized and bias reduced SMMs without the need of a basis approximation. With the leave-one-out models which we applied to the Dirlab data set, we presented an example of how such SMMs can be built and that they achieve reasonable registration performance. We think that our method opens the possibility for other types of SMMs which are built e.g. in a group-wise manner.
Notes
References
Castillo, E., Castillo, R., Martinez, J., Shenoy, M., Guerrero, T.: Four-dimensional deformable image registration using trajectory modeling. Phys. Med. Biol. 55(1), 305 (2009)
Chen, M., Lu, W., Chen, Q., Ruchala, K.J., Olivera, G.H.: A simple fixed-point approach to invert a deformation field. Med. Phys. 35(1), 81–88 (2008)
Ehrhardt, J., Werner, R., Schmidt-Richberg, A., Handels, H.: Statistical modeling of 4D respiratory lung motion using diffeomorphic image registration. IEEE Trans. Med. Imaging 30(2), 251–265 (2011)
Hofmann, T., Schölkopf, B., Smola, A.J.: Kernel methods in machine learning. Ann. Stat. 36, 1171–1220 (2008)
Jud, C., Möri, N., Bitterli, B., Cattin, P.C.: Bilateral regularization in reproducing kernel Hilbert spaces for discontinuity preserving image registration. In: Wang, L., Adeli, E., Wang, Q., Shi, Y., Suk, H.-I. (eds.) MLMI 2016. LNCS, vol. 10019, pp. 10–17. Springer, Cham (2016). doi:10.1007/978-3-319-47157-0_2
Jud, C., Möri, N., Cattin, P.C.: Sparse kernel machines for discontinuous registration and nonstationary regularization. In: Proceedings of the International Workshop on Biomedical Image Registration, pp. 9–16 (2016)
Jud, C., Preiswerk, F., Cattin, P.C.: Respiratory motion compensation with topology independent surrogates. In: Workshop on Imaging and Computer Assistance in Radiation Therapy (2015)
Lüthi, M., Jud, C., Vetter, T.: A unified approach to shape model fitting and non-rigid registration. In: Wu, G., Zhang, D., Shen, D., Yan, P., Suzuki, K., Wang, F. (eds.) MLMI 2013. LNCS, vol. 8184, pp. 66–73. Springer, Cham (2013). doi:10.1007/978-3-319-02267-3_9
Pace, D.F., Aylward, S.R., Niethammer, M.: A locally adaptive regularization based on anisotropic diffusion for deformable image registration of sliding organs. IEEE Trans. Med. Imaging 32(11), 2114–2126 (2013)
Polyak, B.T., Juditsky, A.B.: Acceleration of stochastic approximation by averaging. SIAM J. Control Optim. 30(4), 838–855 (1992)
Preiswerk, F., De Luca, V., Arnold, P., Celicanin, Z., Petrusca, L., Tanner, C., Bieri, O., Salomir, R., Cattin, P.C.: Model-guided respiratory organ motion prediction of the liver from 2D ultrasound. Med. Image Anal. 18(5), 740–751 (2014)
Preston, J.S., Joshi, S., Whitaker, R.: Deformation estimation with automatic sliding boundary computation. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) MICCAI 2016. LNCS, vol. 9902, pp. 72–80. Springer, Cham (2016). doi:10.1007/978-3-319-46726-9_9
Risser, L., Vialard, F.X., Baluwala, H.Y., Schnabel, J.A.: Piecewise-diffeomorphic image registration: application to the motion estimation between 3D CT lung images with sliding conditions. Med. Image Anal. 17(2), 182–193 (2013)
Shi, W., Jantsch, M., Aljabar, P., Pizarro, L., Bai, W., Wang, H.: ORegan, D., Zhuang, X., Rueckert, D.: Temporal sparse free-form deformations. Med. Image Anal. 17(7), 779–789 (2013)
Vishnevskiy, V., Gass, T., Szekely, G., Tanner, C., Goksel, O.: Isotropic total variation regularization of displacements in parametric image registration. IEEE Trans. Med. Imaging 36, 385–395 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Jud, C., Giger, A., Sandkühler, R., Cattin, P.C. (2017). A Localized Statistical Motion Model as a Reproducing Kernel for Non-rigid Image Registration. In: Descoteaux, M., Maier-Hein, L., Franz, A., Jannin, P., Collins, D., Duchesne, S. (eds) Medical Image Computing and Computer-Assisted Intervention − MICCAI 2017. MICCAI 2017. Lecture Notes in Computer Science(), vol 10434. Springer, Cham. https://doi.org/10.1007/978-3-319-66185-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-66185-8_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66184-1
Online ISBN: 978-3-319-66185-8
eBook Packages: Computer ScienceComputer Science (R0)