A Localized Statistical Motion Model as a Reproducing Kernel for Non-rigid Image Registration

Jud, Christoph; Giger, Alina; Sandkühler, Robin; Cattin, Philippe C.

doi:10.1007/978-3-319-66185-8_30

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10434))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

9974 Accesses
3 Altmetric

Abstract

Thoracic image registration forms the basis for many applications as for example respiratory motion estimation and physiological investigations of the lung. Although clear motion patterns are shared among different subjects, such as the diaphragm moving in superior and inferior direction, in current image registration methods such basic prior knowledge is not considered. In this paper, we propose a novel approach for integrating a statistical motion model (SMM) into a parametric non-rigid registration framework. We formulate the SMM as a reproducing kernel and integrate it into a kernel machine for image registration. Since empirical samples are rare and statistical models built from small sample size are usually over-restrictive we localize the SMM by damping spatial long-range correlations and reduce the model bias by adding generic transformations to the SMM. As an example, we show our methods applicability on the example of the Dirlab 4DCT lung images where we build leave-one-out models for estimating the respiratory motion.

You have full access to this open access chapter, Download conference paper PDF

Spatial patterns and frequency distributions of regional deformation in the healthy human lung

Article 18 March 2017

Statistical Motion Mask and Sliding Registration

Deformation Estimation with Automatic Sliding Boundary Computation

Keywords

1 Introduction

Thoracic motion estimation is central for the analysis of respiratory dynamics or the physiology of abdominal organs as for example the lung. It is usually performed by non-rigid registration of images captured at different time points e.g. at an inhalation and an exhalation state. A main challenge which arises in this scenario are organs which slide along each other causing discontinuous changes in correspondence. At sliding organ boundaries, therefore, a high degree of freedom is required to express discontinuities in the spatial mapping. However, this is opposed to within organ regions where smooth deformations are presumed, which are usually achieved by reducing the degrees of freedom of the admissible transformations.

In this paper, we integrate a low-dimensional statistical motion model (SMM) as transformation model into the registration which already accounts for the discontinuous correspondence changes. The idea is that the SMM is built out of empirical motion fields, from exhalation to inhalation state, which are derived in a controlled semi-automatic setup where for example landmarks and image masks are applied in order to deal with discontinuities. The SMM is brought into correspondence with the subject of interest where no landmarks or masks are available. Thus, the learned motion patterns containing the characteristic discontinuities at sliding organ boundaries can be transferred to the subject of interest to finally perform the registration.

Discontinuity preserving registration approaches have gained increasing attention in literature starting from semi-automatic approaches [13] where moving organs are segmented and separately registered, to approaches with image-dependent inhomogeneous smoothness priors [5, 9] or approaches with sparse regularizers [14, 15], and motion segmentation approaches [12]. None of the approaches considers statistical knowledge about the respiratory motion.

In [3, 7, 11], PCA-based motion models are proposed for mean-motion based diagnosis and model-based shape prediction. In such models, each transformation lies within the linear span of the empirical motion patterns. In [8], localized and bias reduced statistical models were introduced with the focus on inter-subject registration. However, these richer models need to be approximated by an orthogonal basis in order to be fitted to the images. As the eigenvalues slowly decrease when modeling local deformations such an approximation becomes infeasible and the number of basis functions to store exceeds standard memory capacities.

The contribution of this paper is the integration of an SMM as reproducing kernel into image registration. In the registration, only correlations between image points are considered which allows to localize the SMM and to reduce an over-restrictive model bias without the need of a model basis approximation.

2 Background

In this section, we recap the kernel-framework for image registration which was elaborated in [5, 6] and borrow the notation used therein. Given a reference and target image which map the d-dimensional input domain to intensity values, and given a spatial mapping which transforms the reference coordinate system, image registration is performed by optimizing

$$\begin{aligned} \mathop {\hbox {arg min}}\limits _u \int _\mathcal {X} \mathcal {L}\left( I_R\left( x+u\left( x\right) \right) ,I_T\left( x\right) \right) dx + \eta \mathcal {R}[u], \end{aligned}$$

(1)

where $\mathcal {L}$ is a loss-function which quantifies the matching between the transformed reference and the target image, $\mathcal {R}$ is a regularization term which enforces additional criteria on u and $\eta $ is a trade-off parameter. As transformation model a reproducing kernel Hilbert space (RKHS) is defined

(2)

where is a reproducing kernel and $\Vert \cdot \Vert _\mathcal {H}$ is the RKHS norm. For more details about kernel methods we refer to [4]. In [5], the existence of a finite dimensional solution to Eq. 1 was shown applying a regularization term operating solely on the finite many parameters $c:=\{c_i\}_{i=1}^N$

$$\begin{aligned} \mathop {\hbox {arg min}}\limits _{u\in \mathcal {H}}\sum _{i=1}^N\mathcal {L}\Bigg (I_R\Bigg (x_i+\sum _{j=1}^N k(x_i,x_j)c_j\Bigg ),I_T(x_i)\Bigg )+\eta \cdot g\left( p\left( c\right) \right) , \end{aligned}$$

(3)

for N pair-wise distinct sampled domain points $x_i$ and a regularizer comprising a strictly increasing function and a function which is weakly semi-continuous and bounded from below. Examples are the non-informative regularizer $\mathcal {R}_2$ or the homogeneity favoring radial differences regularizer $\mathcal {R}_{rd}$

$$\begin{aligned} \mathcal {R}_2 = \sum _i \Vert c_i \Vert _2,\quad \mathcal {R}_{rd} = \sum _{i,j} \Vert c_i-c_j\Vert ^2 k(x_i,x_j). \end{aligned}$$

(4)

3 Method

In the following, we distinguish between correspondence fields which match images of different subjects and motion fields which match exhalations and inhalation images of the same subject. We first formulate a model of motion fields and afterwards we need the correspondence fields for building the SMM (see Sect. 3.2).

3.1 Statistical Motion Model

Suppose we are given some sample transformations $F:=\{f_i\}_{i=1}^n$ which are in correspondence and known to be useful for the registration of exhalation and inhalation images. Based on the central limit theorem, we model F by assuming a Gaussian process over the transformations $f_i$. We estimate the mean function and the matrix-valued covariance function

$$\begin{aligned} \mu _F(x) = \frac{1}{n}\sum _{i=1}^n f_i(x),\quad k_F(x,y) = \frac{1}{n-1}\sum _{i=1}^n (f_i - \mu _F)(x)(f_i-\mu _F)(y)^T. \end{aligned}$$

(5)

We adjust the transformation model as follows

$$\begin{aligned} f(x) = \mu _F(x) + \sum _{i=1}^N k_F(x,x_i)c_i. \end{aligned}$$

(6)

Thus, the transformation model for the motion estimation yields transformations f which are linear combinations of the sample transformations at a point x.

Note that the complexity of Eq. 3 is $\mathcal {O}(N^2)$ kernel evaluations which makes the optimization problem computationally intensive for 3d medical images. In addition, the evaluation of $k_F$ requires a sum over all samples $f_i$.

Dimensionality Reduction. To reduce the sum in $k_F$ we rewrite the kernel in its Mercer’s expansion

$$\begin{aligned} k_F(x,y) = \sum _{i=1}^{\infty }\lambda _i\phi _i(x)\phi _i(y)^T, \end{aligned}$$

(7)

where $\lambda _i\ge \lambda _{i+1}\ge 0$ and $i>n\Leftrightarrow \lambda _i=0$. The basis functions $\phi _i$ are orthonormal. We approximate the kernel in Eq. 7 by truncating the sum

$$\begin{aligned} k_\mathcal {M}(x,y) = \sum _{i=1}^{p} \psi _i(x)\psi _i(y)^T, \end{aligned}$$

(8)

where $\psi _i=\sqrt{\lambda _i}\phi _i$ and $p = \max \{i\vert \lambda _i>\theta \}$. In Eq. 7, $\lambda _i$ and $\phi _i$ are the eigenvalue/eigenfunction pairs of the Hilbert-Schmidt integral operator of $k_F$. Thus, the basis functions $\psi _i$ are the principal modes of variation of the sample F. The amount of variation kept by considering p basis functions is therefore maximal when using the first p orthogonal functions $\psi _i$.

Locality. The SMM kernel $k_\mathcal {M}$ has infinite support. That means, for each x, y pair, $k_\mathcal {M}$ yields a possibly non-zero value. In the following, we damp the correlation between two points with respect to the Euclidean distance between them in order to reduce the support range. Using the Wendland kernel [6]

$$\begin{aligned} k_W(x,y) = \omega _{3,2}\left( \frac{\Vert x-y\Vert }{\sigma }\right) ,\quad \omega _{3,2}(r) = (1-r)^6_+ \frac{3+18r+35r^2}{1680} \end{aligned}$$

(9)

with $a_+=\max (0,a)$ and $\sigma >0$ which is a compactly supported kernel we derive

$$\begin{aligned} k(x,y) = \sigma _\mathcal {M}k_\mathcal {M}(x,y) \cdot \sigma _\omega k_W(x,y)+\sigma _s\mathbf {I}_{d\times d} k_W(x,y) \end{aligned}$$

(10)

with the d-dimensional identity matrix $\mathbf {I}$ and scaling parameters $\sigma _\mathcal {M}>0,\sigma _\omega >0,\sigma _s\ge 0$. The effect of this manipulation (Eq. 10) to the SMM $k_\mathcal {M}$ is two-fold. First, the quadratic complexity can be overcome since k is now compact with a support $\sigma $, and second the model is enhanced in a way that f is no longer in the strict linear span of the samples. Nonetheless, it is locally a linear combination of the samples (when setting $\sigma _s=0$).

With a small sample size n, even a localized model tend to be over-restrictive. In order to reduce this restrictive model bias, we add a Wendland kernel in Eq. 10 where the scale can be controlled with $\sigma _s$.

Scaling. If we zero-out correlation values $k_\mathcal {M}(x,y)$ the remaining scale of the transformation f is damped as well. Therefore, the scaling factors $\sigma _\mathcal {M},\sigma _\omega $ have to be chosen appropriately

$$\begin{aligned} \sigma _\mathcal {M} := \sum _{i=1}^N \Vert k_\mathcal {M}(x_i,x_i)\Vert _F,\quad \sigma _\omega :=\Bigg \{\frac{34650}{4\pi \sigma ^3}~\text {if}~d=3,~\frac{10080}{2\pi \sigma ^2}~\text {if}~d=2\Bigg \}, \end{aligned}$$

(11)

where $\Vert \cdot \Vert _F$ is the Frobenius norm. The scale $\sigma _\mathcal {M}$ is a heuristic estimate of the expected scale of the transformation. The scale of the Wendland kernel $\sigma _\omega $ is chosen such that it integrates to one within its support. The Wendland kernel thus acts as a weighted average of $k_\mathcal {M}$.

3.2 Model Construction

The goal in this paper is to finally guide the motion estimation for a subject of interest $S_j$ with an SMM built from motion fields of other subjects $S_{i}$ with $i\ne j$. The motion fields $f_i$ have to be in correspondence with $S_j$ in order to be comparable and thus for actually building the SMM. In Fig. 1, the relation between the different subjects is illustrated.

Let an exhalation and inhalation image $I^E,I^I$ be given for each subject. Furthermore, let the sample motion fields $f_i$ be derived in a controlled setup. That means, they can be semi-automatically derived by registration of $I^E_i$ and $I^I_i$ including manual ground truth landmarks and image masks etc. The correspondence to the subject $S_j$ is now derived by registration of the exhalation images $I^E_i$ to the exhalation image $I^E_j$ yielding the correspondence fields $u_i$. Having given the correspondence fields $u_i$, the motion fields $f_i$ can be warped to the coordinate system of $S_j$. Note that for a motion field warp the inverse of the correspondence field is needed (see Fig. 2). In our case, we approximate the inverse correspondence field with the fixed-point iteration proposed in [2].

4 Experiments

We tested our method on the Dirlab^{Footnote 1} data set [1] comprising 10 subjects with an inhalation/exhalation 3d CT image of the thorax each. For evaluation, 300 ground truth landmarks are provided. We use the leave-one-out setup shown in Fig. 1. The exhalation images $I^E_i$ are first brought into correspondence with $I^E_j$ in three steps. First, the rib cages are threshold segmented at 1150 HU of smoothed versions of $I^E$ and rigidly registered using the dice coefficient as image metric. Second, the rib cage segmentations are dilated and non-rigidly pre-registered using Eq. 3 applying again the dice metric, no regularization and a Wendland kernel $k_W$. Finally, the images are non-rigidly registered using Eq. 3 applying the normalized cross-correlation (NCC) metric and the regularizer $\mathcal {R}_{rd}$, again with $k_W$. In this step, we cropped the images to a region of interest and used threshold segmented body masks to exclude the background.

The sample motion fields $f_i$ are derived on three scale levels again using Eq. 3 with the NCC metric, the $\mathcal {R}_{rd}$ as regularizer and $k_W$. Additionally, a landmark cost-term was added in order to guide the registration with the 300 landmarks. Semi-automatically derived lung masks are used to consider only lung regions in the image metric.

The semi-automatically derived $f_i$ are warped by the fully automatically derived $u_i$ in order to build the SMM. Finally, the exhalation/inhalation images $I^E_j,I^I_j$ are non-rigidly registered using Eq. 3, applying the localized and bias reduced kernel k of Eq. 10 and the non-informative regularizer $\mathcal {R}_2$. Again, three scale levels where used, where k is applied only on the first level. On the remaining levels $k_W$ is used. We empirically set $\eta =\{{1}\mathrm {e}{-7},{1}\mathrm {e}{-6},{1}\mathrm {e}{-6}\}$, $\sigma =\{100,80,40\}$ and $\sigma _S={2}\mathrm {e}{-3}$ and used the same values for all cases. The orthogonal basis $\psi _i(x)$ is numerically derived using the Singular Value Decomposition of the sample data matrix A where $a_{ij}=f_j(x_i)$. For optimizing Eq. 3, we perform averaged stochastic gradient descent [10] on the analytically derived derivative.

In Fig. 3, an example of a mean transformation, an SMM registration (only first level) and a final registration result are shown. A clear discontinuous change in the motion field can be identified between the thoracic cavity and the lung. In Table 1, quantitative measures are provided. This experiment shows that our method achieves reasonable registration results which are on average 0.5 mm close to the intra-observer error (IOE). Since the Maxwell-Boltzmann (MB) distribution is more appropriate to model TREs, we additionally provide the expected TRE and variance of a fitted MB distribution. A complete comparison with the Dirlab benchmark considering the full landmark sets remains.

Table 1. Expected TRE [mm] of 300 landmarks. IOE: intra-observer error (on all landmarks) taken from [1]. Dirlab: best performing results in snap-to-voxel (sv) TRE, where no masking was used and the TRE was evaluated on 300 landmarks (13.2.2017). The results of our method are listed in the right three columns.

Full size table

5 Conclusion

We presented a method for modeling statistical knowledge about motion patterns which can be integrated into image registration in order to estimate thoracic motion. In contrast to standard linear motion models our model is formulated as a reproducing kernel and integrated in the kernel framework for image registration. This allows to apply localized and bias reduced SMMs without the need of a basis approximation. With the leave-one-out models which we applied to the Dirlab data set, we presented an example of how such SMMs can be built and that they achieve reasonable registration performance. We think that our method opens the possibility for other types of SMMs which are built e.g. in a group-wise manner.

Notes

1.
https://www.dir-lab.com/.

References

Castillo, E., Castillo, R., Martinez, J., Shenoy, M., Guerrero, T.: Four-dimensional deformable image registration using trajectory modeling. Phys. Med. Biol. 55(1), 305 (2009)
Article Google Scholar
Chen, M., Lu, W., Chen, Q., Ruchala, K.J., Olivera, G.H.: A simple fixed-point approach to invert a deformation field. Med. Phys. 35(1), 81–88 (2008)
Article Google Scholar
Ehrhardt, J., Werner, R., Schmidt-Richberg, A., Handels, H.: Statistical modeling of 4D respiratory lung motion using diffeomorphic image registration. IEEE Trans. Med. Imaging 30(2), 251–265 (2011)
Article Google Scholar
Hofmann, T., Schölkopf, B., Smola, A.J.: Kernel methods in machine learning. Ann. Stat. 36, 1171–1220 (2008)
Article MathSciNet Google Scholar
Jud, C., Möri, N., Bitterli, B., Cattin, P.C.: Bilateral regularization in reproducing kernel Hilbert spaces for discontinuity preserving image registration. In: Wang, L., Adeli, E., Wang, Q., Shi, Y., Suk, H.-I. (eds.) MLMI 2016. LNCS, vol. 10019, pp. 10–17. Springer, Cham (2016). doi:10.1007/978-3-319-47157-0_2
Chapter Google Scholar
Jud, C., Möri, N., Cattin, P.C.: Sparse kernel machines for discontinuous registration and nonstationary regularization. In: Proceedings of the International Workshop on Biomedical Image Registration, pp. 9–16 (2016)
Google Scholar
Jud, C., Preiswerk, F., Cattin, P.C.: Respiratory motion compensation with topology independent surrogates. In: Workshop on Imaging and Computer Assistance in Radiation Therapy (2015)
Google Scholar
Lüthi, M., Jud, C., Vetter, T.: A unified approach to shape model fitting and non-rigid registration. In: Wu, G., Zhang, D., Shen, D., Yan, P., Suzuki, K., Wang, F. (eds.) MLMI 2013. LNCS, vol. 8184, pp. 66–73. Springer, Cham (2013). doi:10.1007/978-3-319-02267-3_9
Chapter Google Scholar
Pace, D.F., Aylward, S.R., Niethammer, M.: A locally adaptive regularization based on anisotropic diffusion for deformable image registration of sliding organs. IEEE Trans. Med. Imaging 32(11), 2114–2126 (2013)
Article Google Scholar
Polyak, B.T., Juditsky, A.B.: Acceleration of stochastic approximation by averaging. SIAM J. Control Optim. 30(4), 838–855 (1992)
Article MathSciNet Google Scholar
Preiswerk, F., De Luca, V., Arnold, P., Celicanin, Z., Petrusca, L., Tanner, C., Bieri, O., Salomir, R., Cattin, P.C.: Model-guided respiratory organ motion prediction of the liver from 2D ultrasound. Med. Image Anal. 18(5), 740–751 (2014)
Article Google Scholar
Preston, J.S., Joshi, S., Whitaker, R.: Deformation estimation with automatic sliding boundary computation. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) MICCAI 2016. LNCS, vol. 9902, pp. 72–80. Springer, Cham (2016). doi:10.1007/978-3-319-46726-9_9
Chapter Google Scholar
Risser, L., Vialard, F.X., Baluwala, H.Y., Schnabel, J.A.: Piecewise-diffeomorphic image registration: application to the motion estimation between 3D CT lung images with sliding conditions. Med. Image Anal. 17(2), 182–193 (2013)
Article Google Scholar
Shi, W., Jantsch, M., Aljabar, P., Pizarro, L., Bai, W., Wang, H.: ORegan, D., Zhuang, X., Rueckert, D.: Temporal sparse free-form deformations. Med. Image Anal. 17(7), 779–789 (2013)
Article Google Scholar
Vishnevskiy, V., Gass, T., Szekely, G., Tanner, C., Goksel, O.: Isotropic total variation regularization of displacements in parametric image registration. IEEE Trans. Med. Imaging 36, 385–395 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Biomedical Engineering, University of Basel, Allschwil, Switzerland
Christoph Jud, Alina Giger, Robin Sandkühler & Philippe C. Cattin

Authors

Christoph Jud
View author publications
You can also search for this author in PubMed Google Scholar
Alina Giger
View author publications
You can also search for this author in PubMed Google Scholar
Robin Sandkühler
View author publications
You can also search for this author in PubMed Google Scholar
Philippe C. Cattin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christoph Jud .

Editor information

Editors and Affiliations

Université de Sherbrooke, Sherbrooke, QC, Canada
Maxime Descoteaux
DKFZ, Heidelberg, Germany
Lena Maier-Hein
Ulm University of Applied Sciences, Ulm, Germany
Alfred Franz
Université de Rennes 1, Rennes, France
Pierre Jannin
McGill University, Montreal, QC, Canada
D. Louis Collins
Université Laval, Québec, QC, Canada
Simon Duchesne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jud, C., Giger, A., Sandkühler, R., Cattin, P.C. (2017). A Localized Statistical Motion Model as a Reproducing Kernel for Non-rigid Image Registration. In: Descoteaux, M., Maier-Hein, L., Franz, A., Jannin, P., Collins, D., Duchesne, S. (eds) Medical Image Computing and Computer-Assisted Intervention − MICCAI 2017. MICCAI 2017. Lecture Notes in Computer Science(), vol 10434. Springer, Cham. https://doi.org/10.1007/978-3-319-66185-8_30

Download citation

DOI: https://doi.org/10.1007/978-3-319-66185-8_30
Published: 04 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66184-1
Online ISBN: 978-3-319-66185-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

A Localized Statistical Motion Model as a Reproducing Kernel for Non-rigid Image Registration