Prediction of Clinical Scores for Subjective Cognitive Decline and Mild Cognitive Impairment

Li, Aojie; Yue, Ling; Liu, Manhua; Xiao, Shifu

doi:10.1007/978-3-030-32281-6_14

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11843))

Included in the following conference series:

International Workshop on PRedictive Intelligence In MEdicine

905 Accesses

Abstract

Mild cognitive impairment (MCI) is a neurological disorder that occurs in older adults involving cognitive impairments. It may occur as a transitional stage between normal aging and dementia such as Alzheimer’s disease (AD). Recent studies found that subjective cognitive decline (SCD) may be the early clinical precursor of dementia that precedes MCI. SCD individuals with normal cognition may already have some medial temporal lobe atrophy. This paper proposes a machine learning framework by combination of sparse coding and random forest to identify the informative biomarkers for prediction of clinical scores in SCD and MCI using structural magnetic resonance imaging (MRI). The volumetric features are computed from brain regions and the subregions of hippocampus and amygdala in MRIs. Then, sparse coding is applied to identify the relevant features. Finally, the proximity-based random forest is used to combine three sets of volumetric features and establish a regression model for predicting clinical scores. Our method has double feature selections to better explore the relevant features for prediction. Our method is evaluated with the T1-weighted structural MR images from 36 MCI, 112 SCD, 78 Normal Control (NC) subjects. The results demonstrate the effectiveness of proposed method.

You have full access to this open access chapter, Download conference paper PDF

Cognitive Function Assessment and Prediction for Subjective Cognitive Decline and Mild Cognitive Impairment

Article 07 September 2021

Early diagnosis of Alzheimer’s disease and mild cognitive impairment using MRI analysis and machine learning algorithms

Article Open access 18 December 2024

Self-weighted Multi-task Learning for Subjective Cognitive Decline Diagnosis

Keywords

1 Introduction

Mild cognitive impairment (MCI) is a neurological disorder that occurs in older adults involving cognitive impairments. It is often considered as the first clinical precursor of dementia such as Alzheimer’s disease (AD) when the individual exhibits lower performance on standard neuropsychological tests [1]. Recently, a few studies supported that subjective cognitive decline (SCD), which applies to the individuals with self-reported memory complaints, may be the first clinical marker of AD even before MCI [2]. It was shown to have the increased presence of AD biomarkers compared to those without SCD and be associated with a higher risk of progression to AD dementia [3]. Longitudinal studies found that SCD and MCI are associated with a similarly increased risk of AD and predicting rapid cognitive decline [4]. These findings support the idea that SCD may be an early clinical marker of AD that precedes MCI. In order to provide early intervention and delay significant impairment, identification of clinically and cognitively normal individuals who are at risk of AD dementia is very important, especially in the early stage of disease.

Magnetic resonance images (MRI) non-invasively capture the internal body structures, helping us understand the anatomical and functional brain changes related to AD [5]. Some studies have also found that hippocampal atrophy occurs before the onset of AD. A study investigated that SCD individuals have a pattern of hippocampal subfield atrophy similar to that measured in AD pathology when compared to healthy individuals without SCD [6]. The findings indicate the topographically similar changes of hippocampal subfields in SCD individuals as those found in AD. Recently, a study compared SCD with MCI and NC individuals using the volumes and asymmetries of hippocampus, amygdala and temporal horn, and to assess their relationships with cognitive function in elderly population in China [5]. In this study, significant differences (P < 0.05) were found in the volumes and asymmetries of both hippocampus and amygdala among the three groups using structural MR images.

The above studies mainly investigated the relationships between the brain atrophy and risk of dementia from SCD, MCI and potential AD through structural MRIs. However, these methods have limitations in exploring the multiple factors on the risk of dementia. With the popularity of machine learning technologies, various methods have been investigated for MR image analysis to find the relevant biomarkers in prediction and analysis of diseases [7]. In addition to the assessment of dementia conditions with sMRI, MMSE and MoCA are often used for initial screening of various types of cognitive impairment and dementia. In fact, NC group has the highest average score in both MMSE and MoCA tests, while these cognitive scores are decreased with the dementia development from SCD, SMCI to AD. Thus, it is necessary to relate the biomarkers of neuroimage to assess and predict MMSE and MoCA scores.

In this work, we investigate the multi-scale brain regions from the ROIs of whole brain to the subregions of hippocampus and amygdala to predict the MMSE and MoCA scores in the early stages of SCD and MCI. We extract three subsets of volumetric features from brain ROIs and the hippocampal and amygdala subregions. The sparse coding is then applied to identify the relevant features for each subset. Finally, the proximity-based random forest is used to combine three sets of volumetric features and establish a regression model for assessment of MMSE and MoCA scores. This study is trying to find the correlation between the volumes of the multi-scale brain regions and the dementia risk to further understand their roles in cognitive impairment and dementia risk. The remainder of this paper is organized as follows. In Sect. 2, we present the materials used in this work and the details of proposed method. Section 3 will present the experimental results and discussion. Finally, we conclude this paper in Sect. 4.

2 Materials and Methods

In this section, we introduce the data set used in this study, followed by the proposed regression method with details. Figure 1 shows the flowchart of our proposed regression framework, which consists of image acquisition and processing, feature extraction and selection, and final score regression.

2.1 Materials and Image Processing

The data set in this study are obtained from Shanghai Mental Health Center, China. The participants were recruited from the China Longitudinal Aging Study (CLAS) of Cognitive Impairment (NCT03672448) started in 2011 [8]. This study includes 226 subjects consisting of 36 amnestic MCI, 112 SCD and 78 NC, recruited from a community-based study of individuals aged above 60 in Shanghai, China. Table 1 shows the demographic and clinical information of the studied subjects.

Table 1. Demographic and clinical information of the subjects (Mean ± standard deviation).

Full size table

All T1-weighted MR brain images are segmented into 50 regions of interests (ROIs) shown in Table 2 with a fully automated pipeline of FreeSurfer 6.0.0 [9]. The ROI volumes are computed as one subset of features for regression. In addition, the cortex, GM and WM volumes of left and right hemispheres and the volumes of supra tentorial are included in this feature set. There are 57 volumes in this feature set.

Table 2. The segmented 50 ROIs of the whole brain.

Full size table

Furthermore, to investigate the complex structure of hippocampus and amygdala, FreeSurfer is further used to partition these ROIs into 44 and 20 subregions, respectively, as shown in Fig. 2. The volumes are computed from these subregions as two feature sets to predict the cognitive scores.

2.2 The Proposed Prediction Method

After segmentation, three subsets of volume features are obtained from the ROIs and the subregions of hippocampal and amygdala to predict the MMSE and MoCA scores. Our proposed method can identify the most relevant features for each subset of features, followed by random forest regression for prediction of clinical scores.

First, sparse coding is used to select the most relevant features for each subset which considers the combination of features over different brain regions to handle the multivariate interactions. Let y denote the clinical scores of training data; $ {\mathbf{\rm A}} $ represent the feature matrix of $ M \times N $ for M participants; $ \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega } = \left( {\omega_{1} ,\omega_{2} , \ldots ,\omega_{N} } \right)^{T} $ is the coefficient vector assigned to the N features. An $ L1 $-regularized sparsity could be imposed on the coefficients to choose the relevant features for regression. The $ L1 $-regularized least square problem can be formulated as:

$$ \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega } = {\text{argmin}}_{\omega } \left\| {y - {\mathbf{\rm A}}\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega } } \right\|_{2}^{2} + \gamma \left\| {\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega } } \right\|_{1} ,\,\,\,s.t. \,\,\,\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega }_{i} \ge 0,\,\forall i $$

(1)

where γ is the sparsity regularization parameter which controls the amount of zero coefficients in $ \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega } $. The non-zero elements in $ \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\rightharpoonup}$}} {\omega } $ indicate that the corresponding features are relevant to the regression. The grid search can be used to obtain the optimal sparsity value through cross-validation on the training samples.

Second, random forest [10, 11] is used to compute the proximity measures and make the score regression with the selected features. It can also report the importance of features for each subset. For regression task, decision trees act as regression trees. During the growth of a tree, each node is determined by finding a feature that minimizes the difference between the left and right subset predicting errors. When the predicting error is below a threshold, the node stops splitting as a terminal node. The feature importance can be calculated with the difference between the left and right subset predicting errors. Each weight value is normalized between 0–1. After training, the random forest generates proximity measures showing the probability that two subjects fall into the same leaf node in the regression results of all T trees. Our method has a double feature selection to better explore the relevant features for prediction.

Finally, after 3 individual random forest models are trained to predict the scores with three subsets of features, their proximity matrices are linearly combined into a final proximity matrix as:

$$ {\text{P}} = w_{1} P_{1} + w_{2} P_{2} + \left( {1 - w_{1} - w_{2} } \right)P_{3} $$

(2)

where P denotes the final proximity matrix and $ \omega_{1} ,\omega_{2} $ are the weights assigned to the corresponding subsets of features. The composite proximity matrix P is input to the random forest model to combine three subsets of features for prediction of scores.

3 Experimental Results

3.1 Datasets and Implementation

The data used in our experiments are from 226 subjects as detailed in Sect. 2.1. In our experiments, the OOB error is converged to stable when $ nTree\,{ \gtrsim }\,500 $ and the optimal number of trees in the forest $ nTree = 1000 $. The weighting parameters $ {\text{w}}_{1} ,{\text{w}}_{2} $ were optimized via grid search in training process to obtain the best performance of random forest regression. The 10-fold cross-validation is used to evaluate the proposed method. It is repeated 10 times and the final result is obtained by averaging 10 test predictions to reduce the chance of experimental results. To evaluate the prediction performance, we compute the mean squared error (MSE) and the mean absolute error (MAE) between the actual and estimated MMSE and MoCA scores by averaging the results of ten tests. In addition, the Pearson’s correlation coefficient (CORR) is used to evaluate the power of regression line in data representation.

3.2 Results on Prediction of Cognitive Scores

The first experiment is to test the effects of different subsets of features on the MMSE and MoCA prediction. We also compare the results by using the t-test and sparse coding for feature selection. As for sparse coding, features from 3 subsets are selected separately to get more precise proximity matrix. As for t-test, two groups of data are divided according to the level of scores to select features. The predicting results by using different features and their combinations are listed in Tables 3 and 4, respectively. From the results, we can see the volume features from the subregions of Hippocampus and Amygdala achieve better performances than ROI features. The sparse coding performs better than the t-test. Specifically, the proposed combination achieves the highest correlation coefficients of 0.469 and 0.436.

Table 3. The performances comparison for prediction of MMSE scores using different features

Full size table

Table 4. The performances comparison for prediction of MoCA scores using different features

Full size table

The second experiment is to test the effects of the weighted combination (WC) of the proximity matrices for fusing three subsets of features on prediction performances. One direct method is to concatenate the selected features from different subsets as the input of regression model. Table 5 shows the prediction performances and the corresponding scatter plots are shown in Fig. 3. We can see that the proposed weighted combination performs better than the concatenating method.

Table 5. Performance comparison for prediction of clinical scores with different combinations

Full size table

3.3 Biomarkers Relevant to the Predictions of Cognitive Scores

In this section, we investigate the relevant biomarkers for disease interpretation. We computed the number of times that the features were selected out of 10 folds and denoted as frequency. The features with frequency higher than 8 were selected as the relevant biomarkers for each partition. Our study found that hippocampus atrophy in the right hemisphere has a higher weight than the left on the scores while the amygdala is just the opposite. The hippocampal fimbria shows the highest weight among all ROIs, with right fimbria showing higher weight than the left. The results indicate that the commonly selected top regions are consistent to the AD pathology studies [5, 6, 12].

4 Conclusion

In this paper, we have proposed a combined regression framework based on sparse coding and random forest for prediction of MMSE and MoCA scores. It enables MRI diagnostic analysis of the SCD group, which is rarely involved in current research. Three sets of volumetric features are extracted from the ROIs of whole brain and the subregions of hippocampus and amygdala. Sparse coding is applied to select the relevant features to clinical score estimation. As for brain ROIs, the paper subdivided the subregions on the basis of the hippocampus and the amygdala. By comparison with the whole brain, it is proved that the amygdala is more closely associated with clinical scores, followed by hippocampus. These results are also consistent with relative clinical experiments, achieving computer-aided diagnosis and prediction of AD process through the calculation and analysis of brain MRI.

References

Silveira, M., Marques, J.: Boosting Alzheimer disease diagnosis using PET images. In: 2010 20th International Conference on Pattern Recognition, pp. 2556–2559. IEEE, (2010)
Google Scholar
Lin, Y., Shan, P.-Y., Jiang, W.-J., Sheng, C., Ma, L.: Subjective cognitive decline: preclinical manifestation of Alzheimer’s disease. Neurol. Sci. 40, 41–49 (2019)
Article Google Scholar
Tales, A., Jessen, F., Butler, C., Wilcock, G., Phillips, J., Bayer, T.: Subjective cognitive decline. J. Alzheimers Dis. 48, S1–S3 (2015)
Article Google Scholar
Kirkova, V., Traykov, L.: Predictors of cognitive decline and dementia in individuals with subjective cognitive impairment: a longitudinal study. J. Neurol. S42 (2013). Springer, Heidelberg Tiergartenstrasse 17, D-69121 Heidelberg, Germany (2013)
Google Scholar
Yue, L., et al.: Asymmetry of hippocampus and amygdala defect in subjective cognitive decline among the community dwelling Chinese. Front. Psychiatry 9 (2018)
Google Scholar
Perrotin, A., et al.: Hippocampal subfield volumetry and 3D surface mapping in subjective cognitive decline. J. Alzheimers Dis. 48, S141–S150 (2015)
Article Google Scholar
Liu, M., Cheng, D., Wang, K., Wang, Y., Alzheimer’s Disease Neuroimaging Initiative: Multi-modality cascaded convolutional neural networks for Alzheimer’s disease diagnosis. Neuroinformatics 16, 1–14 (2018)
Article Google Scholar
Xiao, S., et al.: Methodology of China’s national study on the evaluation, early recognition, and treatment of psychological problems in the elderly: the China Longitudinal Aging Study (CLAS). Shanghai Archives of Psychiatry 25, 91 (2013)
Google Scholar
Fischl, B.: FreeSurfer. Neuroimage 62, 774–781 (2012)
Article Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Article Google Scholar
Svetnik, V., Liaw, A., Tong, C., Culberson, J.C., Sheridan, R.P., Feuston, B.P.: Random forest: a classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 43, 1947–1958 (2003)
Article Google Scholar
Evans, T.E., et al.: Subregional volumes of the hippocampus in relation to cognitive function and risk of dementia. Neuroimage 178, 129–135 (2018)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Instrument Science and Engineering, School of EIEE, Shanghai Jiao Tong University, Shanghai, 200240, China
Aojie Li & Manhua Liu
MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University, Shanghai, China
Manhua Liu
Department of Geriatric Psychiatry, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Ling Yue & Shifu Xiao
Alzheimer’s Disease and Related Disorders Center, Shanghai Jiao Tong University, Shanghai, China
Ling Yue & Shifu Xiao

Authors

Aojie Li
View author publications
You can also search for this author in PubMed Google Scholar
Ling Yue
View author publications
You can also search for this author in PubMed Google Scholar
Manhua Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shifu Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ling Yue , Manhua Liu or Shifu Xiao .

Editor information

Editors and Affiliations

BASIRA, Istanbul Technical University, Istanbul, Turkey
Islem Rekik
Stanford University, Stanford, CA, USA
Ehsan Adeli
Daegu Gyeongbuk Institute of Science and Technology, Daegu, Korea (Republic of)
Sang Hyun Park

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, A., Yue, L., Liu, M., Xiao, S. (2019). Prediction of Clinical Scores for Subjective Cognitive Decline and Mild Cognitive Impairment. In: Rekik, I., Adeli, E., Park, S. (eds) Predictive Intelligence in Medicine. PRIME 2019. Lecture Notes in Computer Science(), vol 11843. Springer, Cham. https://doi.org/10.1007/978-3-030-32281-6_14

Download citation

DOI: https://doi.org/10.1007/978-3-030-32281-6_14
Published: 10 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32280-9
Online ISBN: 978-3-030-32281-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Prediction of Clinical Scores for Subjective Cognitive Decline and Mild Cognitive Impairment

Abstract

Similar content being viewed by others

Cognitive Function Assessment and Prediction for Subjective Cognitive Decline and Mild Cognitive Impairment

Early diagnosis of Alzheimer’s disease and mild cognitive impairment using MRI analysis and machine learning algorithms

Self-weighted Multi-task Learning for Subjective Cognitive Decline Diagnosis

Keywords

1 Introduction