Abstract
Alzheimer’s Disease (AD) is a progressive neurodegenerative disease for which biomarkers based on its pathophysiology may provide objective measures for diagnosis and staging. Structural scans acquired by MRI and metabolism images obtained by FDG-PET provide in-vivo measurements of structure and function (glucose metabolism) in a living brain. It is hypothesized that combining multiple image modalities that provide complementary information could improve early diagnosis of AD. In this paper, we propose a novel deep-learning-based framework to discriminate individuals with AD utilizing a multimodal and multiscale deep neural network. Our method delivers 82.4% accuracy in identifying individuals with mild cognitive impairment (MCI) who will convert to AD 3 years prior to conversion (86.4% combined accuracy for conversion within 1–3 years), a 94.23% sensitivity in classifying individuals with a clinical diagnosis of probable AD, and an 86.3% specificity in classifying non-demented controls, improving upon results in the published literature.
Introduction
Alzheimer’s disease (AD) is the most common dementia, affecting 1 out of 9 people over the age of 65 years1. Alzheimer’s disease involves progressive cognitive impairment, commonly associated with early memory loss, requiring assistance for activities of self-care during advanced stages. Alzheimer’s is posited to evolve through a prodromal stage, commonly referred to as the mild cognitive impairment (MCI) stage, and 10–15% of individuals with MCI progress to AD2 each year. With improved life expectancy, it is estimated that about 1.2% of the global population will develop Alzheimer’s disease by 20463, thereby affecting millions of individuals directly, as well as many more indirectly through the effects on their families and caregivers. There is an urgent need to develop biomarkers that can identify the changes in a living brain due to the pathophysiology of AD, providing numerical staging scores as well as identifying syndromal stages.
Neuroimaging modalities such as magnetic resonance imaging (MRI)4 and fluorodeoxyglucose positron emission tomography (FDG-PET)5 have previously been used to develop such pathophysiology-based biomarkers for the diagnosis of AD, especially targeting the prodromal stage, where the pathology has begun but the clinical symptoms have not yet manifested. Structural MRI provides measures of the gray matter, white matter and CSF compartments of the brain, enabling the quantification of volumes, cortical thickness and shape of various brain regions; these measures have been utilized in developing classifiers for AD6,7,8,9,10,11,12,13. FDG-PET provides measures of resting-state glucose metabolism14, reflecting the functional activity of the underlying tissue5, and has also been utilized for AD biomarker development15,16,17. Other published approaches have utilized a combination of modalities for developing neuroimaging AD biomarkers4,18,19,20,21,22,23,24.
Recent advances in deep neural network approaches for developing classifiers have delivered astounding performance for many recognition tasks25, and their application to the recognition of AD has attracted considerable attention26,27,28. By applying deep neural networks to extract features, such as the stacked autoencoder (SAE) or the Deep Boltzmann Machine (DBM), these approaches outperform popular traditional machine learning methods, e.g., support vector machines (SVM) and random forests. A major problem in applying deep neural networks to AD diagnosis is that only a small amount of training data is available for learning discriminative patterns in very high-dimensional feature spaces. Another issue is that the scale at which the discriminative signal resides is not known a priori; hence, dimensionality reduction techniques need to be sensitive to multiple scales to increase the chances of extracting the discriminative signal.
In this paper, we propose a novel approach for combining multimodal information from both MRI and FDG-PET images at multiple scales within a deep neural network framework. Our proposed multiscale approach extracts features at coarse-to-fine structural scales29,30. This is achieved by segmenting the structural image into cortical and subcortical gray-matter compartments, further subdividing each into patches of a hierarchy of sizes, extracting features from each patch size26,27,28 by averaging within the patch, and feeding these multi-scale features from multiple modalities into a deep learning framework. Unlike simple downsampling, which could lead to the loss of discriminative information, our multi-scale approach preserves the structural and metabolism information at multiple scales and may potentially improve the classification accuracy for this diagnostic task31. To validate the proposed methodology, we performed cross-validation experiments with all available ADNI data (subjects with both a T1 structural MRI and an FDG-PET metabolism image). A comprehensive set of results is presented for the detection of controls and of MCI subjects who convert to AD, as a function of years to conversion, as well as for the classification of controls and AD subjects, for each modality separately and in combination; comparison with existing methods in the literature demonstrates the superiority of the deep neural network framework in AD diagnosis and prognosis.
Methods
There are two major steps in the proposed framework: (1) image preprocessing: segment both the MRI and FDG-PET images, subdivide the gray-matter segmentation into patches of a range of sizes, and extract features from each patch size; and (2) classification: train a deep neural network to learn the patterns that discriminate AD individuals, and then use it to classify individual subjects.
Materials
Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial MRI, PET, other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD).
For a comprehensive validation of the proposed method, all the available ADNI subjects (N = 1242) with both a T1-weighted MRI scan and an FDG-PET image at the time of preparation of this manuscript were used in this study. These subjects were categorized into 5 groups based on the individual’s clinical diagnosis at baseline and future timepoints:

1) Stable normal controls (sNC): 360 subjects diagnosed as NC at baseline who remained NC at the time of preparation of this manuscript.

2) Stable MCI (sMCI): 409 subjects diagnosed as MCI at all time points (at least for 2 years).

3) Progressive NC (pNC): 18 subjects evaluated as NC at the baseline visit who had progressed to a clinical diagnosis of probable AD at the time of preparation of this manuscript.

4) Progressive MCI (pMCI): 217 subjects evaluated as MCI at the baseline visit who progressed to a clinical diagnosis of probable AD at some point in the future (data available up to 8 years prior to conversion for some individuals).

5) Stable Alzheimer’s disease (sAD): 238 subjects with a clinical diagnosis of probable AD.

Subjects showing improvement in their clinical diagnosis during follow-up, i.e. those clinically diagnosed as MCI who reverted to NC or those clinically diagnosed as probable AD who reverted to MCI, were excluded from the study because of the potential uncertainty of clinical misdiagnosis, considering that AD is an irreversible form of dementia1. The progressive controls and progressive MCI subjects have some neuroimaging timepoints with a clinical diagnosis of probable AD. Hence, the subset of images from pNC and pMCI subjects that carry a clinical diagnosis of probable AD is counted as part of the sAD group for assessment of classifier accuracy, while the remaining images before the conversion to AD are assessed as part of the pNC and pMCI groups. Demographic and clinical information of the subjects is shown in Table 1. Numbers in brackets are the numbers of male and female subjects in the second row, while in the remaining three rows the two numbers represent the minimum and maximum values of age, years of education and MMSE (Mini-Mental State Examination) score. In total, there are 2402 FDG-PET scans and 2402 MRI images including all longitudinal time-points. Detailed descriptions of the ADNI subject cohorts, image acquisition protocols and post-acquisition preprocessing procedures can be found at http://www.adni-info.org.
Image Processing
Unlike typical image recognition problems where deep learning has been shown to be effective, our data set, although very large in a neuroimaging context, is relatively small. Directly using this smaller database of images to train a deep neural network is therefore unlikely to deliver high classification accuracy. However, contrary to typical image recognition tasks, where the database contains large heterogeneity, the images in this database are all human brain images acquired with similar pose and scale, and hence show much less heterogeneity in comparison. We therefore applied the following processing steps to extract patch-wise features, as shown in Fig. 1. FreeSurfer 5.332 was used to segment each T1 structural MRI image into gray matter and white matter, followed by subdivision of the gray matter into 87 anatomical regions of interest (ROIs). The FreeSurfer segmentations were quality controlled by an expert neuroanatomist and any noted errors were manually corrected. Then, a T1 MRI image was chosen as the template. Each ROI of this template was further subdivided into smaller regions of varying sizes, denoted here as “patches”. The voxels in each ROI were clustered into patches through k-means clustering based on the Euclidean distance of their spatial coordinates33, i.e. voxels spatially close to each other belong to the same patch. Given that the sizes of the FreeSurfer ROIs differ, we predefined the number of voxels in each patch instead of fixing the number of patches in each ROI, keeping a uniform patch density (patches in ROI/voxels in ROI) across the brain and thereby aggregating signal at the same scale among the different ROIs. In this study, the patch sizes were predefined to be 500, 1000 and 2000 voxels. Using these sizes, the total numbers of patches across the brain gray-matter ROIs segmented by FreeSurfer were found to be 1488, 705 and 343, respectively. The patch sizes were chosen to retain enough detailed information while avoiding too large a feature dimension given the limited number of available data samples. Subsequently, each ROI of the standard template MRI was registered to the same ROI of every target image via a high-dimensional non-rigid registration method (LDDMM34). The registration maps were then applied to the patch-wise segmentation of the standard template. This transformed the template patch segmentation into each target MRI space, so the target images were subdivided into the same number of patches for their FreeSurfer ROIs. It is worth mentioning that after the transformation, the size of a given template patch differs across images because the non-rigid registration encodes local expansion/contraction; this size is therefore one of the features used to represent the regional information of a given structural brain scan. Then, for each target subject, the FDG-PET image was co-registered to the subject’s skull-stripped T1 MRI scan with a linear transformation using the FSL-FLIRT program35, with the degrees of freedom (DOF) set to 12 and normalized correlation used as the cost function. The mean intensity in the brainstem region of the FDG-PET image was chosen as the reference to normalize the voxel intensities of that individual brain metabolism image, because the brainstem is among the regions least likely to be affected by AD.
The mean intensity of each patch was used to form the feature vector representing metabolic activity, and the volume of each patch was used to represent brain structure.
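As an illustration of this step, the following sketch (not the authors’ released code; the function and argument names are ours, and for simplicity it clusters each target ROI directly rather than clustering the template and propagating the patches through the registration, as the paper does) subdivides one gray-matter ROI into roughly 500-voxel patches and computes the per-patch mean metabolism and volume features:

```python
import numpy as np
from sklearn.cluster import KMeans

def extract_patch_features(roi_mask, pet_image, voxel_volume_mm3, patch_size=500):
    """Cluster one ROI into spatially compact patches by k-means on voxel
    coordinates, then compute per-patch mean FDG-PET intensity (metabolism
    feature) and patch volume (structure feature)."""
    coords = np.argwhere(roi_mask)                       # (n_voxels, 3) coordinates
    n_patches = max(1, round(len(coords) / patch_size))  # uniform patch-size density
    labels = KMeans(n_clusters=n_patches, n_init=10,
                    random_state=0).fit_predict(coords)  # nearby voxels share a patch

    intensities = pet_image[tuple(coords.T)]             # brainstem-normalized values
    mean_intensity = np.array([intensities[labels == k].mean()
                               for k in range(n_patches)])
    volume = np.array([(labels == k).sum() * voxel_volume_mm3
                       for k in range(n_patches)])
    return mean_intensity, volume
```

Repeating this over all 87 ROIs at patch sizes of 500, 1000 and 2000 voxels yields the 1488-, 705- and 343-dimensional feature vectors per modality described above.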
Multimodal and Multiscale Deep Neural Network
With the features extracted from the MRI and FDG-PET images, we trained a Multimodal and Multiscale Deep Neural Network (MMDNN) to perform the classification. As shown in Fig. 2, the network consists of two parts. The first part consists of 6 independent deep neural networks (DNNs), one for each scale of a single modality. The second part is another DNN used to fuse the features extracted by these 6 DNNs; its input is the concatenated latent representations learned by the individual DNNs. The DNNs in the two parts share the same structure. For each DNN, the numbers of nodes in the hidden layers were set to 3N, \(\tfrac{3}{4}N\) and 100, respectively, where N denotes the dimension of the input feature vector. These numbers were chosen to explore all possible hidden correlations across features from different patches in the first layer and to gradually reduce the number of features in the following layers to avoid over-fitting. We trained each DNN in two steps, unsupervised pre-training and supervised fine-tuning, and then tuned all the parameters of the MMDNN together. The trained DNN outputs a probability value for each class, and the final classification assigns the label with the highest probability. The probability value can also be interpreted as a disease staging score, with the extreme value of 0 representing the highest probability of belonging to the sNC class, and the extreme value of 1 representing the highest probability of belonging to the AD class.
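A minimal tf.keras sketch of this topology is given below; it is an illustration rather than the authors’ implementation (the original work predates tf.keras and trains with layer-wise pre-training, described next), with the per-scale input dimensions 1488/705/343 and the 50% dropout rate taken from the text:

```python
import tensorflow as tf

def sub_dnn(n_input):
    """One per-modality, per-scale DNN: hidden layers of 3N, 3N/4 and 100 nodes."""
    inp = tf.keras.Input(shape=(n_input,))
    x = tf.keras.layers.Dense(3 * n_input, activation="relu")(inp)
    x = tf.keras.layers.Dropout(0.5)(x)                    # dropout after every hidden layer
    x = tf.keras.layers.Dense(3 * n_input // 4, activation="relu")(x)
    x = tf.keras.layers.Dropout(0.5)(x)
    x = tf.keras.layers.Dense(100, activation="relu")(x)   # 100-d latent representation
    return inp, x

def build_mmdnn(scale_dims=(1488, 705, 343), n_classes=2):
    """Six sub-networks (2 modalities x 3 patch scales) fused by a second DNN
    with the same 3N / 3N/4 / 100 structure, ending in a softmax layer."""
    inputs, latents = [], []
    for _ in ("mri_volume", "pet_metabolism"):
        for n in scale_dims:
            inp, lat = sub_dnn(n)
            inputs.append(inp)
            latents.append(lat)
    fused = tf.keras.layers.Concatenate()(latents)         # 6 x 100 = 600-d input, N = 600
    x = tf.keras.layers.Dense(3 * 600, activation="relu")(fused)
    x = tf.keras.layers.Dropout(0.5)(x)
    x = tf.keras.layers.Dense(450, activation="relu")(x)   # 3N/4 with N = 600
    x = tf.keras.layers.Dropout(0.5)(x)
    x = tf.keras.layers.Dense(100, activation="relu")(x)
    out = tf.keras.layers.Dense(n_classes, activation="softmax")(x)
    return tf.keras.Model(inputs, out)
```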
Unsupervised Pre-training
For the unsupervised pre-training step, each DNN was trained as a stacked autoencoder (SAE). An autoencoder is an artificial neural network used for unsupervised learning of non-linear hidden patterns in input data. It consists of three fully-connected layers: an input layer, a hidden layer and an output layer. Three functions define an autoencoder: the encoding function, the decoding function and the loss function. In this study, the encoding function is defined as \(y = s(W_1 x + b_1)\), where x is the input data, y is the latent representation, \(W_1\) is the weight matrix, \(b_1\) is the bias term and s is the activation function, for which we used the rectified linear function max(0, x). Similarly, the decoding function can be represented as \(z = s(W_2 y + b_2)\), where we constrained the network with tied weights, \(W_2 = W_1^T\), and z is the reconstructed data, which should be close to the input x. The squared error \(\tfrac{1}{2}\|x - z\|^2\) is used as the loss function to optimize the network. The hypothesis is that the latent representation captures the main factors of variation in the data. Compared with another popular unsupervised feature learning method, principal component analysis (PCA), the activation function enables the network to capture non-linear factors of data variation, especially when multiple encoders and decoders are stacked to form an SAE. To fully train the network, we applied the greedy layer-wise training36 approach, in which every hidden layer is trained separately.
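The sketch below illustrates this pre-training scheme under our own naming conventions (a tied-weight, ReLU autoencoder minimizing the squared reconstruction error with Adam, as in the text; the epoch count is an arbitrary placeholder):

```python
import tensorflow as tf

class TiedAutoencoder(tf.keras.Model):
    """One autoencoder layer with tied weights (W2 = W1^T) and ReLU activation."""
    def __init__(self, n_in, n_hidden):
        super().__init__()
        self.W1 = tf.Variable(tf.random.normal([n_in, n_hidden], stddev=0.01))
        self.b1 = tf.Variable(tf.zeros([n_hidden]))
        self.b2 = tf.Variable(tf.zeros([n_in]))

    def encode(self, x):
        return tf.nn.relu(tf.matmul(x, self.W1) + self.b1)    # y = s(W1 x + b1)

    def call(self, x):
        y = self.encode(x)
        return tf.nn.relu(tf.matmul(y, tf.transpose(self.W1)) + self.b2)  # z = s(W1^T y + b2)

def pretrain_layer(ae, data, epochs=100, lr=1e-4):
    """Minimize the squared error 0.5 * ||x - z||^2; greedy layer-wise training
    feeds the codes of one trained layer as input to the next layer."""
    opt = tf.keras.optimizers.Adam(lr)
    for _ in range(epochs):
        with tf.GradientTape() as tape:
            loss = 0.5 * tf.reduce_mean(tf.reduce_sum((data - ae(data)) ** 2, axis=1))
        grads = tape.gradient(loss, ae.trainable_variables)
        opt.apply_gradients(zip(grads, ae.trainable_variables))
    return ae.encode(data)   # latent codes: training input for the next autoencoder
```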
Supervised Fine-tuning
After pre-training, the first three layers of a DNN were initialized with the parameters of the encoders from the pre-trained SAE, followed by a softmax output layer. At first, we trained the output layer independently while fixing the parameters of the first 3 layers. Then we fine-tuned the whole network as a Multilayer Perceptron (MLP) using the subject labels as the criterion. The network outputs the probabilities of a subject belonging to each class, and the class with the highest probability determines the output label of the subject. If we use \(x_i\), \(y_i\) to represent the input feature vector and label of the i-th sample, respectively, the loss function based on cross entropy can be written as:

\(E = -\frac{1}{N}\sum_{i=1}^{N}\sum_{j} 1\{y_i = j\}\,\log h_j(x_i)\)

where N is the number of input samples, j ranges over the classes, and \(h_j(x_i)\) is the probability the network assigns to class j for input \(x_i\).
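The two-step procedure can be sketched as follows (an illustration assuming the tied-weight autoencoders above and integer class labels; the epoch counts are placeholders, while the 10⁻⁴ learning rate and batch size of 50 follow the Optimization section below):

```python
import tensorflow as tf

def fine_tune(pretrained_aes, x_train, y_train, n_classes=2):
    """Build an MLP from the pre-trained encoder weights, train the softmax
    output layer with the lower layers frozen, then fine-tune the whole
    network with the cross-entropy loss."""
    layers = []
    for ae in pretrained_aes:                      # reuse W1, b1 from each encoder
        dense = tf.keras.layers.Dense(ae.W1.shape[1], activation="relu")
        dense.build((None, ae.W1.shape[0]))
        dense.set_weights([ae.W1.numpy(), ae.b1.numpy()])
        layers.append(dense)
    model = tf.keras.Sequential(
        layers + [tf.keras.layers.Dense(n_classes, activation="softmax")])

    for layer in model.layers[:-1]:
        layer.trainable = False                    # step 1: train the output layer alone
    model.compile(tf.keras.optimizers.Adam(1e-4),
                  "sparse_categorical_crossentropy", metrics=["accuracy"])
    model.fit(x_train, y_train, epochs=20, batch_size=50, verbose=0)

    for layer in model.layers:
        layer.trainable = True                     # step 2: fine-tune the whole MLP
    model.compile(tf.keras.optimizers.Adam(1e-4),
                  "sparse_categorical_crossentropy", metrics=["accuracy"])
    model.fit(x_train, y_train, epochs=100, batch_size=50, verbose=0)
    return model
```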
Optimization of Network
Training of the network was performed via back-propagation with the Adam algorithm37, a first-order gradient-based optimization algorithm that has proven to be computationally efficient and appropriate for training deep neural networks. During the training stage, the training set was randomly split into mini-batches38 of 50 samples each. At every iteration, only a single mini-batch was used for optimization. After every batch had been used once, the training set was reordered and randomly divided again so that each batch contained different samples in different epochs.
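A minimal sketch of this mini-batch schedule (our own helper, not library code) is:

```python
import numpy as np

def minibatches(x, y, batch_size=50, rng=np.random.default_rng()):
    """Yield mini-batches from a fresh random permutation; calling this once
    per epoch ensures each batch contains different samples across epochs."""
    order = rng.permutation(len(x))
    for start in range(0, len(x), batch_size):
        idx = order[start:start + batch_size]
        yield x[idx], y[idx]

# usage: one optimization step per mini-batch, re-shuffling every epoch
# for epoch in range(n_epochs):
#     for xb, yb in minibatches(x_train, y_train):
#         train_step(xb, yb)
```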
Dropout
To prevent the deep neural network from overfitting, regularization is necessary to reduce its generalization error. In this study, we used dropout39 to learn more robust features and prevent overfitting. In a dropout layer, some units are randomly dropped, providing a way to combine many different neural networks. We inserted dropout layers after every hidden layer. In each iteration of the training stage, only half of the hidden units were randomly selected to feed their outputs to the next layer, while in the testing stage all hidden units were kept to perform the classification. By avoiding training all hidden units on every training sample, this regularization technique not only prevented complex co-adaptations on the training data and decreased overfitting, but also reduced the amount of computation and improved training speed.
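The behavior described here corresponds to the standard (inverted) dropout computation, sketched below for a single hidden activation; with p_keep = 0.5, roughly half the units are dropped in each training iteration and all units are kept at test time:

```python
import numpy as np

def dropout_forward(h, p_keep=0.5, training=True, rng=None):
    """Inverted dropout: randomly zero hidden units during training and rescale
    the survivors so the expected activation matches test time, when every
    hidden unit participates."""
    if not training:
        return h                         # testing: all hidden units kept
    if rng is None:
        rng = np.random.default_rng()
    mask = rng.random(h.shape) < p_keep  # each unit kept with probability p_keep
    return h * mask / p_keep             # ~half the units feed the next layer
```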
Early Stopping
Another approach we used to prevent overfitting is early stopping. Because deep architectures are trained with iterative back-propagation, the network becomes more adapted to the training data after every epoch; beyond a certain point, improving the network’s fit to the training set is likely to decrease generalization accuracy. In order to terminate the optimization algorithm before over-fitting, early stopping was used to guide how many iterations are needed. In the cross-validation experiments, after dividing the data set into training and testing portions, we further split the training samples into a training set and a validation set. The network was trained only with data in the former, while samples in the validation set were used to determine when to stop the algorithm, namely when the network attains its highest accuracy on the validation set. In practice, we stopped the optimization if the validation accuracy had ceased to increase for 50 epochs.
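A sketch of this patience-based stopping rule, written against a generic model with get_weights/set_weights accessors (the train_one_epoch and validation_accuracy callables are placeholders for the actual training and evaluation routines):

```python
def train_with_early_stopping(model, train_one_epoch, validation_accuracy,
                              max_epochs=1000, patience=50):
    """Stop when validation accuracy has not improved for `patience` epochs,
    then restore the weights from the best-validation epoch."""
    best_acc, best_weights, stale_epochs = -1.0, model.get_weights(), 0
    for _ in range(max_epochs):
        train_one_epoch(model)
        acc = validation_accuracy(model)
        if acc > best_acc:
            best_acc, best_weights, stale_epochs = acc, model.get_weights(), 0
        else:
            stale_epochs += 1
            if stale_epochs >= patience:      # ceased to increase for 50 epochs
                break
    model.set_weights(best_weights)
    return model
```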
Ensemble Classifiers
Although early stopping has proven useful in most deep learning problems, our relatively small data set limits the number of samples available for validation, and a small validation set may not represent the whole data set, resulting in a biased network. Therefore, we resorted to ensembling multiple classifiers to perform more stable and robust classification. Instead of selecting a single validation set, we randomly divided the training set into 10 sets and used them to train 10 different networks that ‘vote’ for the classification. At the training stage, for network i, set i was used for validation while the remaining 9 sets were used for training. At the testing stage, the test samples were fed into all of these networks, yielding 10 sets of probabilities. For each sample, the probabilities from the 10 networks were summed, and the class with the highest total probability was taken as the classification result. Although an ensemble of classifiers may not outperform a single classifier on every occasion, the ensemble strategy statistically improves the classification accuracy as well as the robustness and stability of the classifier.
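The voting scheme can be sketched as follows (build_and_train is a placeholder standing in for the full pre-training/fine-tuning pipeline above, and each network is assumed to return class probabilities from predict):

```python
import numpy as np
from sklearn.model_selection import KFold

def train_ensemble(x_train, y_train, build_and_train, n_nets=10):
    """Train 10 networks: network i validates on fold i and trains on the
    remaining 9 folds, so each network sees a different validation set."""
    nets = []
    for train_idx, val_idx in KFold(n_splits=n_nets, shuffle=True,
                                    random_state=0).split(x_train):
        nets.append(build_and_train(x_train[train_idx], y_train[train_idx],
                                    x_train[val_idx], y_train[val_idx]))
    return nets

def ensemble_predict(nets, x_test):
    """Sum the class probabilities from all networks; the class with the
    highest total is the ensemble's classification."""
    total = sum(net.predict(x_test) for net in nets)   # (n_samples, n_classes)
    return np.argmax(total, axis=1)
```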
Ensemble Classifier Probability Distribution
The output of the DNN for each individual image is a pair of probability values representing the probabilities that the given input image features (or image-pair features for multimodal input) belong to each of the two classes on which the DNN was trained. The probability score for the disease positive (AD) class can be interpreted as a disease severity staging score, since a value of 1 represents the highest probability of being from the AD class, and 0 represents the highest probability of being from the disease negative (NC) class.
Classifier Validation Experiment Setup
To validate the discriminative ability of the proposed network, two kinds of binary classification experiments were performed. First, we performed discrimination between sMCI and pMCI to compare our results directly with published state-of-the-art methods18,20,21,28,40,41,42,43,44,45,46. Since the published literature typically used only baseline images, we also used a single baseline image for each of the 409 sMCI subjects; hence, the number of sMCI images equals the number of sMCI subjects. For the 217 pMCI subjects, the earliest image within 3 years before conversion was selected. The data samples were randomly divided into 10 sets. For each iteration, 1 set was used for testing while the remaining 9 sets were used for training; therefore, every subject was used for testing exactly once.
One potential issue with the sMCI class is that some of these individuals may progress to AD or other dementias in the future: if they later convert to probable AD, their earlier timepoints would become part of the pMCI group, whereas other individuals may revert to NC. Hence, although the sMCI vs. pMCI experiment is commonly used to assess classifier performance in recent studies, the classification of sMCI subjects may not be entirely accurate due to the potential uncertainty in the clinical diagnosis of the sMCI class. Therefore, we performed additional experiments that involved classifying individuals with known future progression to AD, namely the pNC, pMCI and sAD classes, denoted as the dementia positive class, against stable normal controls (sNC), denoted as the dementia negative class.
We investigated the performance of the classifier using various combinations of samples during the training phase. At the first level, the classifier was trained solely on samples from the sNC subjects (the dementia negative class) and the sAD subjects (the dementia positive class). At the next level, the dementia positive class was enriched with images of pMCI subjects, which represent an earlier stage in the evolution of AD. At the last level, the positive class was further enriched by adding images of pNC subjects, representing an even earlier stage in the evolution of AD. For each level, classifier training followed the standard 10-fold cross-validation procedure (90% of the data samples used for training and 10% for testing in each iteration). The groups not used for training, if any, were utilized in the testing group. In these experiments, allocation into training or testing was done at the level of subjects, not images: if a subject was allocated to the training group, all available baseline and longitudinal images of this subject were used for training; otherwise, all available images of the subject were used for testing.
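A sketch of this subject-level allocation (our illustration; subject_ids holds one subject identifier per image, so grouping by it keeps all longitudinal images of a subject on one side of each split):

```python
import numpy as np
from sklearn.model_selection import GroupKFold

def subject_level_folds(features, labels, subject_ids, n_folds=10):
    """Yield 10-fold train/test splits in which no subject contributes images
    to both the training and the testing portion of a fold."""
    subject_ids = np.asarray(subject_ids)
    for train_idx, test_idx in GroupKFold(n_splits=n_folds).split(
            features, labels, groups=subject_ids):
        # sanity check: the two subject sets must be disjoint
        assert not set(subject_ids[train_idx]) & set(subject_ids[test_idx])
        yield (features[train_idx], labels[train_idx],
               features[test_idx], labels[test_idx])
```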
Sensitivity of the classifier is defined as the fraction of positive class images that are correctly classified, which in this case is the classification of the test subset of pNC, pMCI and sAD images as the positive class. Specificity is the fraction of negative class images (the sNC class) that are correctly classified as sNC. Accuracy is the fraction of images from both the positive and negative classes that are correctly classified.
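Expressed as code (with 1 denoting the dementia positive class and 0 the dementia negative class), these definitions are:

```python
import numpy as np

def classifier_metrics(y_true, y_pred):
    """Sensitivity, specificity and accuracy for a binary labeling where
    1 = dementia positive (pNC/pMCI/sAD images) and 0 = dementia negative (sNC)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    sensitivity = np.mean(y_pred[y_true == 1] == 1)  # positives correctly classified
    specificity = np.mean(y_pred[y_true == 0] == 0)  # sNC correctly classified
    accuracy = np.mean(y_pred == y_true)             # all images correctly classified
    return sensitivity, specificity, accuracy
```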
The proposed deep neural network (DNN) was built with TensorFlow47, an open-source deep learning toolbox provided by Google. For all the experiments, the number of nodes in each layer was predefined as shown in Fig. 2 and the learning rate was set to 10⁻⁴. The deep network parameter space is very large, with a large range of choices from which to sample (e.g., number of layers and number of nodes), so testing all possible parameter combinations exhaustively is computationally unrealistic. Instead of performing parameter selection for each of the 10-fold experiments, the parameters were selected based on the results of the first fold.
Results
Discrimination between Stable and Progressive MCI (sMCI vs pMCI)
We conducted the sMCI vs. pMCI experiment to compare the classification accuracy of our proposed method with published, comparable state-of-the-art methods18,20,21,28,40,41,42,43,44,45,46. The FDG-PET and MRI images acquired at a single time point for each subject were used in the 10-fold cross-validation experiment: for sMCI subjects, the images acquired at the first visit were used, while for pMCI subjects, the images acquired at the earliest time point within 3 years before conversion were used. Results of this experiment and comparable results from published methods are shown in Table 2. These results reveal an accuracy of 82.9% for our MMDNN method over 626 subjects, with both specificity (83.8%) and sensitivity (79.7%) being high. The results for the single-modality DNNs also improve upon the state of the art. These results suggest that our proposed MMDNN network is promising for applications requiring classification between sMCI and pMCI individuals, whether using single-modality T1-MRI or FDG-PET or the multimodal (T1-MRI and FDG-PET combined) neuroimaging approach.
Discrimination between disease negative (sNC) and disease positive (the pNC, pMCI, sAD) classes
The classifier was trained to discriminate the negative class (sNC) from the disease positive class (pNC, pMCI, sAD) using three different enrichments for the positive class samples, namely training with the positive class containing only sAD, or, pMCI and sAD, or, pNC and pMCI and sAD samples. Each subject was used for testing at least once in the 10-fold cross validation experiments. In each fold of the experiment, images of the same subject acquired at different time points were either all used for training or all used for testing to ensure the independence of training and testing at all times, as further detailed in the Classifier Validation Experiment Setup Section.
The classification results of these experiments are shown in Table 3. The DNN based on FDG-PET neuroimaging features (accuracy 85.9%) performs better than the DNN based on T1-MRI features (accuracy 82.5%), and the combined MMDNN outperforms each of the single-modality DNNs (accuracy 86.4%). As the positive class is enriched with samples from pMCI and then further with pNC samples, the sensitivity (correctly classified members of the dementia positive class, i.e. pNC, pMCI and sAD) increases. Since some of the early-stage patterns of AD represented in pMCI and pNC may overlap with the sNC group, there is a slight decrease in specificity, but overall an increase in accuracy.
The features extracted by the deep neural network are displayed in Fig. 3. Although difficult to interpret as these are extracted from multiple nonlinear transformations of data, they show that the patterns for the different classes appear to be distinct, whereas patterns within each class appear to be relatively similar.
Classification performance of pNC and pMCI as a function of time (years) to conversion
We analyzed the accuracy of classification of pNC and pMCI as a function of the time (years) to conversion and the number of subjects available for the MMDNN classifier. These results are shown in Fig. 4 for each of the three training scenarios with progressive enrichment of the positive class. As the positive class training set of sAD (top row, left panel) is enriched with samples from pMCI (top row, middle panel) and with pNC and pMCI samples (top row, right panel), the accuracy of detecting the pMCI and pNC classes increases, as does the accuracy of identifying AD in pNC and pMCI at earlier timepoints. The numerical values of classifier performance for the pNC-, pMCI- and sAD-enriched positive class (top row, third panel on the right) are provided in the table in the second row of the figure.
The MMDNN classifier accuracy in identifying pMCI individuals with future conversion to AD was 90%, 86.6% and 82.4% at 1, 2 and 3 years before conversion, respectively. The accuracy over all years taken together for pMCI classification was 79.22%, and 86.4% for conversion within 1–3 years. Neuroimaging scans acquired farther from conversion are likely more challenging to classify correctly, lowering the overall accuracy. The classification accuracy for the sAD group, i.e. those images associated with a clinical diagnosis of AD, is 94.25%. The accuracy for correctly classifying all pNC images is 41.1%, with higher values of 100%, 60.0% and 66.7% at 1, 2 and 3 years before conversion to a clinical diagnosis of probable AD.
Classification Probability score distribution
The probability scores output by the MMDNN trained with the dementia negative (sNC) class and the three enrichment choices for the dementia positive class (namely, sAD, pMCI + sAD, and pNC + pMCI + sAD) are visualized as histograms in the top row of Fig. 5, with the fraction of images of each class on the y axis and the classifier probability score on the x axis. This distribution shows how the sNC, pNC, pMCI and sAD classes are scored by the classifier for their probability of belonging to the dementia positive class. Further, the bottom row of Fig. 5 shows aggregate values of the probability score for each class as box plots. As the training set for the dementia positive class is enriched with samples from the pMCI and then additionally the pNC class, the probability scores for these classes increase. Overall, the distribution generated by the MMDNN leads to good separation between the classes, and the threshold choice of 0.5 (highest class probability assignment) is visually shown to provide good classification between the classes.
Discussion
In this paper, we have proposed a novel deep neural network (DNN) based method that utilizes multi-scale and multi-modal information (MMDNN), combining metabolism (FDG-PET) and regional volume (T1-MRI), for the discrimination of AD, with a focus on assessing classification accuracy in those pNC and pMCI subjects with known future conversion to probable AD. In accordance with scale-space theory, our incorporation of a multiscale approach was intended to capture the discriminant signals at multiple scales and avoid an a priori assumption of the scale at which the discriminant signals may reside.
The comparison between our proposed MMDNN method and state-of-the-art methods for the sMCI vs. pMCI classification task is shown in Table 2. Although the data used in the cited studies are not identical, they all come from the ADNI database and have comparable image acquisition and preprocessing procedures. One of the strengths of our work is that we analyzed all available ADNI sMCI and pMCI subjects having both MRI and FDG-PET neuroimages at the time of preparation of this manuscript. When using only the T1-MRI modality, our method has better accuracy than all methods except Huang et al.’s46. However, they used a longitudinal method with multiple MRI images acquired at different time points for the classification of each subject, whereas we classify each image separately, an approach consistent with the other published cross-sectional methods. For single-modality classifiers using only FDG-PET, our method outperforms the published methods by a significant margin, as shown in Table 2. Extension of our DNN to utilize longitudinal timepoints for single-subject classification is a direction for future work, and we anticipate that adding longitudinal measures explicitly could further improve classifier performance.
When using multiple modalities for sMCI vs. pMCI classification, our MMDNN approach has the best performance, especially compared with the methods that also used the same T1-MRI and FDG-PET modalities. The study of Cheng et al.41 performed domain transfer learning to exploit auxiliary-domain data (sAD/sNC subjects) to improve the classification, whereas our proposed MMDNN method performed better even though we did not utilize domain transfer learning in our sMCI vs. pMCI classification task.
Further, we performed experiments to detect prodromal AD by training the MMDNN classifier with samples from the dementia positive class, namely the pNC, pMCI and sAD subjects. The accuracy of correctly classifying pNC and pMCI subjects as having patterns indicative of AD improved when the classifier training included pMCI and pNC images, as displayed in Table 3. Further, comparison of the DNN results for the T1-MRI and FDG-PET classifiers in Table 3 indicates that the sensitivity of detecting prodromal AD is higher with FDG-PET neuroimaging features than with T1-MRI features. This finding is consistent with previous studies18,28,40,41 and could support the hypothesis that alterations in metabolism precede changes in structure, such that altered metabolism could be detected with FDG-PET earlier than structural changes with T1-MRI.
Analysis of the accuracy of classifying prodromal AD, i.e. detecting patterns corresponding to AD in pNC and pMCI individuals, as a function of time (years) to conversion is shown in Fig. 4. As the training set was enriched with samples from the pNC and pMCI groups, the accuracy of detecting prodromal AD also increased. The MMDNN classifier delivered high accuracy up to three years prior to conversion, with reduced performance for timepoints 4–8 years prior to conversion. The numbers of subjects 1–3 years before conversion are large (over 100 each), whereas fewer subjects are available 4–8 years before conversion; this reduced sample at timepoints farther from conversion could potentially increase classification uncertainty. As more neuroimaging data corresponding to timepoints farther from conversion become available, models such as the MMDNN proposed here could provide better classification performance for the earlier detection of prodromal AD.
The probability score output from the DNN is visualized in Fig. 5. The probability score is highest for the sAD class and lowest for the sNC class, the two extreme ends of the spectrum for the classifier. The probability scores for the pNC and pMCI subjects lie in between, and are higher for pMCI than pNC, generally in line with the expectation of progressive alterations detected with neuroimaging for subjects further along the disease trajectory. Further analysis of the classifier probability score could be an interesting avenue toward developing a surrogate staging score for disease severity.
Despite the remarkable ability of DNNs to discover patterns that may not be apparent on human visual examination, one major disadvantage of the DNN framework is that, as a result of the multiple non-linear transformations between the input and the output, it is not readily possible to map the output classification probability back to the patterns in the input neuroimaging data that give rise to it. The visualization of the output of the penultimate layer of the DNN for individual subject images is shown in Fig. 3, and beyond observing a qualitative difference between the features of different classes, it is not currently possible to relate these to neuroimaging features at specific locations in the brain. Understanding how to provide a pathophysiologically meaningful interpretation of the features extracted by the DNN for classification remains an unsolved problem and an important future research direction.
A small number of subjects are assigned a probability score inconsistent with their clinical diagnosis. One of the main requirements for training DNNs is large quantities of well-characterized data25. It is therefore possible that as more comprehensive and homogeneous training databases are developed and become available for learning, the accuracy numbers may increase and these outliers will be reduced. It is also possible that there is some uncertainty in the available clinical diagnoses. Despite these limitations, our findings indicate that the DNN framework has considerable potential in learning AD-related patterns, with promising future applications in adding to the toolbox of clinical AD diagnosis.
Conclusion
In summary, we have proposed a novel deep neural network to identify individuals at risk of developing Alzheimer’s disease. Our multi-scale and multi-modal deep neural network (MMDNN) was designed to incorporate multiple scales of information from multiple regions in the gray matter of the brain, taken from multiple modalities (T1-MRI and FDG-PET). First, we demonstrated the discriminative ability of the proposed MMDNN approach by comparing with state-of-the-art methods on the task of discriminating between sMCI and pMCI individuals. Then we trained the classifier to distinguish subjects on a trajectory towards a clinical diagnosis of probable AD (i.e. the pNC and pMCI subjects). We observed that the performance of the MMDNN classifier built with a combination of FDG-PET and structural MRI images was better than that of classifiers built using either modality alone. Further, the classifier trained with the combined sample of pNC, pMCI and sAD yielded the highest overall classification performance: 82.4% accuracy in identifying the individuals with mild cognitive impairment (MCI) who will convert to AD 3 years prior to conversion (86.4% combined accuracy for conversion within 1–3 years), a 94.23% sensitivity in classifying individuals with a clinical diagnosis of probable AD, and an 86.3% specificity in classifying non-demented controls. These results suggest that deep neural network classifiers may be useful as a potential tool for providing evidence in support of the clinical diagnosis of probable AD.
References
Alzheimer’s Association. Alzheimer’s disease facts and figures. Alzheimer’s & Dementia: The Journal of the Alzheimer’s Association 7, 208 (2011).
Petersen, R. C. et al. Mild cognitive impairment: ten years later. Arch. neurology 66, 1447–1455 (2009).
Brookmeyer, R., Johnson, E., Ziegler-Graham, K. & Arrighi, H. M. Forecasting the global burden of Alzheimer’s disease. Alzheimer’s & dementia 3, 186–191 (2007).
Davatzikos, C., Bhatt, P., Shaw, L. M., Batmanghelich, K. N. & Trojanowski, J. Q. Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification. Neurobiol. aging 32, 2322–e19 (2011).
Landau, S. M. et al. Associations between cognitive, functional, and FDG-PET measures of decline in AD and MCI. Neurobiol. aging 32, 1207–1218 (2011).
Farhan, S., Fahiem, M. A. & Tauseef, H. An ensemble-of-classifiers based approach for early diagnosis of Alzheimer’s disease: Classification using structural features of brain images. Comput. and mathematical methods medicine 2014 (2014).
Korolev, S., Safiullin, A., Belyaev, M. & Dodonova, Y. Residual and Plain Convolutional Neural Networks for 3D Brain MRI Classification. arXiv preprint arXiv:1701.06643 (2017).
Payan, A. & Montana, G. Predicting Alzheimer’s disease: a neuroimaging study with 3D convolutional neural networks. arXiv preprint arXiv:1502.02506 (2015).
Eskildsen, S. F. et al. Prediction of Alzheimer’s disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning. Neuroimage 65, 511–521 (2013).
Misra, C., Fan, Y. & Davatzikos, C. Baseline and longitudinal patterns of brain atrophy in MCI patients, and their use in prediction of short-term conversion to AD: results from ADNI. Neuroimage 44, 1415–1422 (2009).
Wolz, R. et al. Multi-method analysis of MRI images in early diagnostics of Alzheimer’s disease. PLoS One 6, e25446 (2011).
Cuingnet, R. et al. Automatic classification of patients with Alzheimer’s disease from structural MRI: a comparison of ten methods using the ADNI database. Neuroimage 56, 766–781 (2011).
Cho, Y. et al. Individual subject classification for Alzheimer’s disease based on incremental learning using a spatial frequency representation of cortical thickness data. Neuroimage 59, 2217–2230 (2012).
Mosconi, L. et al. Pre-clinical detection of Alzheimer’s disease using FDG-PET, with or without amyloid imaging. J. Alzheimer’s Dis. 20, 843–854 (2010).
Gray, K. R. et al. Multi-region analysis of longitudinal FDG-PET for the classification of Alzheimer’s disease. NeuroImage 60, 221–229 (2012).
Toussaint, P.-J. et al. Resting state FDG-PET functional connectivity as an early biomarker of Alzheimer’s disease using conjoint univariate and independent component analyses. Neuroimage 63, 936–946 (2012).
Illán, I. et al. 18 F-FDG PET imaging analysis for computer aided Alzheimer’s diagnosis. Inf. Sci. 181, 903–916 (2011).
Young, J. et al. Accurate multimodal probabilistic prediction of conversion to Alzheimer’s disease in patients with mild cognitive impairment. NeuroImage: Clin. 2, 735–745 (2013).
Zhang, D. et al. Multimodal classification of Alzheimer’s disease and mild cognitive impairment. Neuroimage 55, 856–867 (2011).
Moradi, E. et al. Machine learning framework for early MRI-based Alzheimer’s conversion prediction in MCI subjects. Neuroimage 104, 398–412 (2015).
Korolev, I. O. et al. Predicting progression from mild cognitive impairment to Alzheimer’s dementia using clinical, MRI, and plasma biomarkers via probabilistic pattern classification. PloS One 11, e0138866 (2016).
Ye, J. et al. Sparse learning and stability selection for predicting MCI to AD conversion using baseline ADNI data. BMC Neurol 12, 46 (2012).
Gaser, C., Franke, K., Kloppel, S., Koutsouleris, N. & Sauer, H. BrainAGE in Mild Cognitive Impaired Patients: Predicting the Conversion to Alzheimer’s Disease. PLoS One 8, e67346 (2013).
Zhang, D. et al. Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer’s disease. NeuroImage 59, 895–907 (2012).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, 1097–1105 (2012).
Liu, S. et al. Multimodal neuroimaging feature learning for multiclass diagnosis of Alzheimer’s disease. IEEE Transactions on Biomed. Eng. 62, 1132–1140 (2015).
Liu, S. et al. Early diagnosis of Alzheimer’s disease with deep learning. In Biomedical Imaging (ISBI), 2014 IEEE 11th International Symposium on, 1015–1018 (IEEE, 2014).
Suk, H.-I. et al. Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis. NeuroImage 101, 569–582 (2014).
Zhang, W., Zelinsky, G. & Samaras, D. Real-time accurate object detection using multiple resolutions. In Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, 1–8 (IEEE, 2007).
Lowe, D. G. Distinctive image features from scale-invariant keypoints. Int. journal computer vision 60, 91–110 (2004).
Tang, Y. & Mohamed, A.-R. Multiresolution Deep Belief Networks. In AISTATS, 1203–1211 (2012).
Fischl, B., Sereno, M. I. & Dale, A. M. Cortical surface-based analysis. II: Inflation, flattening, and a surface-based coordinate system. Neuroimage 9(2), 195–207 (1999).
Raamana, P. R. et al. Thickness network features for prognostic applications in dementia. Neurobiol. aging 36, S91–S102 (2015).
Beg, M. F., Miller, M. I., Trouvé, A. & Younes, L. Computing large deformation metric mappings via geodesic flows of diffeomorphisms. Int. journal computer vision 61(2), 139–157 (2005).
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17, 825–841 (2002).
Bengio, Y. et al. Greedy layer-wise training of deep networks. Adv. neural information processing systems 19, 153 (2007).
Kingma, D. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
Bengio, Y. Practical recommendations for gradient-based training of deep architectures. In Neural networks: Tricks of the trade, 437–478 (Springer, 2012).
Srivastava, N., Hinton, G. E., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
Liu, K., Chen, K., Yao, L. & Guo, X. Prediction of Mild Cognitive Impairment Conversion Using a Combination of Independent Component Analysis and the Cox Model. Front. human neuroscience 11 (2017).
Cheng, B., Liu, M., Zhang, D., Munsell, B. C. & Shen, D. Domain transfer learning for MCI conversion prediction. IEEE Transactions on Biomed. Eng. 62, 1805–1817 (2015).
Zhu, X. et al. A novel relational regularization feature selection method for joint regression and classification in AD diagnosis. Med. image analysis (2017).
Xu, L., Wu, X., Chen, K. & Yao, L. Multi-modality sparse representation-based classification for Alzheimer’s disease and mild cognitive impairment. Comput. methods programs biomedicine 122, 182–190 (2015).
Zhang, D. & Shen, D. Predicting future clinical changes of MCI patients using longitudinal and multimodal biomarkers. PLoS One 7, e33182 (2012).
An, L. et al. A Hierarchical Feature and Sample Selection Framework and Its Application for Alzheimer’s Disease Diagnosis. Sci. Reports 7 (2017).
Huang, M. et al. Longitudinal measurement and hierarchical classification framework for the prediction of Alzheimer’s disease. Sci. reports 7 (2017).
Abadi, M. et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Software available from tensorflow.org, http://tensorflow.org (2015).
Acknowledgements
This work was supported by National Science Engineering Research Council (NSERC), Canadian Institutes of Health Research (CIHR), Michael Smith Foundation for Health Research (MSFHR), Brain Canada, Genome BC and the Pacific Alzheimer Research Foundation (PARF). Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
Author information
Contributions
Donghuan Lu and Gavin Weiguang Ding built the deep neural network. Donghuan Lu and Karteek Popuri processed the neuroimage data. Donghuan Lu, Karteek Popuri and Mirza Faisal Beg designed the experiments. Donghuan Lu, Rakesh Balachandar and Mirza Faisal Beg interpreted the results. All authors reviewed the manuscript.
Ethics declarations
Competing Interests
The authors declare no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.