Abstract
Intra-voxel models of the diffusion signal are essential for interpreting organization of the tissue environment at micrometer level with data at millimeter resolution. Recent advances in data driven methods have enabled direct comparison and optimization of methods for in-vivo data with externally validated histological sections with both 2-D and 3-D histology. Yet, all existing methods make limiting assumptions of either (1) model-based linkages between b-values or (2) limited associations with single shell data. We generalize prior deep learning models that used single shell spherical harmonic transforms to integrate the recently developed simple harmonic oscillator reconstruction (SHORE) basis. To enable learning on the SHORE manifold, we present an alternative formulation of the fiber orientation distribution (FOD) object using the SHORE basis while representing the observed diffusion weighted data in the SHORE basis. To ensure consistency of hyper-parameter optimization for SHORE, we present our Deep SHORE approach to learn on a data-optimized manifold. Deep SHORE is evaluated with eight-fold cross-validation of a preclinical MRI-histology data with four b-values. Generalizability of in-vivo human data is evaluated on two separate 3T MRI scanners. Specificity in terms of angular correlation (ACC) with the preclinical data improved on single shell: 0.78 relative to 0.73 and 0.73, multi-shell: 0.80 relative to 0.74 (p < 0.001). In the in-vivo human data, Deep SHORE was more consistent across scanners with 0.63 relative to other multi-shell methods 0.39, 0.52 and 0.57 in terms of ACC. In conclusion, Deep SHORE is a promising method to enable data driven learning with DW-MRI under conditions with varying b-values, number of diffusion shells, and gradient directions per shell.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Diffusion-weighted magnetic resonance imaging (DW-MRI) is essential for non-invasive reconstruction of the microstructure for the human in-vivo brain. These images are sensitized to the underlying organization of the tissue at a millimetric scale. Multiple approaches have been proposed that can model the non-linear relationship between the DW-MRI signal and biological microstructure with the most common being diffusion tensor imaging (DTI) [1]. Substantial efforts have shown that other advanced approaches can recover more elaborate reconstruction of the microstructure and these methods are collectively referred to as high angular resolution diffusion imaging (HARDI) [2]. HARDI methods have been broadly proposed in two categories of single shell acquisitions and multi-shell acquisitions (i.e., using multiple diffusivity values). A majority of single shell HARDI methods utilize spherical harmonics (SH) based modelling as in q-ball imaging (QBI) [3], super-resolved constrained deconvolution (sCSD) [4], and many others. However, SH based modelling cannot directly leverage additional information provided by multi-shell acquisitions. SH have been combined with other bases to represent multi-shell data, e.g., solid harmonics [5], simple harmonic oscillator reconstruction (SHORE) [6], and spherical polar Fourier imaging [7].
Methodological exploration has been driven through classical mathematical transforms while data-driven approaches have been limited (Fig. 1) due to lack of external validation data. Prior work using data-driven approaches for DW-MRI has been shown in [8], however the primary application for their work is shown for outlier detection and low rank signal prediction. Validation through histology is critical to evaluate the precision of white matter (WM) reconstruction [9]. Prior work through machine learning approaches on reconstruction for single shell diffusion acquisitions has exhibited higher precision and reproducibility. However, this has not been shown for multi-shell acquisitions due to lack of external validation data [10] (Fig. 1).
To overcome these issues, we propose a novel approach, Deep SHORE, which incorporates the following key contributions: (1) an unsupervised hyper-parameter optimization for improved learning in the SHORE manifold, (2) representation of a microstructure model in the SHORE manifold to improve precision and reproducibility, and (3) a non-negativity constraint implementation for a deep learning model.
2 Data Acquisition
Three ex-vivo squirrel monkey brains were imaged on a Varian 9.4T scanner. A total of 100 gradient volumes were acquired using a diffusion-weighted EPI sequence at diffusivity values of 3000, 6000, 9000 and 12000 s/mm2 at an isotropic resolution of 0.3 mm. An observation is that approximation of b-values for the ex-vivo acquisition is equivalent to in-vivo b-values of 1000, 2000, 3000 and 4000 s/mm2 [11]. After acquisition, the tissue was sectioned and stained with fluorescent dil and imaged on a LSM710 confocal microscope following procedures outlined in [9]. The histological fiber orientation distribution (HFOD) was extracted using 3D structure tensor analysis. A multi-step registration procedure was used to determine the corresponding diffusion MRI signal [9]. A similar procedure is outlined in [12]. A total of 567 histological voxels were processed. A hundred random rotations were applied to the remaining voxels for both the MR signal and the HFOD to augment the data, bringing the total to 57,267 voxels. As a limitation we acknowledge that there is a possibility of registration error approximately up to the size of MR voxels (up to 300 micrometers) [13].
The in-vivo acquisitions of the three human subjects were acquired on two sites ‘A’ and ‘B’. Both sites were equipped with a 3T scanner with a 32-channel head coil. Structural T1 MPRAGE was acquired for all subjects on both the sites. The diffusion acquisition protocol and scanner information are listed on each site as follows:
Site ‘A’: The scan was acquired at a diffusivity values of 1000, 1500, 2000, 2500, 3000 s/mm2. A total of 96 diffusion weighted gradient volumes were acquired per diffusivity value with a ‘b0’. Briefly the other parameters are: SENSE = 2.5, partial Fourier = 0.77, FOV = 96 × 96, Slice = 48, isotropic resolution: 2.5 mm.
Site ‘B’: All parameters of scan acquisition were same as that of the scanner at site ‘A’ except for the isotropic resolution which was 1.9 × 1.9 × 2.5 mm3 and down-sampled to 2.5 mm iso.
The in-vivo acquisitions were pre-processed with standard procedures of eddy, topup and b0 normalization followed by pairwise registration per subject [14]. T1s were registered and transformed to the diffusion space. Brain extraction tool was used for skull stripping [15]. WM segmentation was performed using T1 for in-vivo data [16].
3 Methods
The SHORE basis function has been shown to capture the representation of multi-shell DW-MRI with minimal representation error [6] and ensure the same when modelling single shell DW-MRI. The DW-MRI normalized signal, E(q), can be represented as:
where c are the coefficients to be estimated, G depicts the radial basis combined with Y, the SH basis. The radial basis G is represented as follows:
where ζ is the scale parameter, q is the radius of the diffusivity value, and L depicts the associated Laguerre polynomial. Equation (2) can be optimized using the BFGS [17] algorithm by iterative refitting of the coefficients c. BFGS is well-known for solving unconstrained non-linear optimization problems. The novelty that we introduce here is that, when functioning across several normalized datasets, an optimal ζ per dataset, will lead to learning on an optimized manifold. Additional parameters of SHORE include: radial order: 6, and regularization constants: 1e-8 [6]. SHORE estimates 50 coefficients at 6th order. Regularized linear least squares were used for the estimation of the coefficients. As one notes, l is even for diffusion spherical harmonics or SHORE due to symmetry of diffusion inference process. Essentially, SH at 8th order and SHORE at 6th order offer the same degree of freedom. SHORE at higher orders is known to suffer from overfitting effects [18].
The HFOD represented as SH coefficients can be fitted to the SHORE basis with the two considerations of (1) diffusivity value and (2) ‘ζ’ scaling parameter. First the SH coefficients from an 8th order were sampled over a sphere of 100 gradient directions. The directions were ensured to be uniformly sampled on the sphere with minimized electrostatic repulsion. These directions were kept consistent at all times while predicting from the network as well. The diffusivity value was set to 2000 s/mm2. After which, it was fitted to SHORE basis using the process described above.
Non-negativity.
The FOD, when modelled as SH, cannot exist with negative mean. If it does, then the microstructure exists in the imaginary part of the SH which does hold true when modelling with real even ordered SH. Hence, there was an existing gap to enforce non-negativity on a deep learning network while training and making predictions. We use a regularization value of 0.005 to truncate all values on a set of gradient directions where value is <=0 on both sides of input and output. Thereafter, log space is used instead of linear space: ln(E(q)) and ln(P(r)), where E(q) is the normalized signal and P(r) is the FOD sampled over the gradient directions. Fitting of the representation method such as SH or SHORE follows after log transformation. After the predictions are made, the coefficients of a representation are transformed using exponential to recover them back to linear space.
Deep Network Design.
We use a 5-layered deep network with the following number of neurons: x1:400, x2:45, x3:200, x4:45 and x5:200. A residual block was created for the layers x2, x3 and x4 and hence the number of neurons was kept equal for x2 and x4. All layers were activated with ‘elu’. Additional parameters of the network: Loss function: mean squared error, batch size: 1000, optimizer: RMSProp. While training, only the input and output coefficients were modified for different subcases (discussed in next section). This was due to the fact that SHORE at 6th radial order is defined by 50 coefficients and SH at 8th order is defined by 45 coefficients. For training of the network, we used k-fold cross validation where k = 5 for optimal training.
To evaluate on any withheld set of data we used the angular correlation coefficient (ACC) [19], which is a measure on a scale of −1 to 1 where 1 is the best correlation. ACC is defined using two sets of SH coefficients ‘u’ and ‘v’:
Evaluation Strategies.
From the total of 57,267, we create 8 testing sets of data where each set has 7,272 voxels except for the last one which has 6,363 voxels. The remaining data for each set were used as training. For all cross-validation experiments, blocks of 101 voxels were randomly allocated to testing/training cohorts ensuring that no synthetic rotations of training data were included in the testing phase. While fitting SHORE coefficients the diffusivity shell of 6000 s/mm2 was withheld leading to four cases of evaluation in incrementing order of shells. For evaluation purposes, we used three sub-cases of deep learning manifolds (1) Input of ‘ζ’ optimized SHORE DW-MRI and output SH-HFOD. (2) Input of unoptimized ‘ζ’ SHORE-DWMRI and output of SHORE-HFOD (3) Input of optimized ‘ζ’ SHORE-DWMRI and output of SHORE-HFOD.
Furthermore, we make comparisons between single-shell and multi-shell approaches. For single shell, we show the comparison between the leading single shell approach sCSD, a prior proposed approach that utilizes deep learning [10, 20] (SHDNN) and SHORE derived FOD on the withheld shell and the same for multi-shell where sCSD and SHDNN were excluded.
For in-vivo reproducibility evaluation, we compare the ACC for all the pairs of WM voxels between the two sites ‘A’ and ‘B’ on a per subject basis. For SHORE based approaches, all the five shells of data were used while for sCSD we used b-value of 2000 s/mm2 as the highest reproducibility was exhibited on the specific shell.
4 Results
Evaluation across the withheld shell using combinations with other shells shows that when learning across the optimized SHORE manifold the ACC is most skewed towards higher correlation as compared to the other two approaches (Fig. 2). The median of all distributions for each method is presented in Table 1. The median for optimized SHORE learning is the highest. We found significant improvements after non-parametric signed rank test for all pairs of distributions (p \( \ll \) 0.001, Wilcoxon signed rank test).
When comparing predictions across single shell methods (Fig. 3A), the trend in the following increasing order of correlation (median in parenthesis): sCSD (0.73), SHORE-FOD (0.74), SHDNN (0.76), NNSHORE-DL (0.77), SHORE-DL (0.78). Similarly, when making multi-shell comparisons (Fig. 3B) we can observe increasing order of correlation: SHORE-FOD (0.75), NNSHORE-DL (0.79) and SHORE-DL (0.80). Non-parametric signed rank test for all pairs of distributions were found to be p \( \ll \) 0.001.
Observing the distribution of ACC across all pairs of WM voxels per subject (Fig. 4), increasing level of reproducibility from sCSD, SHORE-DL, SHORE-FOD and NNSHORE-DL. Non-parametric signed rank test for all pairs of distributions were found to be p \( \ll \) 0.001 (Table 2). The NNSHORE-DL exhibits highest reproducibility across all three subjects.
Qualitatively we can observe that SHORE-FOD exhibits higher reproducibility than sCSD and NNSHORE-DL exhibits higher as compared to SHORE-FOD (Fig. 5).
5 Conclusion
Deep SHORE is the first data-driven approach that generalizes diffusion microstructure estimation across multiple b-values, radial b-value sampling, and angular orientation sampling. Our approach enables direct comparison of data-driven diffusion analyses with model-based methods, e.g., sCSD, SHORE-FOD. Although Deep SHORE (NNSHORE-DL) compares favorably when subjected to quantitative cross-validation against histology data (Figs. 3 and 4, Tables 1 and 2), the total amount of data available is a limitation of this study. As current and planned studies acquire more data, we will be able to better train and evaluate data-driven approaches.
References
Basser, P.J., Mattiello, J., LeBihan, D.: MR diffusion tensor spectroscopy and imaging. Biophys. J. 66(1), 259–267 (1994)
Descoteaux, M.: High Angular Resolution Diffusion Imaging (HARDI). Wiley Encyclopedia of Electrical and Electronics Engineering (2015)
Tuch, D.S.: Q-ball imaging. Magn. Reson. Med. 52(6), 1358–1372 (2004)
Tournier, J.-D., Calamante, F., Connelly, A.J.N.: Robust determination of the fibre orientation distribution in diffusion MRI: non-negativity constrained super-resolved spherical deconvolution. Neuroimage 35(4), 1459–1472 (2007)
Descoteaux, M., et al.: Multiple q-shell diffusion propagator imaging. Med. Image Anal. 15(4), 603–621 (2011)
Cheng, J., Jiang, T., Deriche, R.: Theoretical analysis and practical insights on EAP estimation via a unified HARDI framework. In: MICCAI Workshop on Computational Diffusion MRI (CDMRI) (2011)
Cheng, J., Ghosh, A., Jiang, T., Deriche, R.: Model-free and analytical EAP reconstruction via spherical polar fourier diffusion MRI. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part I. LNCS, vol. 6361, pp. 590–597. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15705-9_72
Christiaens, D., et al.: Learning compact q-space representations for multi-shell diffusion-weighted MRI. IEEE Trans. Med. Imaging 38(3), 834–843 (2019)
Schilling, K., et al.: Comparison of 3D orientation distribution functions measured with confocal microscopy and diffusion MRI. Neuroimage 129, 185–197 (2016)
Nath, V., et al.: Inter-scanner harmonization of high angular resolution DW-MRI using null space deep learning. In: Bonet-Carne, E., Grussu, F., Ning, L., Sepehrband, F., Tax, C.M.W. (eds.) MICCAI 2018. MV, pp. 193–201. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-05831-9_16
Dyrby, T.B., et al.: An ex vivo imaging pipeline for producing high-quality and high-resolution diffusion-weighted imaging datasets. Hum. Brain Mapp. 32(4), 544–563 (2011)
Stolp, H., et al.: Voxel-wise comparisons of cellular microstructure and diffusion-MRI in mouse hippocampus using 3D Bridging of Optically-clear histology with Neuroimaging Data (3D-BOND). Sci. Rep. 8(1), 4011 (2018)
Choe, A.S., et al.: Accuracy of image registration between MRI and light microscopy in the ex vivo brain. Magn. Reson. Imaging 29(5), 683–692 (2011)
Jenkinson, M., et al.: Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17(2), 825–841 (2002)
Smith, S.M.: Fast robust automated brain extraction. Hum. Brain Mapp. 17(3), 143–155 (2002)
Zhang, Y., Brady, M., Smith, S.: Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans. Med. Imaging 20(1), 45–57 (2001)
Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Math. Program. 45(1–3), 503–528 (1989)
Özarslan, E., et al.: Mean apparent propagator (MAP) MRI: a novel diffusion imaging method for mapping tissue microstructure. Neuroimage 78, 16–32 (2013)
Anderson, A.W.: Measurement of fiber orientation distributions using high angular resolution diffusion imaging. Magn. Reson. Med. 54(5), 1194–1206 (2005)
Nath, V., et al.: Deep learning reveals untapped information for local white-matter fiber reconstruction in diffusion-weighted MRI. Magn. Reson. Imaging 62, 220–227 (2019)
Acknowledgements
NIH R01EB017230, NIH R01NS058639, NIH T32EB001628, 1S10OD020154-01, UL1 RR024975-01, UL1 TR000445-06 and Nvidia for GPU hardware.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Nath, V. et al. (2019). Enabling Multi-shell b-Value Generalizability of Data-Driven Diffusion Models with Deep SHORE. In: Shen, D., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2019. MICCAI 2019. Lecture Notes in Computer Science(), vol 11766. Springer, Cham. https://doi.org/10.1007/978-3-030-32248-9_64
Download citation
DOI: https://doi.org/10.1007/978-3-030-32248-9_64
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32247-2
Online ISBN: 978-3-030-32248-9
eBook Packages: Computer ScienceComputer Science (R0)