RU-Net: skull stripping in rat brain MR images after ischemic stroke with rat U-Net
BMC Medical Imaging volume 23, Article number: 44 (2023)
Abstract
Background
Experimental ischemic stroke models play a fundamental role in interpreting the mechanisms of cerebral ischemia and appraising the development of the pathological extent. An accurate and automatic skull stripping tool for rat brain image volumes acquired with magnetic resonance imaging (MRI) is crucial in experimental stroke analysis. Motivated by the lack of reliable rat brain segmentation methods and by the demands of preclinical studies, this paper develops a new skull stripping algorithm, named Rat U-Net (RU-Net), to extract the rat brain region in MR images after stroke.
Methods
Based on a U-shaped deep learning architecture, the proposed framework integrates batch normalization with the residual network to achieve efficient end-to-end segmentation. A pooling index transmission mechanism between the encoder and decoder is exploited to reinforce the spatial correlation. Two in-house datasets in two different modalities, diffusion-weighted imaging (DWI) and T2-weighted MRI (T2WI), each consisting of 55 subjects, were employed to evaluate the performance of the proposed RU-Net.
Results
Extensive experiments indicated high segmentation accuracy across diversified rat brain MR images. Our rat skull stripping network outperformed several state-of-the-art methods and achieved the highest average Dice scores of 98.04% (p < 0.001) and 97.67% (p < 0.001) in the DWI and T2WI image datasets, respectively.
Conclusion
The proposed RU-Net is believed to hold great potential for advancing preclinical stroke investigation and for providing an efficient tool for pathological rat brain image extraction, where accurate segmentation of the rat brain region is fundamental.
Background
Stroke is the leading cause of serious long-term disability and the major cause of mortality worldwide [1]. Of all strokes, the majority are the ischemic type resulting from the occlusion of a cerebral artery by a blood clot. Cerebral ischemia can induce many injuries including energy failure, intracellular calcium overload, and cell death, which eventually lead to the loss of neurological functions and permanent disabilities [2]. Experimental ischemic stroke models are crucial to understand the mechanism of cerebral ischemia and to evaluate the development of the pathological extent. Among the models in a variety of species, rodent stroke models have been broadly employed in experimental ischemia studies for decades [3].
In particular, the transient middle cerebral artery occlusion (tMCAO) model in rats is one of the closest simulations of human ischemic stroke, and it has been frequently utilized to induce infarction at the basal ganglion and cerebral cortex [4, 5]. To noninvasively disclose stroke regions and the associated tissue, one popular approach is magnetic resonance imaging (MRI), where diffusion-weighted imaging (DWI) and T2-weighted MRI (T2WI) exhibit complementary visualization of ischemic lesions [4]. A fundamental task in preclinical MRI studies of tMCAO models is skull stripping in rat brain MR images. Skull stripping, also known as brain extraction or intracranial segmentation, is a process that removes nonbrain tissues and separates brain regions in MR images. The extracted rat brain is critical to succeeding processes such as hemisphere segmentation, lesion segmentation, tissue classification, and volume measurement in preclinical stroke investigation [6,7,8].
Unfortunately, computer-aided tools for rat brain extraction have been lacking. Manual delineation of the rat brain region on numerous MR images has been widely adopted in many preclinical studies [3, 8, 9], but it is time-consuming, laborious work with low reproducibility [10, 11]. Consequently, an accurate and reliable image segmentation tool for brain extraction in MR image volumes is essential in experimental stroke rat analysis. Automatic skull stripping in rat brain MR images is quite challenging because the typical magnetic fields are higher (commonly \(\ge 7\text{T}\)) with a larger degree of radiofrequency inhomogeneity, which results in susceptibility artefacts and field biases [12]. Nevertheless, several attempts have been made to address the brain extraction problem in rat MR image volumes. For example, Li et al. [13] presented an automatic rat brain extraction method called the rat brain deformation (RBD) model, which made use of information on the brain geometry and the T2WI image characteristics of the rat brain.
A fully automatic skull stripping method in an atlas-based manner was proposed for rat MRI scans [14], which was founded on an iterative, continuous joint registration algorithm. Lancelot et al. [15] developed a multi-atlas based method for automated anatomical rat brain MRI segmentation in such a way that MR images are registered to a common space, where a rat brain template and a maximum probability atlas were constructed. Delora et al. [16] presented a template-based brain extraction scheme called “SkullStrip” to segment the whole mouse brain in T1-weighted and T2-weighted MR images. Huang et al. [17] built a statistic template of the rodent brain, which was adopted to predict the location of the brain in MR images. Alternatively, Zhang et al. [18] combined deformable models and hierarchical shape priors, which constrain the intermediate result for rodent brain structure segmentation. Oguz et al. [19] introduced a rapid automatic tissue segmentation (RATS) algorithm based on grayscale morphology with initial surface extraction followed by graph search. Liu et al. [10] described an automatic brain extraction method, entitled SHape descriptor selected Extremal Regions after Morphologically filtering (SHERM), which extracted the brain tissue in both rat and mouse MR images.
With recent advances in artificial neural networks, many researchers have demonstrated their effectiveness in human brain image segmentation [20,21,22]. However, compared with human brain investigation, few studies have applied this strategy to rodent brain extraction [23]. The major difference between human and rodent brain extraction stems from inherent brain dissimilarities in many aspects, including the brain tissue geometry, the brain-scalp distance ratio, the tissue contrast around the skull, the partial volume effect with respect to image resolution, and more noise due to the stronger magnetic fields in rat brain MRI. One example is the automatic cropping scheme based on the pulse coupled neural network (PCNN), which was proposed to segment the rat brain in T2WI image volumes in a slice-by-slice fashion [24]. Afterward, Chou et al. [25] described an automatic rodent brain extraction method by extending the PCNN algorithm into 3-D, which operated on the entire rodent brain MR image volume. Recently, deep learning-based approaches have reshaped the field of computer vision, in that convolutional neural networks (CNNs) have been successfully applied to many image processing tasks, e.g., classification of the ImageNet database [26]. To handle semantic segmentation problems, the fully convolutional network (FCN) [27], which is an end-to-end, pixel-to-pixel network, has shown outstanding performance over the CNN. In contrast to CNN models, the FCN framework exploits an upsampling tactic instead of fully-connected layers to recover the intermediate image back to the original image dimension.
One particular type of FCN architecture, U-Net [28], has proven valuable in biomedical image segmentation and has become the foundation of many segmentation methods. For example, an end-to-end learning algorithm for medical image segmentation was proposed [29], which introduced a category attention boosting module into the 3D U-Net segmentation network. A stacked U-Net scheme was applied to computed tomography image reconstruction that generated high-quality images in a short time from a small number of projections [30]. An automatic hemorrhagic stroke lesion segmentation approach in computed tomography scans was described, which is based on a 3D U-Net architecture incorporating squeeze-and-excitation blocks [31]. For preclinical studies, Hsu et al. [32] employed the U-Net to automatically identify rodent brain boundaries in MR images, which was trained and evaluated using rat and mouse datasets. De Feo et al. [33] presented a multi-task U-Net (MU-Net) framework designed to accomplish both skull stripping and region segmentation in large mouse brain MRI datasets. In light of the U-Net architecture, the final block of the decoder branch bifurcates into two different output maps corresponding to the two tasks. A unique CNN, called MedicDeepLabv3+ [34], was introduced to simultaneously segment intracranial brains and cerebral hemispheres in rat brain MR image volumes. By incorporating spatial attention layers and additional skip connections into the decoder, the network was able to attain more precise segmentation.
Motivated by the demands of preclinical ischemia studies, this paper develops an automatic skull stripping framework for rat brain MR images after stroke based on a deep learning network. The proposed architecture, named Rat U-Net (RU-Net) and publicly available at https://github.com/lvanna/RU-Net, takes advantage of U-Net [28], the residual network [35], and batch normalization [36] to perform efficient end-to-end segmentation of rat brain images. With the same U-shaped structure, two skull stripping networks are individually trained and validated using two different MRI modalities, DWI and T2WI. Owing to the lack of publicly available rat brain MR images after ischemic stroke, two in-house datasets corresponding to DWI and T2WI have been established. Skull stripping in the two MRI modalities using the proposed RU-Net is compared with state-of-the-art methods. The main contributions of the current work are summarized as follows:
1) A new skull stripping system, referred to as RU-Net, specifically designed to handle pathological rat brain MR images after stroke, was developed.
2) On the foundation of a U-shaped architecture, batch normalization combined with a residual network strategy was investigated for extracting rat brain characteristics.
3) A pooling index transmission mechanism between the encoder and decoder was introduced to tackle large intensity variations in ischemic rat brain MR images.
4) Two in-house datasets containing pathological rat brain DWI and T2WI image volumes were established.
The remainder of this paper is organized as follows. Section 2 describes the acquired datasets and the deep learning architecture for effective feature extraction, and elaborates on the proposed RU-Net for rat skull stripping. Section 3 presents experimental results and performance analyses for both modalities of DWI and T2WI image data. Section 4 discusses our investigation of the segmentation outcome. Finally, Section 5 draws the conclusion.
Materials and methods
Ischemic stroke model
An ischemia-reperfusion model of rats based on the tMCAO with a silicon-coated nylon filament was carried out. Male Sprague-Dawley rats (BioLASCO Taiwan Co.) aged 7–9 weeks with body weights of 181–336 g were employed as experimental subjects. Different ischemic durations of 0.5, 0.75, 1, 1.5, 2, and 3 h were used to develop a wide range of infarction. Before the operation, the rats were kept under standard conditions and supplied with water and food ad libitum. Under inhalation anesthesia with isoflurane (induction dosage: 4%, maintenance dosage: 2%), an anterior neck incision at the right paramedian line (5 mm from the midline) was made to expose the right carotid artery. After serial ligations of the right common carotid artery (CCA), external carotid artery, and internal carotid artery (ICA), a silicon-coated filament was inserted into the right CCA and deliberately advanced towards the right ICA until light resistance was encountered. The filament sizes were determined according to the body weight of each individual rat. The rats were allowed to regain consciousness after fixation of the filament in the ICA and closure of the neck wound. Toward the end of the ischemic period, the rats were anesthetized again to remove the filament and accomplish reperfusion. In accordance with the principles of the Basel Declaration, the protocol was approved by the Animal Committee of National Taiwan University College of Medicine.
Image acquisition
This study was dedicated to the skull stripping of pathological rat brain MR images with cerebral ischemia. Since there is no public image dataset appropriate for our investigation, we established two in-house preclinical stroke rat MRI datasets. Each stroke rat underwent DWI and T2WI examinations to unveil ischemic regions in the brain. All rat MR images were acquired using a 7T MRI machine (Bruker PharmaScan, Ettlingen, Germany) at National Taiwan University, Taipei, Taiwan. The parameters of the DWI sequence were as follows [37]: b-value 1000 s/mm², repetition time (TR) 4500 ms, echo time (TE) 30 ms, coronal section thickness 1 mm with 15 slices, field of view (FOV) \(2.56\times 2.56\) cm², and matrix size \(128\times 128\). The parameters of T2WI were as follows: 15 contiguous coronal slices (thickness: 1 mm) acquired with an FOV of \(2.56\times 2.56\) cm², matrix size \(256\times 256\), TR 3000 ms, and TE 50 ms. Altogether, 55 rat subjects were captured with DWI and T2WI for this study. After the MRI scanning, the rats were sacrificed for in vitro staining experiments. All rats were euthanized by intracardiac infusion of 1% sodium nitrite under inhalation anesthesia of isoflurane at 5% through a vaporizer in a dedicated euthanasia chamber.
Data preprocessing
To help the proposed algorithm generalize when handling heterogeneous image data, only a minimal preprocessing step was first executed. Specifically, standard score (z-score) normalization [38] was exploited to reduce the intensity variation while maintaining the detailed structures of the input rat brain MR images. The standard score is the signed number of standard deviations by which a value deviates from the mean, and it is frequently utilized to place scores from different distributions on the same scale. Mathematically, the input rat brain MR image scan \(I\) is normalized with
\(\widehat{I}=\frac{I-{\mu }_{I}}{{\sigma }_{I}}\)  (1)
where \({\mu }_{I}\) is the mean intensity of the images in the dataset, \({\sigma }_{I}\) is the corresponding standard deviation in the image dataset, and \(\widehat{I}\) is the standardized rat brain MR image.
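For illustration, Eq. (1) amounts to a few lines of Python. The following NumPy sketch applies the dataset-level statistics to a scan; the function name zscore_normalize is ours for illustration.

```python
import numpy as np

def zscore_normalize(scan, mu, sigma):
    """Standardize an MR image scan with the dataset-level mean and
    standard deviation, as in Eq. (1)."""
    return (scan - mu) / sigma

# The statistics are computed once over the whole training dataset
# and then applied to every scan (illustrative variable names):
# mu_I, sigma_I = float(np.mean(train_images)), float(np.std(train_images))
# I_hat = zscore_normalize(I, mu_I, sigma_I)
```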
An essential element of deep learning-based investigations is the use of tremendous amounts of image data in the model training phase. For biological image processing applications such as ours, the number and scope of images are substantially limited compared with famous image databases such as ImageNet. Consequently, data augmentation, a strategy that expands the amount of data by generating modified copies or newly created images from existing data, has been commonly adopted as a regularizer to lessen overfitting [26, 39]. To increase the scale and diversity of the acquired rat brain MR image data, we employed four distinct forms of data augmentation, which allowed transformed images to be generated from the original data. The transformations consist of shears (within 0.3 rad), rotations (within 30 degrees), zooming (within 20% of brain regions), and horizontal reflections, which are randomly created to increase the size of our training dataset by a factor of 1000 across all epochs in both DWI and T2WI images.
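A minimal sketch of how such random transformations could be configured with the Keras ImageDataGenerator API that ships with the Keras version used here; the paper does not show its augmentation code, so the fill mode and the pairing of image and mask generators below are our assumptions.

```python
from keras.preprocessing.image import ImageDataGenerator

# Random shears, rotations (<= 30 deg), zooms (<= 20%), and horizontal
# flips, matching the ranges reported above. Note that the unit of
# shear_range changed between Keras versions (radians in older ones,
# degrees in newer ones).
augmenter = ImageDataGenerator(
    shear_range=0.3,
    rotation_range=30,
    zoom_range=0.2,
    horizontal_flip=True,
    fill_mode='nearest',  # assumption: how exposed borders are filled
)

# flow() yields batches of randomly transformed images; using the same
# seed for a second generator keeps image/mask pairs aligned.
# image_gen = augmenter.flow(train_images, batch_size=8, seed=1)
# mask_gen = augmenter.flow(train_masks, batch_size=8, seed=1)
```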
RU-Net for rat brain extraction
Our RU-Net is a special deep learning framework that takes advantage of the decoupling utility of batch normalization [36], the skip connections of the residual network [35], and the feature concatenation of U-Net [28] for skull stripping in pathological rat brain MR images. We introduce batch normalization and the residual network into our encoder-decoder, U-Net-like architecture to accelerate convergence while reducing the gradient vanishing and explosion problems. As illustrated in Fig. 1, our RU-Net consists of 33 convolutional layers, 5 maximum pooling layers, and 5 upsampling layers. In the encoding path, there are 14 convolutional layers and 5 maximum pooling layers. Each individual rat brain MR image \(\widehat{I}\) with a dimension of \(N\times N\) is fed into the network at the input layer, followed by a \(3\times 3\) convolution that boosts the channel number to 64. This \(N\times N\times 64\) output serves two purposes: as input to the subsequent block and as input to the residual addition. The block consists of three consecutive layers, namely batch normalization (BN), activation, and convolution. By normalizing each mini-batch, the BN layer allows us to be less cautious about parameter initialization and to adopt higher learning rates, which also helps stabilize the network. The rectified linear unit (ReLU) function is utilized in the activation layer, followed by a \(3\times 3\) convolution layer for feature extraction. The \(N\times N\times 64\) dimensions are maintained throughout the entire block, i.e., across all three layers. After one additional block with the same architecture, the immediate output and the preserved convolution output are joined to establish the residual learning network in the addition layer.
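To make the encoder block concrete, the following Keras functional-API sketch reflects our reading of the description above; the helper bn_relu_conv and any details beyond what the text states are illustrative, not the authors' code.

```python
from keras.layers import Input, Conv2D, BatchNormalization, Activation, add

def bn_relu_conv(x, filters=64):
    """One block: batch normalization -> ReLU activation -> 3x3 convolution."""
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    return Conv2D(filters, (3, 3), padding='same')(x)

# Input layer followed by a 3x3 convolution that boosts the channel
# number to 64 (N x N x 64); N = 128 for DWI is assumed here.
inputs = Input(shape=(128, 128, 1))
conv0 = Conv2D(64, (3, 3), padding='same')(inputs)

# Two consecutive BN-ReLU-Conv blocks, then a residual addition joining
# the block output with the preserved convolution output.
x = bn_relu_conv(conv0)
x = bn_relu_conv(x)
stage1 = add([conv0, x])
```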
Subsequently, the output from the addition layer serves as both the input to the following maximum pooling layer and the feature map for concatenation in the decoder phase. The maximum pooling is executed using a \(2\times 2\) neighborhood with stride 2, which reduces the output to \(\left(N/2\right)\times \left(N/2\right)\times 64\). To tackle the large intensity variations in ischemic rat brain images, a pooling index transmission mechanism is introduced so that the corresponding maximum value indices are also stored for recovering the feature locations in the decoding path [40]. The maximum pooling result and its output after two equivalent block processes are united to build a deeper residual learning scheme in a second addition layer. These encoding procedures of one maximum pooling, three block processes, and one residual addition are repeated until the image dimension is scaled down to \(\left(N/16\right)\times \left(N/16\right)\). After an additional maximum pooling operation, the decoder phase starts with a contrary \(2\times 2\) maximum upsampling layer with stride 2 to produce enlarged features for concatenation. In the deepest concatenation layer, the upsampled result and the output from the deepest addition layer in the encoder phase are integrated into a double-channel structure with a dimension of \(\left(N/16\right)\times \left(N/16\right)\times 128\). In the following block processing, the output architecture reduces to \(\left(N/16\right)\times \left(N/16\right)\times 64\) after the convolution layer. This output and the outcome after three successive blocks are added to produce the deepest residual learning network in the decoding path. The subsequent maximum upsampling layer again combines the addition result with the output from the corresponding maximum pooling layer in the encoder phase, which expands the outcome to a dimension of \(\left(N/8\right)\times \left(N/8\right)\times 64\). This outcome is then concatenated with the output of the matching addition layer in the encoding path to generate a \(\left(N/8\right)\times \left(N/8\right)\times 128\) structure.
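The pooling index transmission can be realized, for example, with TensorFlow's tf.nn.max_pool_with_argmax, which returns both the pooled features and the flat indices of the maxima for SegNet-style unpooling [40]. This is one possible realization; the paper does not show its implementation.

```python
import tensorflow as tf

def pool_with_indices(x):
    """2x2 max pooling with stride 2 that also returns the flat indices
    of the maxima, to be reused for unpooling in the decoder."""
    return tf.nn.max_pool_with_argmax(
        x, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME')

# pooled, indices = pool_with_indices(stage1)
# In the decoder, scattering upsampled features back to `indices`
# restores the salient feature locations at the higher resolution,
# which is how the spatial correlation is reinforced.
```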
These procedures of upsampling, concatenation, convolution, and residual learning are repeated until the network dimension grows back to \(N\times N\times 64\). In the last block, after the BN layer, the sigmoid function is employed in the activation layer to produce output values between 0 and 1 for segmentation prediction. A final \(1\times 1\) convolution layer is utilized to consolidate all channels into a single \(N\times N\) probability map, which completes the decoder phase with 19 convolutional layers and 5 upsampling layers. The loss function \({\Lambda }\) is defined in terms of the Dice metric [41] using
\({\Lambda }=1-{\kappa }_{D}\left({{\Omega }}_{sp},{{\Omega }}_{gt}\right)\)  (2)
where \({{\Omega }}_{sp}\) represents the segmentation prediction (SP) mask, \({{\Omega }}_{gt}\) represents the ground truth (GT) mask, and \({\kappa }_{D}\) represents the Dice coefficient, which is defined as
\({\kappa }_{D}=\frac{2{\theta }_{TP}}{2{\theta }_{TP}+{\theta }_{FN}+{\theta }_{FP}}\)  (3)
where \({\theta }_{TP}\), \({\theta }_{FN}\), and \({\theta }_{FP}\) represent the true positives, false negatives, and false positives associated with \({{\Omega }}_{sp}\) and \({{\Omega }}_{gt}\), respectively. To find the best parameters of the proposed RU-Net, the Adam optimizer [42] is employed for its efficiency in computational complexity and memory usage. Varying learning rates with decaying values during the training process are employed to further accelerate convergence.
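A Keras-backend sketch of the loss in Eqs. (2) and (3), written for soft (probability-map) predictions; the smoothing constant that guards against division by zero is our addition.

```python
from keras import backend as K

def dice_coefficient(y_true, y_pred, smooth=1.0):
    """Soft Dice coefficient kappa_D between prediction and GT masks."""
    y_true_f = K.flatten(y_true)
    y_pred_f = K.flatten(y_pred)
    intersection = K.sum(y_true_f * y_pred_f)  # plays the role of theta_TP
    return (2.0 * intersection + smooth) / (
        K.sum(y_true_f) + K.sum(y_pred_f) + smooth)

def dice_loss(y_true, y_pred):
    """Loss Lambda = 1 - kappa_D, as in Eq. (2)."""
    return 1.0 - dice_coefficient(y_true, y_pred)
```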
Performance evaluation
In addition to the Dice metric described in Eq. (3), some other evaluation measures are exploited to reveal the correlation between the segmentation and GT masks. Specifically, two similarity metrics, sensitivity \({\kappa }_{st}\) and sensibility \({\kappa }_{sb}\) [43], are adopted to evaluate the degrees of under-segmentation and over-segmentation with
\({\kappa }_{st}=\frac{{\theta }_{TP}}{{\theta }_{TP}+{\theta }_{FN}}\)  (4)
and
\({\kappa }_{sb}=\frac{{\theta }_{TP}}{{\theta }_{TP}+{\theta }_{FP}}\)  (5)
respectively. The Hausdorff distance metric [44], which measures the largest distance from a point in one set to the nearest point in the other, is utilized to signify how close the segmentation and GT contours are in a Euclidean space with
\({\delta }_{h}\left({{\Gamma }}_{sp},{{\Gamma }}_{gt}\right)=\max \left\{\underset{a\in {{\Gamma }}_{sp}}{\mathrm{max}}\,\underset{b\in {{\Gamma }}_{gt}}{\mathrm{min}}\left\|a-b\right\|,\;\underset{b\in {{\Gamma }}_{gt}}{\mathrm{max}}\,\underset{a\in {{\Gamma }}_{sp}}{\mathrm{min}}\left\|a-b\right\|\right\}\)  (6)
where \({\delta }_{h}\) represents the Hausdorff distance, \(\left\| {\, \cdot \,} \right\|\) symbolizes the Euclidean norm, and \({{\Gamma }}_{sp}\) and \({{\Gamma }}_{gt}\) indicate the point sets of the contours corresponding to \({{\Omega }}_{sp}\) and \({{\Omega }}_{gt}\), respectively. A more robust variant of \({\delta }_{h}\) is the average Hausdorff distance, which computes the average distance instead of the maximum distance in Eq. (6); this variant is employed in this study and denoted \({\delta }_{ah}\). A paired t-test is used to compare the evaluation scores of the proposed framework with those of other methods. A two-tailed p-value < 0.05 is considered statistically significant.
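For reference, the four evaluation measures can be computed from binary masks and contour point sets as follows; a NumPy/SciPy sketch under the definitions above, not the authors' evaluation code.

```python
import numpy as np
from scipy.spatial.distance import cdist

def overlap_scores(sp, gt):
    """Dice, sensitivity, and sensibility from binary masks (Eqs. 3-5)."""
    sp, gt = sp.astype(bool), gt.astype(bool)
    tp = np.sum(sp & gt)   # theta_TP
    fn = np.sum(~sp & gt)  # theta_FN
    fp = np.sum(sp & ~gt)  # theta_FP
    dice = 2 * tp / (2 * tp + fn + fp)
    sensitivity = tp / (tp + fn)  # penalizes under-segmentation
    sensibility = tp / (tp + fp)  # penalizes over-segmentation
    return dice, sensitivity, sensibility

def average_hausdorff(contour_sp, contour_gt):
    """Average Hausdorff distance between two contour point sets
    (n x 2 arrays of coordinates): mean of the directed minimum
    distances in both directions."""
    d = cdist(contour_sp, contour_gt)  # pairwise Euclidean distances
    return 0.5 * (d.min(axis=1).mean() + d.min(axis=0).mean())
```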
Results
Implementation
Our proposed RU-Net framework for rat brain extraction in the two modalities of DWI and T2WI was implemented in Python 3.5 using Keras 2.1.6 [45]. All experiments were executed on an Intel Xeon E5-2620 v3 @ 2.40 GHz \(\times 24\) workstation running 64-bit Linux Ubuntu 16.04. The machine was equipped with an NVIDIA Tesla K40c GPU with 12 GB of RAM [46]. The training, validation, and testing sets were split 6:2:2 and randomly selected from the acquired image datasets. The input image dimensions were \(128\times 128\) and \(256\times 256\) for DWI and T2WI images, respectively. The training phase was executed using a mini-batch size of 8 for a total of 100 epochs. The learning rate was initialized at \(5\times {10}^{-4}\) and gradually decreased to \(1\times {10}^{-4}\) once the epoch number exceeded 20. The same RU-Net architecture was employed for both DWI and T2WI images but trained individually. There were two different sets of GT masks corresponding to the DWI and T2WI datasets, which were independently delineated by experienced neurologists on our team. This was mainly because the infarct regions exhibited in DWI and T2WI images were not identical due to different resolution capabilities. On the basis of the GT, our skull stripping results were compared with traditional methods, including the BSE [47], rBET [48], and RATS [19], as well as network-based approaches such as the 3-D PCNN [25], DeepMedic [49], and U-Net [32]. For the deep learning methods of DeepMedic and U-Net, the models were retrained using the same protocols as our RU-Net.
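The reported learning-rate policy can be reproduced with a standard Keras callback. Since only the two endpoint rates are stated, the decay path between them in the sketch below is our assumption.

```python
from keras.callbacks import LearningRateScheduler
from keras.optimizers import Adam

def lr_schedule(epoch):
    """Start at 5e-4 and gradually decrease to 1e-4 after epoch 20.
    A linear ramp is our assumption; only the endpoints are reported."""
    if epoch <= 20:
        return 5e-4
    return max(1e-4, 5e-4 - (epoch - 20) * 2e-5)

# Illustrative training call, reusing dice_loss / dice_coefficient
# from the earlier sketch:
# model.compile(optimizer=Adam(lr=5e-4), loss=dice_loss,
#               metrics=[dice_coefficient])
# model.fit(train_images, train_masks, batch_size=8, epochs=100,
#           callbacks=[LearningRateScheduler(lr_schedule)])
```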
Network cross validation
To assess the effectiveness of the proposed RU-Net skull stripping network, five-fold cross validation was exploited in the training phase. Figure 2 plots the accuracy and loss curves for the training and validation datasets in both DWI and T2WI images. Each fold had two curves representing the training and validation subjects with respect to the epoch number. All five folds exhibited quite similar accuracy and loss trace patterns. For the DWI scenario, the training curves climbed more slowly than the validation curves towards the same high segmentation accuracy. While the training curves gradually increased in accuracy in T2WI images, the corresponding validation curves reached a plateau and maintained their high accuracy towards the end of the epochs. Our RU-Net clearly achieved high skull stripping accuracy with tiny loss in the DWI and T2WI rat brain image datasets, which indicated the robustness of the developed network. Further validation of the RU-Net segmentation performance is presented in Table 1 in comparison with different architecture variants using \(3\times 3\) and \(4\times 4\) maximum pooling, \(7\times 7\) convolution, and a 4-level U-shaped network structure. The proposed RU-Net architecture exhibited the best evaluation scores with the narrowest standard deviations in terms of \({\kappa }_{D}\), \({\kappa }_{st}\), and \({\kappa }_{sb}\).
DWI skull stripping
Figure 3 illustrates qualitative skull stripping results in a sequence of DWI images using the proposed scheme along with the corresponding GT masks. The segmented brain regions (yellow) were observed to conform well to the GT contours (red). Performance measures of the skull stripping results using the Dice, sensitivity, and sensibility metrics based on five-fold cross validation are depicted in Fig. 4. The proposed RU-Net produced the highest average Dice and sensibility scores with the narrowest standard deviations compared with the DeepMedic and U-Net methods. While the average sensitivity scores of the three methods somewhat overlapped, the U-Net was slightly higher than the other two methods. Representative skull stripping results using the abovementioned seven methods are qualitatively illustrated in Fig. 5. All approaches more or less encompassed the rat brain regions, but the BSE, rBET, RATS, 3-D PCNN, and DeepMedic methods revealed apparent false positive regions. Both U-Net and RU-Net produced accurate segmentation results, with the U-Net contours smoother and the RU-Net contours deformed into the fissures, which better resembled the GT. Figure 6 demonstrates visual skull stripping results of two different subjects with DWI in 3-D view. Obvious over-segmentation and under-segmentation outcomes were generated by the traditional methods of BSE, rBET, RATS, and 3-D PCNN. More precise results were obtained using the deep learning-based methods. While the segmentation masks provided by the DeepMedic and U-Net methods contained excess components, our RU-Net scheme generated clean rat brain regions. Table 2 summarizes statistical analyses of the skull stripping results in the DWI image dataset in terms of the four evaluation metrics. The proposed RU-Net framework achieved the highest average evaluation scores of \({\kappa }_{D}=98.04\%\) (\(p<0.001\)) and \({\kappa }_{sb}=98.15\%\) (\(p<0.001\)), with the \({\kappa }_{st}\) score slightly smaller than the maximum value attained by the U-Net method. Our segmentation performance was further validated by the smallest average value of \({\delta }_{ah}=0.1161\) mm (\(p<0.001\)) compared with all competitive methods.
T2WI skull stripping
In the scenario of T2WI image segmentation, the proposed RU-Net scheme also performed well. As illustrated in Fig. 7, the segmented brain regions (yellow) closely matched the corresponding GT masks (red) in all instances. Figure 8 shows a quantitative evaluation of the skull stripping results in the T2WI dataset based on five-fold cross validation. The average Dice and sensibility scores provided by our RU-Net architecture were higher, with smaller standard deviations, than those of the DeepMedic and U-Net methods. The overlap of the average sensitivity scores among the three methods was more evident in the T2WI subjects than in the DWI subjects. We visually compared our skull stripping framework with the seven methods in Fig. 9, where two randomly selected subjects are presented. Similar to the DWI segmentation scenario, there were noticeable false positive regions in some slices using the BSE, rBET, RATS, 3-D PCNN, and DeepMedic methods. The U-Net generated smooth contours that approximately circumscribed the rat brain surfaces, whereas the proposed RU-Net achieved more accurate contours that were more consistent with the GT. Figure 10 compares the whole skull stripping outcomes of Subjects 3 and 37 in 3-D view between the different methods. Apparent segmentation errors were observed using the BSE, rBET, RATS, 3-D PCNN, and DeepMedic methods. Both U-Net and RU-Net schemes produced more precise segmentation results with fewer flaws. Nevertheless, our RU-Net achieved higher Dice scores of 97.25% and 98.08% for Subjects 3 and 37, respectively. Statistical analyses of the rat brain segmentation results in T2WI image volumes in Table 3 indicated our advantage over the other methods, with the highest average values of \({\kappa }_{D}=97.67\%\) (\(p<0.001\)) and \({\kappa }_{sb}=97.42\%\) (\(p<0.001\)). Lastly, the smallest average score of \({\delta }_{ah}=0.1406\) mm (\(p<0.001\)) attained by the RU-Net further confirmed our skull stripping efficacy.
Discussion
A new skull stripping framework for pathological rat brain MR images based on deep learning networks has been introduced. The development of this RU-Net was inspired by the demands of preclinical stroke investigation involving both DWI and T2WI image volumes. As the U-Net [28] has been successfully employed in many medical image segmentation applications [28, 32, 33], our network took advantage of its U-shaped architecture. To handle the nonuniform intensity distribution and blurred brain boundaries in ischemic rat MR images, a series of BN layers, conceived from the batch normalization scheme [36], constituted the block structure in the encoding and decoding paths. Learning was further enhanced by a residual network [35] that connects the input and output features of each block in both the encoder and decoder. A common disadvantage of deep learning-based approaches for medical image processing is the limited amount of image data compared with the scale of natural image databases such as ImageNet. We tackled this issue by augmenting the existing image data through different spatial transformations to diversify the training data. Based on five-fold cross validation with the rat brain MR image datasets, we updated and finalized the system parameters to achieve the optimal architectures. Different evaluation metrics associated with the paired t-test were employed to compare our segmentation outcomes with state-of-the-art methods.
As presented in Tables 2 and 3, comparable skull stripping results were obtained in both the DWI and T2WI image datasets using the traditional methods of BSE, rBET, RATS, and 3-D PCNN. Developed for human brain image segmentation, the BSE method produced acceptable skull stripping results around the middle slices of the rat brain image volumes. However, notable segmentation errors appeared roughly in the first and last three slices, which deteriorated the overall performance. Modified from the BET scheme, the rBET method also adopted an active contour model that was evaluated on rat brain T1-weighted and T2-weighted MR images. Obvious over-segmentation outside the rat brain boundaries reduced its accuracy due to the abnormal appearance of the DWI and T2WI image datasets, leading to the poorest sensibility scores in both scenarios. Originally validated on normal rat brain MR images, similar to the rBET method, the RATS algorithm was unable to efficiently separate the ischemic rat brain regions from the surrounding tissues, particularly in DWI images. Extended from the 2-D PCNN model and verified on mouse brain T2WI images, the 3-D PCNN algorithm generated unstable skull stripping results such that some slices exhibited apparent false positive regions in the ischemic rat image datasets, as illustrated in Figs. 5 and 9.
Different from the traditional approaches, the deep learning-based models exploit an end-to-end network, which usually provides better outcomes. As can be seen from the evaluation scores, the DeepMedic, U-Net, and RU-Net schemes exhibited higher skull stripping accuracy with smaller \({\delta }_{ah}\) values in both DWI and T2WI scenarios. Equipped with an efficient multiscale 3-D CNN and a fully connected conditional random field model, the DeepMedic scheme adequately captured the rat brain surfaces. Likewise, the skull stripping results using the U-Net method decently enclosed the rat brains in all demonstrated instances. Owing to large \({\theta }_{FP}\) regions, the U-Net exhibited low \({\kappa }_{sb}\) scores, which in turn produced higher \({\kappa }_{st}\) scores than our RU-Net. To segment the rat brain MR images with ischemia, both DeepMedic and U-Net models were retrained using the same protocols as our RU-Net to fine-tune their system parameters. All three deep learning-based frameworks were evaluated according to the five-fold cross validation in the DWI and T2WI image datasets, as revealed in Figs. 4 and 8, respectively. Statistical analyses using the Dice, sensitivity, and sensibility metrics indicated convergent characteristics of the three networks. This was mainly because the deep learning mechanisms were refreshed to adapt the systems to new image data. Without the retraining process for parameter adjustment, segmentation of unfamiliar image data can be improper. To illustrate this, Fig. 11 depicts the skull stripping results for the same slices and subjects as in Figs. 5 and 9 using the original models of the MU-Net [33], which was developed for large mouse brain segmentation in T2WI images, and the MedicDeepLabv3+ [34]. Their average \({\kappa }_{D}\) scores were 31.01% and 34.72% for the DWI dataset and 55.46% and 48.28% for the T2WI dataset, respectively.
One inevitable shortcoming of deep learning-based strategies for medical image segmentation is that the outcome may exhibit disconnected components with broken pieces and interior holes. This is mainly due to the nature of pixel-to-pixel partitioning based on feature maps at different scales and depths. Although the consecutive convolution processes include neighboring information, the involvement is shallow and limited mostly to adjacent pixels. For natural images, this partition scheme does not cause serious issues, as the color information of three channels is involved and the intensity variation is relatively subtle. For medical images, as in our scenario, a single gray-scale image is the only input to the system and inhomogeneous intensities are obviously present. As shown in Figs. 6 and 10, noticeable false positive regions apart from the brain were produced by the DeepMedic and U-Net methods. Thanks to its unique network architecture, the proposed RU-Net faithfully delineated the rat brain boundaries and achieved accurate skull stripping results with minor over-segmentation errors compared with the other networks. This is not only because our architecture contains BN layers associated with the residual network but also because the salient feature locations in the encoder are transmitted to the corresponding upsampling procedures in the decoder to strengthen the spatial correlation. From the perspective of practical applications, the outcome of deep learning-based approaches can be improved by appropriate morphological operations to acquire clean and complete brains, as sketched below. For example, the average sensibility scores of the DeepMedic and U-Net schemes in the DWI image dataset increased to 97.50% and 93.11%, respectively, and they advanced to 97.14% and 94.59% in the T2WI image dataset. Lastly, our RU-Net can be extended to multimodal learning by feeding, say, the two different modalities of DWI and T2WI images to the corresponding networks and integrating the intermediate results through an extra concatenation structure to generate the final prediction.
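As one possible realization of the morphological clean-up mentioned above, the following SciPy sketch keeps the largest connected component of a mask and fills its interior holes; the exact operations behind the reported improved scores are not detailed here.

```python
import numpy as np
from scipy import ndimage

def cleanup_mask(mask):
    """Keep the largest connected component of a binary brain mask
    and fill its interior holes."""
    labeled, num = ndimage.label(mask)
    if num == 0:
        return mask
    # Size of each connected component, indexed 1..num.
    sizes = ndimage.sum(mask, labeled, range(1, num + 1))
    largest = labeled == (np.argmax(sizes) + 1)
    return ndimage.binary_fill_holes(largest)
```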
Conclusion
In this paper, we investigated an automatic skull stripping framework for pathological rat brain MR images based on a deep learning architecture, namely RU-Net. Motivated by the demand for segmenting rat brain MR images after ischemic stroke, the proposed scheme was built on an efficient U-shaped network with embedded BN layers reinforced by the residual network. A variety of ischemic rat brain images in two different datasets, DWI and T2WI, were employed to evaluate the capability of our rat brain segmentation network. Comparable performance with high evaluation scores in terms of Dice, sensitivity, and sensibility was observed in both image datasets. Our RU-Net outperformed state-of-the-art methods, both traditional mathematical models and deep learning networks, in extracting clean rat brain regions with nonuniform intensity distributions from the acquired MR image volumes. We believe that the proposed skull stripping network has great potential for advancing preclinical stroke investigation as well as providing an efficient tool for abnormal rat brain MR image extraction, where accurate segmentation of the brain region is fundamental.
Data availability
The data that support the findings of this study are available for sharing from the corresponding authors upon reasonable request.
Abbreviations
- BET: brain extraction tool
- BN: batch normalization
- CCA: common carotid artery
- CNN: convolutional neural network
- DWI: diffusion-weighted imaging
- FCN: fully convolutional network
- FOV: field of view
- GT: ground truth
- ICA: internal carotid artery
- MRI: magnetic resonance imaging
- MU-Net: multi-task U-Net
- PCNN: pulse coupled neural network
- RBD: rat brain deformation
- ReLU: rectified linear unit
- RU-Net: Rat U-Net
- SHERM: SHape descriptor selected Extremal Regions after Morphologically filtering
- SP: segmentation prediction
- T2WI: T2-weighted MRI
- TE: echo time
- tMCAO: transient middle cerebral artery occlusion
- TR: repetition time
References
Virani SS, et al. Heart Disease and Stroke Statistics—2020 Update: a report from the American Heart Association. Circulation. 2020;141(9):e139–e596.
Khatri R, Vellipuram AR, Maud A, Cruz-Flores S, Rodriguez GJ. Current Endovascular Approach to the management of Acute ischemic stroke. Curr Cardiol Rep. 2018;20(6):46.
Fluri F, Schuhmann MK, Kleinschnitz C. Animal models of ischemic stroke and their application in clinical research. Drug Des Devel Ther. 2015;9:3445–54.
Gubskiy IL, Namestnikova DD, Cherkashova EA, Chekhonin VP, Baklaushev VP, Gubsky LV, Yarygin KN. MRI guiding of the Middle cerebral artery occlusion in rats aimed to Improve Stroke modeling. Translational Stroke Research. 2018;9(4):417–25.
Kang M, Jin S, Lee D, Cho H. MRI visualization of whole brain macro- and microvascular remodeling in a rat model of ischemic stroke: a pilot study. Sci Rep. 2020;10(1):4989.
Li Y, Zhu X, Ju S, Yan J, Wang D, Zhu Y, Zang F. Detection of volume alterations in hippocampal subfields of rats under chronic unpredictable mild stress using 7T MRI: a follow-up study. J Magn Reson Imaging. 2017;46(5):1456–63.
Mulder IA, Khmelinskii A, Dzyubachyk O, de Jong S, Rieff N, Wermer MJH, Hoehn M, Lelieveldt BPF, van den Maagdenberg AMJM. Automated ischemic lesion segmentation in MRI mouse brain data after transient middle cerebral artery occlusion. Front Neuroinform. 2017;11:3.
Yeh S-J, Tang S-C, Tsai L-K, Jeng J-S, Chen C-L, Hsieh S-T. Neuroanatomy- and pathology-based functional examinations of experimental stroke in rats: development and validation of a new behavioral scoring system. Front Behav Neurosci. 2018;12:316.
Aliena-Valero A, López-Morales MA, Burguete MC, Castelló-Ruiz M, Jover-Mengual T, Hervás D, Torregrosa G, Leira EC, Chamorro Á, Salom JB. Emergent Uric Acid Treatment is synergistic with mechanical recanalization in improving stroke outcomes in male and female rats. Neuroscience. 2018;388:263–73.
Liu Y, Unsal HS, Tao Y, Zhang N. Automatic brain extraction for Rodent MRI images. Neuroinformatics. 2020;18(3):395–406.
Huang S-M, Wu C-Y, Lin Y-H, Hsieh H-H, Yang H-C, Chiu S-C, Peng S-L. Differences in brain activity between normal and diabetic rats under isoflurane anesthesia: a resting-state functional MRI study. BMC Med Imaging. 2022;22(1):136.
Nemani A, Lowe MJ. Seed-based test–retest reliability of resting state functional magnetic resonance imaging at 3T and 7T. Med Phys. 2021;48(10):5756–64.
Li J, Liu X, Zhuo J, Gullapalli RP, Zara JM. An automatic rat brain extraction method based on a deformable surface model. J Neurosci Methods. 2013;218(1):72–82.
Oguz I, Lee J, Budin F, Rumple A, McMurray M, Ehlers C, Crews F, Johns J, Styner M. Automatic skull-stripping of rat MRI/DTI scans and atlas building. In: Proceedings of SPIE Medical Imaging, vol. 7962; 2011.
Lancelot S, Roche R, Slimen A, Bouillot C, Levigoureux E, Langlois J-B, Zimmer L, Costes N. A Multi-Atlas based method for automated anatomical rat brain MRI segmentation and extraction of PET activity. PLoS ONE. 2014;9(10):e109113.
Delora A, Gonzales A, Medina CS, Mitchell A, Mohed AF, Jacobs RE, Bearer EL. A simple rapid process for semi-automated brain extraction from magnetic resonance images of the whole mouse head. J Neurosci Methods. 2016;257:185–93.
Huang W, Zhang J, Lin Z, Huang S, Duan Y, Lu Z. Template based rodent brain extraction and atlas mapping. In: 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); 2016. pp. 4063–6.
Zhang S, Huang J, Uzunbas M, Shen T, Delis F, Huang X, Volkow N, Thanos P, Metaxas DN. 3D segmentation of rodent brain structures using hierarchical shape priors and deformable models. Med Image Comput Comput Assist Interv. 2011;14(Pt 3):611–8.
Oguz I, Zhang H, Rumple A, Sonka M. RATS: Rapid Automatic tissue segmentation in rodent brain MRI. J Neurosci Methods. 2014;221:175–82.
Isensee F, Schell M, Pflueger I, Brugnara G, Bonekamp D, Neuberger U, Wick A, Schlemmer H-P, Heiland S, Wick W, et al. Automated brain extraction of multisequence MRI using artificial neural networks. Hum Brain Mapp. 2019;40(17):4952–64.
Ali MJ, Raza B, Shahid AR. Multi-level kronecker convolutional neural network (ML-KCNN) for glioma segmentation from multi-modal MRI Volumetric Data. J Digit Imaging. 2021;34(4):905–21.
Yang Z, Liu H, Liu Y, Stojadinovic S, Timmerman R, Nedzi L, Dan T, Wardak Z, Lu W, Gu X. A web-based brain metastases segmentation and labeling platform for stereotactic radiosurgery. Med Phys. 2020;47(8):3263–76.
Bernal J, Kushibar K, Asfaw DS, Valverde S, Oliver A, Martí R, Lladó X. Deep convolutional neural networks for brain image analysis on magnetic resonance imaging: a review. Artif Intell Med. 2019;95:64–81.
Murugavel M, Sullivan JM. Automatic cropping of MRI rat brain volumes using pulse coupled neural networks. NeuroImage. 2009;45(3):845–54.
Chou N, Wu J, Bingren JB, Qiu A, Chuang K. Robust automatic rodent brain extraction using 3-D pulse-coupled neural networks (PCNN). IEEE Trans Image Process. 2011;20(9):2554–64.
Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1 Lake Tahoe, Nevada: Curran Associates Inc.; 2012: 1097–1105.
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 2015. pp. 3431–40.
Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI). Cham: Springer International Publishing; 2015. pp. 234–41.
Ding X, Peng Y, Shen C, Zeng T. CAB U-Net: an end-to-end category attention boosting algorithm for segmentation. Comput Med Imaging Graph. 2020;84:101764.
Mizusawa S, Sei Y, Orihara R, Ohsuga A. Computed tomography image reconstruction using stacked U-Net. Comput Med Imaging Graph. 2021;90:101920.
Abramova V, Clèrigues A, Quiles A, Figueredo DG, Silva Y, Pedraza S, Oliver A, Lladó X. Hemorrhagic stroke lesion segmentation using a 3D U-Net with squeeze-and-excitation blocks. Comput Med Imaging Graph. 2021;90:101908.
Hsu L-M, Wang S, Ranadive P, Ban W, Chao T-HH, Song S, Cerri DH, Walton LR, Broadwater MA, Lee S-H, et al. Automatic skull stripping of rat and mouse brain MRI data using U-Net. Front Neurosci. 2020;14:935.
De Feo R, Shatillo A, Sierra A, Valverde JM, Gröhn O, Giove F, Tohka J. Automated joint skull-stripping and segmentation with Multi-Task U-Net in large mouse brain MRI databases. NeuroImage. 2021;229:117734.
Valverde JM, Shatillo A, De Feo R, Tohka J. Automatic cerebral hemisphere segmentation in rat MRI with ischemic lesions via attention-based convolutional neural networks. Neuroinformatics. 2022.
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 2016. pp. 770–8.
Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning (ICML); 2015. pp. 448–56.
Tsai L-K, Wang Z, Munasinghe J, Leng Y, Leeds P, Chuang D-M. Mesenchymal stem cells primed with Valproate and Lithium robustly migrate to infarcted regions and facilitate recovery in a stroke model. Stroke. 2011;42(10):2932–9.
Kreyszig E. Advanced Engineering Mathematics. 10th ed. Wiley; 2011.
Gomez W, Pereira WCA, Infantosi AFC. Analysis of Co-Occurrence Texture Statistics as a function of Gray-Level quantization for classifying breast Ultrasound. IEEE Trans Med Imaging. 2012;31(10):1889–99.
Noh H, Hong S, Han B. Learning deconvolution network for semantic segmentation. In: 2015 IEEE International Conference on Computer Vision (ICCV); 2015. pp. 1520–8.
Dice LR. Measures of the amount of ecologic association between species. Ecology. 1945;26(3):297–302.
Kingma DP, Ba J. Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980; 2014.
Chang H-H, Zhuang AH, Valentino DJ, Chu W-C. Performance measure characterization for evaluating neuroimage segmentation algorithms. NeuroImage. 2009;47(1):122–35.
Rote G. Computing the minimum Hausdorff distance between two point sets on a line under translation. Inform Process Lett. 1991;38(3):123–7.
Chollet F. Keras. https://keras.io; 2015.
NVIDIA. Tesla K40 GPU accelerator overview; 2014.
Shattuck DW, Sandor-Leahy SR, Schaper KA, Rottenberg DA, Leahy RM. Magnetic resonance image tissue classification using a partial volume model. NeuroImage. 2001;13:856–76.
Wood T, Lythgoe D, Williams S. rBET: making BET work for rodent brains; 2013.
Kamnitsas K, Ledig C, Newcombe VFJ, Simpson JP, Kane AD, Menon DK, Rueckert D, Glocker B. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med Image Anal. 2017;36:61–78.
Acknowledgements
The authors would like to thank Miss Hsiao-Fu Kuo and Mr. Shih-Hsin Ho for running the experiments and preparing the data.
Funding
This work was supported in part by the Ministry of Science and Technology of Taiwan under Grant MOST 107-2320-B-002-043-MY3, MOST 108-2221-E-002-080-MY3, and MOST 111-2221-E-002-148. The funder played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Author information
Contributions
HC: Conceptualization, Methodology, Software, Validation, Funding acquisition, Writing - original draft. SY: Data curation, Methodology, Formal analysis, Writing - review & editing. MC: Formal analysis, Investigation, Writing - review & editing. SH: Conceptualization, Resources, Supervision, Funding acquisition, Writing - review & editing. All authors read and approved the final manuscript.
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Consent for publication
Not applicable.
Ethics approval and consent to participate
The study was conducted according to the guidelines of the Basel Declaration, and approved by the Institutional Animal Care and Use Committee (IACUC) of National Taiwan University College of Medicine (protocol code: 20180432 and date of approval: Jan 14 2019). All methods for the reporting of animal experiments were reported in accordance with the ARRIVE guidelines.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Chang, HH., Yeh, SJ., Chiang, MC. et al. RU-Net: skull stripping in rat brain MR images after ischemic stroke with rat U-Net. BMC Med Imaging 23, 44 (2023). https://doi.org/10.1186/s12880-023-00994-8