Scalable Neural Architecture Search for 3D Medical Image Segmentation

Sungwoong Kim¹⁶,
Ildoo Kim¹⁶,
Sungbin Lim¹⁶,
Woonhyuk Baek¹⁶,
Chiheon Kim¹⁶,
Hyungjoo Cho¹⁷,
Boogeon Yoon¹⁶ &
…
Taesup Kim^16,18

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11766))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

11k Accesses
35 Citations

Abstract

In this paper, a neural architecture search (NAS) framework is proposed for 3D medical image segmentation, to automatically optimize a neural architecture from a large design space. Our NAS framework searches the structure of each layer including neural connectivities and operation types in both of the encoder and decoder. Since optimizing over a large discrete architecture space is difficult due to high-resolution 3D medical images, a novel stochastic sampling algorithm based on a continuous relaxation is also proposed for scalable gradient based optimization. On the 3D medical image segmentation tasks with a benchmark dataset, an automatically designed architecture by the proposed NAS framework outperforms the human-designed 3D U-Net, and moreover this optimized architecture is well suited to be transferred for different tasks.

S. Kim, I. Kim and S. Lim—Contributed equally.

You have full access to this open access chapter, Download conference paper PDF

Resource Optimized Neural Architecture Search for 3D Medical Image Segmentation

Searching Learning Strategy with Reinforcement Learning for 3D Medical Image Segmentation

UXNet: Searching Multi-level Feature Aggregation for 3D Medical Image Segmentation

Keywords

1 Introduction

Recently, deep neural networks have been extensively used for medical image segmentation tasks. However, in general, their performance relies on manual trial-and-error processes for making decisions on the network architecture, hyperparameters for training, and pre-/post-procedures. Due to being restricted to manual tuning, they would have limitations in performance improvement as well as fast transfer to related tasks. Currently, the same problem in the field of general deep learning has promoted the rapid development of automated machine learning (AutoML). Yet, in contrast to the recent studies on the use of advanced AutoML algorithms such as neural architecture search (NAS) [6, 10, 16] and neural optimizer search [12] for general computer vision tasks, only few approaches using simple hyperparameter optimization have been proposed for medical imaging tasks [7, 8].

In this paper, we propose a NAS framework for AutoML in designing neural networks especially for 3D medical image segmentation. 3D U-Net [1] has been popularly used for segmenting high-resolution 3D medical images (see [5, 9, 14]) since both semantic as well as spatial information can be efficiently exploited through skip connections between an encoder and a decoder. However, a convolutional block for each layer in the 3D U-Net has been manually designed with various convolutional filter types, pooling types, skip-connections, and non-linear activation functions. Instead of using the artificial block, we employ a NAS framework to obtain an automatically tuned structure of the block, which is called a cell, for each layer in the 3D U-Net where all cell structures and the corresponding neural operation parameters (e.g. kernel weights) are simultaneously learned in the end-to-end manner. For this, four types of cells, encoder-normal, reduction, decoder-normal, and expansion cell are defined to compose the encoder as well as the decoder for the learned U-Net architecture (see Fig. 1). This is different from previous NAS approaches which merely use two types of cells (normal and reduction) for encoder-only networks [6, 10].

It is important to employ a sufficiently large search space in NAS in generating an improved network architecture on a target task. However, optimization over such a large space is difficult due to the extreme memory usage and the long run-time when dealing with high-resolution 3D images. Moreover, an exact bi-level formulation of NAS has to be solved on the mixed domain of (1) discrete variables regarding neural connections and operation types in each cell and (2) continuous operation parameters. This constraint restricts the use of a gradient method for architecture searching. To handle this problem, we propose a novel continuous approximation using Gumbel-softmax [4] sampling on the discrete variables. This makes it possible to use a stochastic gradient descent (SGD) in bi-level optimization. This sampling procedure also enables to reduce the computational burden of taking the entire connectivities and operations into account within an outrageously large network originated from the continuous relaxation. Namely, the proposed differentiable NAS with stochastic sampling supports great scalability in terms of solvable large search space with reduced computational cost. To our best knowledge, this is the first work to exploit a complete NAS framework for automatically designing an architecture for the task of 3D medical image segmentation.

Experimental results on the benchmark 3D medical image segmentation dataset show that in comparison to the human-designed 3D U-Net [1], the network obtained by the proposed scalable NAS produces more accurate results with similar computational complexity. It is furthermore shown that the found architecture from a task having large amounts of labeled data can be transferred to build a network for different segmentation tasks with various medical modalities including MRI and CT that have small amounts of labeled data and achieves better generalization performances.

2 Method

In this section, we first describe an architecture search space for 3D medical image segmentation. Then, we present a SGD-based bi-level optimization to simultaneously learn both of the architecture and the corresponding neural operation parameters.

2.1 Search Space for 3D Medical Image Segmentation

Following the idea of micro search space popularly used in the state-of-the-art NAS approaches [6, 10, 16], U-Net-like networks are designed as repeated encoder and decoder cells (see Fig. 1). Here, a cell C is one of four cell-types - encoder-normal ($C_{\mathsf {enc}}$), reduction ($C_{\mathsf {red}}$), decoder-normal ($C_{\mathsf {dec}}$), and expansion ($C_{\mathsf {exp}}$) - and the normal cells and resizing cells are stacked alternately with skip connections between the cells in the encoder and the cells in the decoder. Note that an inter-cell, a copy of $C_{\mathsf {enc}}$, is deployed between the encoder and decoder. Every cell takes two outputs of the last previous two cells as inputs^{Footnote 1} except the first reduction cell. A predefined first convolutional block, called a stem cell, duplicates its output as two inputs for the first reduction cell. The segmentation output is obtained from the predefined last convolutional block, referred to as an out cell.

The neural structure in each cell $C \in \mathcal {C} := \{ C_{\mathsf {enc}}, C_{\mathsf {red}}, C_{\mathsf {dec}}, C_{\mathsf {exp}}\}$ is represented as a directed acyclic graph (DAG). Let ${\mathcal G}=\mathcal {G}(C) = (\mathcal {V},\mathcal {E})$ be the DAG where each node $i\in \mathcal {V}$ corresponds to an intermediate feature vector $\mathbf{x}^{i}$ in cell C, and each directed edge $(i,j)\in \mathcal {E}$ stands for a connection between nodes i and j with a certain operation $o^{(i,j)}\in \mathcal {O}$ such that $\mathcal {O}$ denotes the set of all candidate operations, and $\mathbf {x}^{j} = \sum _{(i,j)\in \mathcal {E}}o^{(i,j)}(\mathbf {x}^{i})$. The output of a cell is a channel-wise concatenation of all the intermediate nodes. Therefore, the architecture search problem now amounts to find the best combination of all edge operations in the four cell-types. Basically, even the same type of cells can have different structures according to their layer levels. However, in this work, for simplicity, all cells that have a common type share a common structure regardless of layer levels. Note that a zero operation is also one of the candidate operations to optimize the neural connectivities as well; zero means a disconnection between two nodes.

2.2 Stochastic Bi-level Optimization with Operation Sampling

We first represent the selected edge operation using the one-hot vector $\mathbf{z}^{(i,j)} = \{z_{o}^{(i,j)}\mid o\in \mathcal {O}\}$ and operation vector $\mathbf {o}^{(i,j)}=\{o(\mathbf {x}^{i};\theta _{o}^{(i,j)}) | o\in \mathcal {O}\}$:

$$\begin{aligned} o^{(i,j)}(\mathbf{x}^{i}) = \langle \mathbf {z}^{(i,j)},\mathbf {o}^{(i,j)} \rangle = \sum _{o\in \mathcal {O}} z_{o}^{(i,j)} o(\mathbf{x}^{i}; \mathbf{\theta }_{o}^{(i,j)}), \end{aligned}$$

(1)

where $\mathbf{\theta }_{o}^{(i,j)}$ denotes the parameter set of the operation o on edge (i, j). Then, finding the best cell architecture corresponds to solving the following bi-level optimization problem:

$$\begin{aligned} \begin{aligned} \min _{Z} \qquad&\mathcal {L}_{\mathsf {val}}(\varTheta ^{*}(Z), Z) \\ \text {s.t.} \qquad&\varTheta ^{*}(Z) = \underset{\varTheta }{\text {argmin }}\mathcal {L}_{\mathsf {train}}(\varTheta , Z), \end{aligned} \end{aligned}$$

(2)

where $(\varTheta , Z) = \{ (\theta ^{(i,j)}, \mathbf{z}^{(i,j)}) \mid (i,j) \in \mathcal {E}(C), C \in \mathcal {C}\}$, and $\mathcal {L}_{\mathsf {val}}$ and $\mathcal {L}_{\mathsf {train}}$ are the validation loss and training loss, respectively. Bi-level programming (2), is hard to solve since its search space is the mixed domain of continuous variables $\varTheta $ and discrete variables Z. DARTS [6] try to circumvent this difficulty by relaxing Z to a continuous logits and considering mixed operations in the edges. This allows to use SGD method to obtain an approximate solution and derive the final architecture from the relaxed variables by taking the operation with the highest strength on each edge. However, this method is infeasible since training the mixed operations in edges requires the extremely large memory usage and the long run-time for desired high-resolution 3D image segmentation tasks.

To overcome the aforementioned problems, we propose a modified optimization, called stochastic bi-level optimization, by first treating Z as random discrete variables and then replacing (2) as

$$\begin{aligned} \begin{aligned} \min _{\alpha } \qquad&\mathbb {E}_{Z\sim P_{\alpha }}[\mathcal {L}_{\mathsf {val}}(\varTheta ^{*}(Z), Z)] \\ \text {s.t.} \qquad&\varTheta ^{*}(Z) = \underset{\varTheta }{\text {argmin }} \mathcal {L}_{\mathsf {train}}(\varTheta , Z), \end{aligned} \end{aligned}$$

(3)

where $P_\alpha $ is the discrete distribution on Z, parameterized by continuous variable $\alpha $. Since it is intractable to exactly compute $\nabla _{\alpha } \mathbb {E}_{Z\sim P_{\alpha }}[\mathcal {L}_{\mathsf {val}}(\varTheta ^{*}(Z), Z)]$, we estimated it through continuous relaxation using the Gumbel-softmax reparametrization technique [4, 11, 13] as

$$\begin{aligned} \begin{aligned} \!\!\!\nabla _{\alpha } \mathbb {E}_{Z\sim P_{\alpha }}[\mathcal {L}_{\mathsf {val}}(\varTheta ^{*}(Z), Z)] \approx \mathbb {E}_{\epsilon \sim \mathsf {Gumbel}(0,1)} [\nabla _{\alpha }\mathcal {L}_{\mathsf {val}}(\varTheta ^{*}(\bar{Z}(\alpha ,\epsilon ; \tau )), \bar{Z}(\alpha ,\epsilon ; \tau )],\!\! \end{aligned} \end{aligned}$$

(4)

where continuously relaxed variables $\bar{Z}(\alpha ,\epsilon ; \tau ) = \mathsf {softmax}((\alpha + \epsilon )/\tau )$, $\tau $ denotes the temperature, and $\epsilon $ is random variable drawn from the Gumbel distribution. Here, the expectation in (4) is approximated with $\epsilon $-sampling. It is noted that as $\tau \rightarrow 0$, the distribution of $\bar{Z}$ is identical to $P_\alpha $, which means that by annealing $\tau $ we can enforce $\bar{Z}$ to be one-hot discrete variables Z during training; the relaxed architecture is forced to be converged to the final architecture.

When alternatively updating $\varTheta $ and $\alpha $ by respective gradient descents, we again replace $\bar{Z}$ with $\hat{Z}$ by sampling two operations on each edge from the Gumbel-softmax with rescaling of the corresponding two operation weights to be summed to one, as shown in Algorithm 1. Note that due to $\tau $-annealing,^{Footnote 2} the number of sampled operations on each edge is naturally reduced from two to one during training. This stochastic operation sampling supports improved scalability in terms of solvable large search space with small computational cost.^{Footnote 3}

3 Experiments

Dataset and Evaluation. The proposed scalable NAS (SCNAS) was evaluated on the three 3D segmentation tasks, (1) brain tumor (MRI, 484 labeled images, 3 classes), (2) heart (MRI, 20 labeled images, 1 class), and (3) lung (CT, 64 labeled images, 1 class), from the Medical Segmentation Decathlon challenge (MSD, http://medicaldecathlon.com) where each task has different input modalities and sizes as well as different foreground classes, which is therefore suitable for evaluating the generalizability and transferability of the SCNAS. Since the ground-truth labels for test images are not provided in the MSD dataset, the evaluation was conducted by 5-fold cross-validation (CV) on the training images with the average dice similarity coefficient as the metric. Here, we used the splitting provided by the authors of the 3D nnU-Net [2].^{Footnote 4} For SCNAS, the training set after the validation split was split again into two sets with a ratio of 4:1 for respectively optimizing the operation parameters and the architecture parameters.

Implementation Details. The performances of the SCNAS are compared to those obtained by our implementation of the baseline 3D U-ResNet [5, 14], which makes use of residual blocks, multiple segmentation maps [5], and attention gates [9], as well as those from the 3D nnU-Net [2] that can be considered as the best performed single model from the perspective of challenge results. In both of the 3D U-ResNet and the SCNAS, patch-based training and inference were carried out such that each image was randomly cropped to the region of nonzero values with the predefined resolution during training, while in testing, the prediction results were obtained by combining patch-based inference results with 50% overlap. The input patch size was basically set to $128\times 128\times 128$ and modified for each task taking median shapes and memory constraints into account just like that used for the 3D nnU-Net in [2]. Since even the same task provides 3D images with heterogeneous voxel spacings, the input images were first resampled to have an equal voxel spacing of 0.7 mm $\times $ 0.7 mm $\times $ 0.7 mm, and then z-normalization was separately applied to each input channel. Following [2], we also utilized the data augmentation techniques at both training and testing time with the same kinds and parameters that used in [2]. However, unlike [2], network-cascade, prediction-ensembling from different architectures, and the removal of small connected components were not adopted in this evaluation to solely examine the effects by the use of NAS in designing the network architecture.

The set of operations $\mathcal {O}$ on each edge in the SCNAS consists of the following eight operations: $3\times 3\times 3$ convolutions, depthwise separable dilated $3\times 3\times 3$ convolutions with rate 2, 3 and 4, $3\times 3 \times 3$ max and average 3D pooling, identity (skip connection), and zero. Here, we used the LeakyReLU-Conv-InstanceNorm for convolutional operations. As shown in Fig. 1, the whole network in the SCNAS is composed of 12 automatically designed cells, each of which has 4 nodes. This number of stacked cells is consistent with that of the 3D U-ResNet in terms of respective three times of downsampling and upsampling by a factor of 2. Here, all operations in the reduction cell in the SCNAS are of stride two while the expansion cells perform pre-upsampling for the inputs of the cell. Similar to the 3D U-ResNet, the reduction and expansion cells in the SCNAS respectively double and halve the number of output channels of given inputs.

It is noted that the SCNAS first optimized all of cell architectures using 48 output channels of the stem cell in order to fit a batch size of 1 into a single GPU.^{Footnote 5} Then, a larger network was constructed by increasing the number of stem channels to 68 with found cell-topologies and was retrained from scratch. Here, 68 channels makes the computational complexity for inference of a found network by SCNAS to be similar to the baseline 3D U-ResNet, which has 32 output channels in the first convolutional block, in terms of FLOPs: 419.59 GFLOPs (3D U-ResNet) vs. 424.76 GFLOPs (SCNAS) on the brain tumor task.

The SCNAS was trained for 200 epochs with a batch size of 4, which took one day on 4 V100 GPUs. In this SCNAS training, the ADAM optimizer were used where the initial learning rates/beta parameters were as set to be 0.025/(0.1, 0.001) for training operation parameters $\varTheta $ and 0.003/(0.5, 0.999) for training architecture parameters $\alpha $. If a plateau for 20 epochs on the training loss was detected, the learning rate was reduced by a factor of 10. When retraining the SCNAS models as well as training the 3D U-ResNet models, an initial learning rate of 0.0003 and beta parameters of (0.9, 0.999) for the ADAM optimizer were used with a batch size of 8, where the learning rate was reduced by a factor of 5 if a training loss was not reduced for 30 epochs, and the iteration was terminated either if it lasted for 500 epochs or if the learning rate was smaller than $10^{-7}$. The loss function for both 3D U-ResNet and SCNAS is the Jaccard distance [3, 5, 15]. We empirically observed that the Jaccard distance slightly outperformed the dice loss [2] in both models.

Table 1. Average dice similarity coefficients (%) on three tasks of MSD. [2] obtained their 3D nnU-Net results by model selection based on the validation loss.

Full size table

Results. Table 1 shows that the SCNAS produced better architectures than the (human-designed) 3D U-ResNet in terms of the overall performances. Especially, the performances of SCNAS are comparable or even better than those of the 3D nnU-Net [2]. Here, it should be noted that the 3D nnU-Net performed model selection based on the validation loss during their 5-fold CV while ours did not take any validation result into account during training. On the heart and lung segmentation tasks, which have only 20 and 64 labeled images, respectively, the 3D U-ResNet as well as the SCNAS can be prone to overfitting on the training set. Therefore, we transferred the found architecture by SCNAS from the first CV fold of the brain tumor task having 484 labeled images to these tasks. For this, we modified the stem cell architecture to match the number of input channels according to each task, and the operation parameters in the transferred architecture were retrained from scratch on each task. As a result, the transferred architecture from the brain tumor task achieved better generalization performances in comparison to their own NAS results. Figure 2 shows the optimized cell architectures by SCNAS on the brain tumor task. We conjecture that the selected dilated convolutions are helpful to reflect a more global context for improving segmentation results. Example input images and the corresponding segmentation outputs from the brain tumor task are presented in Fig. 3, which shows better segmentation results by SCNAS compared to 3D U-ResNet.

4 Conclusion

In this work, a complete NAS framework for automatically designing an architecture is proposed and demonstrated on the benchmark dataset of 3D medical image segmentation tasks. In the proposed framework, NAS is formulated as finding the optimal structure of four types of cells composing an encoder as well as a decoder. We introduce a novel stochastic sampling algorithm which results in significant improvement in terms of the scalability suitable for handling high-resolution 3D medical images. Empirical evaluation demonstrates that the automatically optimized network via the proposed NAS outperforms the manually designed 3D U-Net, and the learned architecture is successfully transferred to different segmentation tasks.

Notes

1.
In the decoder, before used as one of inputs of the current cell, an output of the last previous cell is summed with an output of the encoder cell at the same level.
2.
We set the annealing schedule as $\tau = \max (0.001, \exp (-0.025t))$.
3.
We tried sampling one operation at a time, but the performance was not improved because of the use of high bias architecture and insufficient architecture variation (exploration) especially in the early stage of training.
4.
https://github.com/MIC-DKFZ/nnUNet.
5.
DARTS [6] requires approximately 4 times more GPU memory in comparison to the SCNAS during architecture search with the same number of channels.

References

Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O.: 3D U-Net: learning dense volumetric segmentation from sparse annotation. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) MICCAI 2016. LNCS, vol. 9901, pp. 424–432. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46723-8_49
Chapter Google Scholar
Isensee, F., et al.: nnU-Net: self-adapting framework for u-net-based medical image segmentation (2018)
Google Scholar
Jaccard, P.: The distribution of the flora in the alpine zone. New Phytol. 11(2), 37–50 (1912)
Article Google Scholar
Jang, E., Gu, S., Poole, B.: Categorical reparameterization with gumbel-softmax. In: ICLR (2017)
Google Scholar
Kayalibay, B., Jensen, G., van der Smagt, P.: CNN-based segmentation of medical imaging data. CoRR abs/1701.03056 (2017). http://arxiv.org/abs/1701.03056
Liu, H., Simonyan, K., Yang, Y.: Darts: differentiable architecture search. arXiv preprint arXiv:1806.09055 (2018)
Mortazi, A., Bagci, U.: Automatically designing CNN architectures for medical image segmentation. In: Shi, Y., Suk, H.-I., Liu, M. (eds.) MLMI 2018. LNCS, vol. 11046, pp. 98–106. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00919-9_12
Chapter Google Scholar
Naceur, M.B., Saouli, R., Akil, M., Kachouri, R.: Fully automatic brain tumor segmentation using end-to-end incremental deep neural networks in MRI images. Comput. Methods Programs Biomed. 166, 39–49 (2018)
Article Google Scholar
Oktay, O., et al.: Attention U-Net: learning where to look for the pancreas. In: MIDL (2018)
Google Scholar
Pham, H., Guan, M.Y., Zoph, B., Le, Q.V., Dean, J.: Efficient neural architecture search via parameter sharing. In: ICML (2018)
Google Scholar
Veit, A., Belongie, S.: Convolutional networks with adaptive inference graphs. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11205, pp. 3–18. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01246-5_1
Chapter Google Scholar
Wichrowska, O., et al.: Learned optimizers that scale and generalize. In: ICML (2017)
Google Scholar
Xie, S., Zheng, H., Liu, C., Lin, L.: SNAS: stochastic neural architecture search. In: ICLR (2019)
Google Scholar
Yu, L., Yang, X., Chen, H., Qin, J., Heng, P.A.: Volumetric ConvNets with mixed residual connections for automated prostate segmentation from 3D MR images. In: AAAI (2017)
Google Scholar
Yuan, Y., Chao, M., Lo, Y.C.: Automatic skin lesion segmentation using deep fully convolutional networks with Jaccard distance. IEEE Trans. Med. Imaging 36(9), 1876–1886 (2017)
Article Google Scholar
Zoph, B., Vasudevan, V., Shlens, J., Le, Q.V.: Learning transferable architectures for scalable image recognition. In: CVPR (2018)
Google Scholar

Download references

Acknowledgement

We thank the Kakao Brain Cloud team for supporting to efficiently use GPU clusters for large-scale experiments.

Author information

Authors and Affiliations

Kakao Brain, Pangyo, Seongnam, Gyeonggi, South Korea
Sungwoong Kim, Ildoo Kim, Sungbin Lim, Woonhyuk Baek, Chiheon Kim, Boogeon Yoon & Taesup Kim
Department of Transdisciplinary Studies, Seoul National University, Seoul, South Korea
Hyungjoo Cho
MILA, Université de Montréal, Montreal, Canada
Taesup Kim

Authors

Sungwoong Kim
View author publications
You can also search for this author in PubMed Google Scholar
Ildoo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sungbin Lim
View author publications
You can also search for this author in PubMed Google Scholar
Woonhyuk Baek
View author publications
You can also search for this author in PubMed Google Scholar
Chiheon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hyungjoo Cho
View author publications
You can also search for this author in PubMed Google Scholar
Boogeon Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Taesup Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sungwoong Kim .

Editor information

Editors and Affiliations

University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Dinggang Shen
University of Georgia, Athens, GA, USA
Tianming Liu
Western University, London, ON, Canada
Terry M. Peters
Yale University, New Haven, CT, USA
Lawrence H. Staib
University of Strasbourg, Illkirch, France
Caroline Essert
United Imaging Intelligence, Shanghai, China
Sean Zhou
University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Pew-Thian Yap
Western University, London, ON, Canada
Ali Khan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, S. et al. (2019). Scalable Neural Architecture Search for 3D Medical Image Segmentation. In: Shen, D., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2019. MICCAI 2019. Lecture Notes in Computer Science(), vol 11766. Springer, Cham. https://doi.org/10.1007/978-3-030-32248-9_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-32248-9_25
Published: 10 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32247-2
Online ISBN: 978-3-030-32248-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Scalable Neural Architecture Search for 3D Medical Image Segmentation

Abstract

Similar content being viewed by others

Resource Optimized Neural Architecture Search for 3D Medical Image Segmentation

Searching Learning Strategy with Reinforcement Learning for 3D Medical Image Segmentation

UXNet: Searching Multi-level Feature Aggregation for 3D Medical Image Segmentation

Keywords

1 Introduction

2 Method

2.1 Search Space for 3D Medical Image Segmentation

2.2 Stochastic Bi-level Optimization with Operation Sampling

3 Experiments

4 Conclusion

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Scalable Neural Architecture Search for 3D Medical Image Segmentation

Abstract

Similar content being viewed by others

Resource Optimized Neural Architecture Search for 3D Medical Image Segmentation

Searching Learning Strategy with Reinforcement Learning for 3D Medical Image Segmentation

UXNet: Searching Multi-level Feature Aggregation for 3D Medical Image Segmentation

Keywords

1 Introduction

2 Method

2.1 Search Space for 3D Medical Image Segmentation

2.2 Stochastic Bi-level Optimization with Operation Sampling

3 Experiments

4 Conclusion

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation