1 Introduction

MRI-guided radiotherapy is an emerging technology for improving treatment accuracy over conventional CT-based radiotherapy, owing to the better soft-tissue contrast of MR compared with CT images. Real-time, accurate tumor segmentation on MRI can help deliver a high dose to tumors while reducing the dose to normal tissue. However, because MRI-guided radiotherapy is not yet part of standard of care, very few MRIs are available for training. We therefore developed an adversarial domain adaptation approach that leverages large CT datasets for tumor segmentation on MRI.

Although deep neural networks excel at learning from large amounts of (labeled) data, their accuracy drops when they are applied to novel datasets or domains [1]. The difference between the source and target domain distributions is called domain shift. Commonly used fine-tuning methods require prohibitively large amounts of labeled data in the target domain. As an alternative, domain adaptation methods attempt to minimize domain shift either by sharing features [2] or by learning to reconstruct the target domain from the source domain [3, 4]. In essence, domain adaptation methods learn the marginal distributions [5] to transform the source into the target domain.

The problems of domain shift are exacerbated in medical images, where different imaging modalities capture the physical properties of the underlying anatomy differently (e.g., CT vs. MRI). For example, whereas bones appear hyper-dense on CT and dark on MRI, tumors appear with contrast similar to normal soft tissue on CT but have a distinct appearance on MRI (Fig. 1(a) and (b)). Consequently, learning the marginal distributions of the domains alone may not be sufficient.

Fig. 1. MRI synthesized from a representative (a) CT image using (c) cycle-GAN [5] and (d) the proposed method. The corresponding MRI scan for (a) is shown in (b). As shown, the proposed method (d), which uses the tumor-aware loss, fully preserves the tumor in the synthesized MRI, unlike cycle-GAN (c).

Cross-domain adaptation of highly different modalities has been applied in medical image analysis for image synthesis using paired [6] and unpaired [7] images, as well as for segmentation [8, 9]. However, all of these approaches aim to synthesize images that match only the marginal distribution, not structure-specific conditional distributions such as that of tumors. Segmentation or classification using such synthetic images therefore leads to lower accuracy.

To address this, we introduce a novel target-specific loss, called the tumor-aware loss, for unsupervised cross-domain adaptation that preserves tumors in MRIs synthesized from CT images (Fig. 1(d)), which cannot be achieved with the cycle loss alone (Fig. 1(c)).

2 Method

Our objective is to learn to segment tumors in MR images through domain adaptation from CT to MRI, where we have access to a reasonably sized labeled dataset in the source domain \((X_{CT}, y_{CT})\) but only a very limited number of target samples \(X_{MRI} \ll X_{CT}\) and even fewer labels \(y_{MRI}\). Our solution first employs tumor-aware unsupervised cross-domain adaptation to synthesize a reasonably large number of MRIs from CT through adversarial training. Second, we combine the synthesized MRIs with a small fraction of labeled real MRIs and train a U-net [10] to generate tumor segmentations, as outlined in Fig. 2.

Fig. 2. Approach overview. \(X_{CT}\) and \(X_{MRI}\) are the real CT and MRI; \(X_{CT}^{MRI}\) and \(X_{MRI}^{CT}\) are the synthesized MRI and CT images; \(y_{CT}\) is the CT image label; \(G_{CT \rightarrow MRI}\) and \(G_{MRI \rightarrow CT}\) are the CT and MRI transfer networks; \(\tilde{X}_{MRI}\) and \(\tilde{y}_{MRI}\) are a small sample set from the real MRI, used to train semi-supervised segmentation.

2.1 Step 1: MRI Synthesis Using Tumor-Aware Unsupervised Cross-Domain Adaptation

The first step is to learn a mapping \(G_{CT \rightarrow MRI}\) that synthesizes MRI from CT images to fool a discriminator \(D_{MRI}\) through adversarial training [11]. Simultaneously, we train a second network that learns the reverse mapping \(G_{MRI \rightarrow CT}\). The adversarial losses \(L^{MRI}_{adv}\), for synthesizing MRI from CT, and \(L^{CT}_{adv}\), for synthesizing CT from MRI, are computed as:

$$\begin{aligned} \begin{aligned} L^{MRI}_{adv}(G_{CT \rightarrow MRI}, D_{MRI}, X_{MRI}, X_{CT})&= \mathbb {E}_{x_{m} \sim X_{MRI}} [\log (D_{MRI}(x_{m}))] \\&\quad + \mathbb {E}_{x_{c} \sim X_{CT}} [\log (1-D_{MRI}(G_{CT \rightarrow MRI}(x_{c})))] \\ L^{CT}_{adv}(G_{MRI \rightarrow CT}, D_{CT}, X_{CT}, X_{MRI})&= \mathbb {E}_{x_{c} \sim X_{CT}} [\log (D_{CT}(x_{c}))] \\&\quad + \mathbb {E}_{x_{m} \sim X_{MRI}} [\log (1-D_{CT}(G_{MRI \rightarrow CT}(x_{m})))] \end{aligned} \end{aligned}$$
(1)

where \(x_{c}\) and \(x_{m}\) are real images sampled from the CT (\(X_{CT}\)) and MRI (\(X_{MRI}\)) domains, respectively. The total adversarial loss (Fig. 2 (purple ellipse)) is then the sum of the two losses, \(L_{adv}=L^{MRI}_{adv}+L^{CT}_{adv}\). We also compute a cycle-consistency loss [5] to regularize the images synthesized through independent training of the two networks. Letting the synthesized images be \(x^{'}_{m}=G_{CT \rightarrow MRI}(x_{c})\) and \(x^{'}_{c}=G_{MRI \rightarrow CT}(x_{m})\), the cycle-consistency loss \(L_{cyc}\) is calculated as:

$$\begin{aligned} \begin{aligned} L_{cyc}(G_{CT \rightarrow MRI}, G_{MRI \rightarrow CT}, X_{CT}, X_{MRI}&) = \mathbb {E}_{x_{c} \sim X_{CT}}\left[ \left\| G_{MRI \rightarrow CT}(x^{'}_{m}) - x_{c}\right\| _{1}\right] \\&+ \mathbb {E}_{x_{m} \sim X_{MRI}}\left[ \left\| G_{CT \rightarrow MRI}(x^{'}_{c}) - x_{m}\right\| _{1}\right] . \end{aligned} \end{aligned}$$
(2)
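
To make Eqs. (1) and (2) concrete, below is a minimal PyTorch sketch of the adversarial and cycle-consistency losses. The binary cross-entropy formulation of the log terms and all function and variable names (e.g., `G_ct2mri`, `D_mri`) are our illustrative assumptions, not the authors' released code.

```python
import torch
import torch.nn.functional as F

def adversarial_loss(D, real, fake):
    """One direction of Eq. (1): log D(real) + log(1 - D(fake)),
    written as binary cross-entropy on the discriminator logits."""
    real_logits, fake_logits = D(real), D(fake)
    loss_real = F.binary_cross_entropy_with_logits(
        real_logits, torch.ones_like(real_logits))
    loss_fake = F.binary_cross_entropy_with_logits(
        fake_logits, torch.zeros_like(fake_logits))
    return loss_real + loss_fake

def cycle_loss(G_ct2mri, G_mri2ct, x_ct, x_mri):
    """Eq. (2): L1 distance between each real image and its
    round-trip translation through both generators."""
    x_mri_fake = G_ct2mri(x_ct)   # x'_m = G_CT->MRI(x_c)
    x_ct_fake = G_mri2ct(x_mri)   # x'_c = G_MRI->CT(x_m)
    return (F.l1_loss(G_mri2ct(x_mri_fake), x_ct)
            + F.l1_loss(G_ct2mri(x_ct_fake), x_mri))

# L_adv = adversarial_loss(D_mri, x_mri, G_ct2mri(x_ct)) \
#       + adversarial_loss(D_ct, x_ct, G_mri2ct(x_mri))
```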

The cycle-consistency and adversarial losses only constrain the model to learn a global mapping that matches the marginal distribution, not the conditional distribution pertaining to individual structures such as tumors. A model trained with these losses alone is therefore not required to preserve tumors, which can lead to deterioration or total loss of tumors in the synthesized MRIs (Fig. 1(c)). Hence, we introduce a tumor-aware loss that forces the network to preserve tumors. Specifically, the tumor-aware loss is composed of a tumor loss (Fig. 2 (red ellipse)) and a feature loss (Fig. 2 (orange ellipse)). We compute the tumor loss by training two parallel tumor detection networks, implemented as simplified U-nets [10], for CT (\(U_{CT}\)) and synthesized MRI (\(U_{MRI}\)). The tumor loss constrains the CT- and synthetic MRI-based U-nets to produce similar tumor segmentations, thereby preserving tumors; it is computed as:

$$\begin{aligned} \begin{aligned} L_{tumor}&= \mathbb {E}_{x_{c} \sim X_{CT}, y_{c} \sim y_{CT}}[\log P(y_{c}\,|\,G_{CT \rightarrow MRI}(x_{c}))] \\&\quad +\mathbb {E}_{x_{c} \sim X_{CT}, y_{c} \sim y_{CT}}[\log P(y_{c}\,|\,x_{c})]. \end{aligned} \end{aligned}$$
(3)

The tumor feature loss \(L_{feat}\), in turn, forces the high-level features of \(X_{CT}\) and \(X_{CT}^{MRI}\) to be similar, using a constraint inspired by [12]:

$$\begin{aligned} L_{feat}=\mathbb {E}_{x_{c} \sim X_{CT}}\left[ \frac{1}{C \times H \times W }\left\| \phi _{CT}(x_{c})-\phi _{MRI}(G_{CT \rightarrow MRI}(x_{c}))\right\| _{2}^{2}\right] , \end{aligned}$$
(4)

where \(\phi _{CT}\) and \(\phi _{MRI}\) are the high-level features extracted from \(U_{CT}\) and \(U_{MRI}\), respectively, and C, H, and W denote the channel, height, and width dimensions of the feature maps. The total loss is then expressed as:

$$\begin{aligned} L_{total}=L_{adv}+\lambda _{cyc}{L_{cyc}}+\lambda _{tumor}{L_{tumor}}+\lambda _{feat}{L_{feat}}, \end{aligned}$$
(5)

where \(\lambda _{cyc}\), \(\lambda _{tumor}\) and \(\lambda _{feat}\) are the weighting coefficients for each loss. During training, we alternately update the domain transfer (generator) network G, the discriminator D, and the tumor-constraint network U with the following gradients: \(-\varDelta _{\theta _{G}}(L_{adv}+ \lambda _{cyc}{L_{cyc}}+ \lambda _{tumor}{L_{tumor}}+ \lambda _{feat}L_{feat})\), \(-\varDelta _{\theta _{D}}(L_{adv})\), and \(-\varDelta _{\theta _{U}}(L_{tumor}+\lambda _{feat}L_{feat})\).
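
A minimal sketch of the tumor, feature, and total losses (Eqs. (3)-(5)) follows, assuming the log-likelihood terms of Eq. (3) are implemented as (negated) cross-entropy and that the high-level feature maps of \(U_{CT}\) and \(U_{MRI}\) are available; all names are illustrative.

```python
import torch
import torch.nn.functional as F

def tumor_loss(U_ct, U_mri, G_ct2mri, x_ct, y_ct):
    """Eq. (3): both U-nets must recover the CT tumor label y_ct
    (shape (N, H, W), long), one from the real CT and one from the
    synthesized MRI. The log-likelihoods are written as cross-entropy."""
    return (F.cross_entropy(U_mri(G_ct2mri(x_ct)), y_ct)
            + F.cross_entropy(U_ct(x_ct), y_ct))

def feature_loss(phi_ct, phi_mri):
    """Eq. (4): mean squared distance between the high-level features
    of U_CT and U_MRI; mse_loss averages over C x H x W."""
    return F.mse_loss(phi_mri, phi_ct)

def total_loss(l_adv, l_cyc, l_tumor, l_feat,
               lam_cyc=10.0, lam_tumor=5.0, lam_feat=1.0):
    """Eq. (5), with the weighting coefficients reported in Sect. 2.3."""
    return l_adv + lam_cyc * l_cyc + lam_tumor * l_tumor + lam_feat * l_feat
```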

2.2 Step 2: Semi-supervised Tumor Segmentation from MRI

The MRIs synthesized in the first step were combined with a small set of labeled real MRIs (\(\tilde{X}_{MRI}\) and \(\tilde{y}_{MRI}\) in Fig. 2) to train a U-net [10] with the Dice loss [13] (Fig. 2 (blue ellipse)) to generate tumor segmentations. The adversarial MRI-synthesis networks were frozen prior to semi-supervised segmentation training to prevent leakage of MRI label information into the synthesis step.
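
A minimal sketch of the soft Dice loss [13] used to train the segmentation U-net; the smoothing constant `eps` is an assumption.

```python
import torch

def dice_loss(pred, target, eps=1.0):
    """Soft Dice loss. pred holds sigmoid probabilities and target the
    binary tumor mask, both of shape (N, 1, H, W)."""
    intersection = (pred * target).sum(dim=(1, 2, 3))
    union = pred.sum(dim=(1, 2, 3)) + target.sum(dim=(1, 2, 3))
    return 1.0 - ((2.0 * intersection + eps) / (union + eps)).mean()
```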

2.3 Network Structure and Implementation

The generators G and discriminators D of the CT and MRI synthesis networks were implemented similarly to those in [5]. We tied the penultimate layers of \(U_{MRI}\) and \(U_{CT}\). The details of all networks are provided in the supplementary documents. The networks were implemented with the PyTorch library [14] and trained on an Nvidia GTX 1080Ti GPU with 12 GB of memory, using a batch size of 1 for image transfer and 10 for semi-supervised segmentation. The ADAM algorithm [15] with an initial learning rate of 1e-4 was used during training. We set \(\lambda _{cyc}=10\), \(\lambda _{tumor}=5\) and \(\lambda _{feat}=1\).
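
The following optimizer setup is a sketch consistent with the reported training details (ADAM, initial learning rate 1e-4, alternating updates of G, D, and U); the parameter grouping and the (default) beta values are assumptions, and all network names are illustrative.

```python
import itertools
import torch

def make_optimizers(G_ct2mri, G_mri2ct, D_ct, D_mri, U_ct, U_mri, lr=1e-4):
    """One optimizer per alternately-updated group (Sect. 2.1):
    generators G, discriminators D, and tumor-constraint U-nets U."""
    opt_G = torch.optim.Adam(
        itertools.chain(G_ct2mri.parameters(), G_mri2ct.parameters()), lr=lr)
    opt_D = torch.optim.Adam(
        itertools.chain(D_ct.parameters(), D_mri.parameters()), lr=lr)
    opt_U = torch.optim.Adam(
        itertools.chain(U_ct.parameters(), U_mri.parameters()), lr=lr)
    return opt_G, opt_D, opt_U
```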

3 Experiments and Results

3.1 Ablation Tests

We tested the impact of adding the tumor-aware loss to the cycle loss (proposed vs. cycle-GAN [5] vs. masked cycle-GAN [8]). Segmentation networks were trained on the images synthesized by each of these methods combined with a limited number of real MRIs (semi-supervised learning). We refer to adversarial synthesis [8], which concatenates the tumor labels to the original images as an additional channel, as masked cycle-GAN. We also evaluated the effect of adding a limited number of original MRIs to the synthesized MRIs on segmentation accuracy (tumor-aware with semi-supervised vs. tumor-aware with unsupervised training). Finally, we benchmarked the lowest achievable segmentation accuracy by training a network with only the pre-treatment (week one) MRIs.

3.2 Datasets

The image synthesis networks were trained using contrast-enhanced CT images with expert-delineated tumors from 377 patients with non-small cell lung cancer (NSCLC) [16], available from The Cancer Imaging Archive (TCIA) [17], and an unrelated cohort of 6 patients from our clinic, each scanned with T2-weighted (T2w) MRI before and then weekly during radiation therapy (n = 7 scans per patient). The masked cycle-GAN used both the tumor labels and the images as additional channels even for image synthesis training. Image regions enclosing the tumors were extracted and rescaled to \(256\times {256}\), yielding 32000 CT and 9696 T2w MR image slices. Only 1536 MR images from the pre-treatment MRIs were used for the semi-supervised segmentation training of all networks. Segmentation validation was performed on the subsequent on-treatment MRIs (n = 36) from the same 6 patients. Testing was performed on 28 MRIs consisting of longitudinal scans (7, 7, and 6) from 3 patients and pre-treatment scans from 8 patients not used in training. Tumor segmentation accuracy was evaluated against expert delineations using the Dice similarity coefficient (DSC) and the 95th-percentile Hausdorff distance (HD95).
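
A sketch of the two evaluation metrics on boolean NumPy masks follows; computing HD95 on surface voxels and reporting distances in voxel units (to be scaled by the voxel spacing) are our assumptions.

```python
import numpy as np
from scipy.ndimage import binary_erosion, distance_transform_edt

def dsc(a, b):
    """Dice similarity coefficient between two boolean masks."""
    return 2.0 * np.logical_and(a, b).sum() / (a.sum() + b.sum())

def _surface(mask):
    """Boundary voxels: the mask minus its erosion."""
    return mask & ~binary_erosion(mask)

def hd95(a, b):
    """95th-percentile symmetric surface distance, in voxel units."""
    sa, sb = _surface(a), _surface(b)
    d_ab = distance_transform_edt(~sb)[sa]  # surface of a -> surface of b
    d_ba = distance_transform_edt(~sa)[sb]  # surface of b -> surface of a
    return np.percentile(np.hstack([d_ab, d_ba]), 95)
```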

3.3 MR Image Synthesis Results

Figure 3 shows representative qualitative results: MRIs synthesized using cycle-GAN alone (Fig. 3(b)), masked cycle-GAN (Fig. 3(c)), and our method (Fig. 3(d)). As seen, our method best preserves the anatomical details between CT and MRI. Quantitative evaluation using the Kullback-Leibler (KL) divergence, computed over the tumor regions of the synthesized and the original training MRIs, confirmed that our method produced the best match of the tumor distribution, with the lowest KL divergence of 0.069 compared with cycle-GAN (1.69) and masked cycle-GAN (0.32).
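
This KL-divergence comparison can be reproduced in spirit with a histogram-based estimate over tumor voxels, as sketched below; the bin count and smoothing constant are assumptions, as the paper does not specify them.

```python
import numpy as np

def tumor_kl(synth, real, synth_mask, real_mask, bins=100, eps=1e-8):
    """KL divergence between intensity histograms restricted to the
    tumor regions of synthesized and real MRI."""
    s, r = synth[synth_mask], real[real_mask]
    lo, hi = min(s.min(), r.min()), max(s.max(), r.max())
    p, _ = np.histogram(s, bins=bins, range=(lo, hi))
    q, _ = np.histogram(r, bins=bins, range=(lo, hi))
    p = (p + eps) / (p + eps).sum()  # smoothed, normalized histograms
    q = (q + eps) / (q + eps).sum()
    return float(np.sum(p * np.log(p / q)))
```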

Fig. 3. MRI synthesized from CT using different deep learning methods. The red contour indicates the manually delineated tumor region in the NSCLC datasets [16]. (a) CT image; (b) cycle-GAN [5]; (c) masked cycle-GAN [8]; (d) proposed method.

3.4 Segmentation Results

Figure 4 shows the segmentations generated by the various methods (yellow contours) for three representative cases from the test and validation sets, together with the expert delineations (red contours). As shown in Table 1, our approach outperformed cycle-GAN irrespective of whether it was trained without (unsupervised) or with (semi-supervised) labeled target data. Tumor-aware semi-supervised training outperformed all other methods on both the test and validation datasets.

Table 1. Segmentation accuracy
Fig. 4. Segmentation results of the different methods on representative examples from the validation and test sets. Red contours indicate expert delineations; yellow contours indicate algorithm segmentations. (a) Segmentation trained with only week-one MRI; (b) segmentation using MRI synthesized by cycle-GAN [5]; (c) segmentation using MRI synthesized by masked cycle-GAN [8]; (d) tumor-aware unsupervised learning; (e) tumor-aware semi-supervised learning.

4 Discussion

In this work, we introduced a novel target-specific, tumor-aware loss for synthesizing MR images from unpaired CT datasets using unsupervised cross-domain adaptation. The tumor-aware loss forces the network to retain tumors that are typically lost when using the cycle loss alone, and leads to accurate tumor segmentation. Although applied to lung tumors, our method is applicable to other structures and organs. The segmentation accuracy of our approach trained with only synthesized MRIs exceeded that of the other methods even when they were trained in a semi-supervised manner, and adding a small set of labeled target-domain data further boosted accuracy. The validation set produced lower but not significantly different (p = 0.1) DSC accuracy than the test set, owing to significantly smaller (p = 0.0004) tumor volumes in the validation set (mean 37.66 cc) than in the test set (mean 68.2 cc). Our results also showed that masked cycle-GAN produced lower test performance than the basic cycle-GAN, possibly due to poor modeling from highly unbalanced CT and MR datasets. As a limitation, our approach only forces the synthesized MRIs to preserve tumors, not the MR intensity distribution within tumors. Such modeling would require learning a mapping for individual scan manufacturers, magnet strengths, and coil placements, which was outside the scope of this work. Additionally, irrespective of the chosen method, the synthesized images do not provide a one-to-one pixel mapping from CT to MRI, similar to [8]. There is also room for improving segmentation accuracy by exploring more advanced segmentation models, e.g., boundary-aware fully convolutional networks (FCN) [18].

5 Conclusions

In this work, we proposed a tumor-aware adversarial domain adaptation method that uses unpaired CT and MR images to generate tumor segmentations from MRI. Our approach preserved tumors on the synthesized MRIs and yielded the best segmentation performance compared with state-of-the-art adversarial cross-domain adaptation methods. Our results suggest that lung tumor segmentation from MRI, trained primarily on MRIs synthesized from CT, is feasible.