Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3449301.3449320acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicraiConference Proceedingsconference-collections
research-article

Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation

Published: 09 June 2021 Publication History

Abstract

Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now.
With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics.
We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.

References

[1]
Alom, M.Z., Hasan, M., Yakopcic, C., Taha, T.M., Asari, V.K., 2018. Recurrent residual convolutional neural network based on u-net (r2u-net) for medical image segmentation. arXiv preprint arXiv:1802.06955.
[2]
Barghout, L., Sheynin, J., 2013. Real-world scene perception and perceptual organization: Lessons from computer vision. Journal of Vision 13, 709-709.
[3]
Christ, P.F., Ettlinger, F., Grün, F., Elshaera, M.E.A., Lipkova, J., Schlecht, S., Ahmaddy, F., Tatavarty, S., Bickel, M., Bilic, P., 2017. Automatic liver and tumor segmentation of ct and mri volumes using cascaded fully convolutional neural networks. arXiv preprint arXiv:1702.05970.
[4]
Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O., 2016. 3d u-net: learning dense volumetric segmentation from sparse annotation. arXiv preprint arXiv:1606.06650 .
[5]
Ciresan, D., Giusti, A., Gambardella, L.M., Schmidhuber, J., 2012. Deep neural networks segment neuronal membranes in electron microscopy images, in: Conference and Workshop on Neural Information Processing Systems.
[6]
Girshick, R., 2015. Fast r-cnn. arXiv preprint arXiv:1504.08083 .
[7]
Girshick, R., Donahue, J., Darrell, T., Malik, J., 2014. Rich feature hierarchies for accurate object detection and semantic segmentation, in: IEEE Conference on Computer Vision and Pattern Recognition.
[8]
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y., 2014. Generative adversarial nets, in: Conference and Workshop on Neural Information Processing Systems.
[9]
Gupta, S., Girshick, R., Arbeláez, P., Malik, J., 2014. Learning rich features from RGB-D images for object detection and segmentation, in: European Conference on Computer Vision.
[10]
Han, Z., Wei, B., Mercado, A., Leung, S., Li, S., 2018. Spine-gan: Semantic segmentation of multiple spinal structures. Medical Image Analysis 50, 23–35.
[11]
Hariharan, B., Arbeláez, P., Bourdev, L., Maji, S., Malik, J., 2011. Se- mantic contours from inverse detectors, in: IEEE International Conference on Computer Vision.
[12]
He, K., Zhang, X., Ren, S., Sun, J., 2016. Deep residual learning for image recognition, in: IEEE Conference on Computer Vision and Pattern Recognition.
[13]
Hong, S., You, T., Kwak, S., Han, B., 2015. Online tracking by learning discriminative saliency map with convolutional neural network, in: International Conference on Machine Learning.
[14]
Hu, H.H., Chen, J., Shen, W., 2016. Segmentation and quantification of adipose tissue by magnetic resonance imaging. Magnetic Resonance Materials in Physics, Biology and Medicine 29, 259–276.
[15]
Hu, J., Shen, L., Sun, G., 2018. Squeeze-and-excitation networks, in: IEEE Conference on Computer Vision and Pattern Recognition.
[16]
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q., 2017. Densely connected convolutional networks, in: IEEE Conference on Computer Vision and Pattern Recognition.
[17]
Ibtehaz, N., Rahman, M.S., 2019. Multiresunet: Rethinking the u-net architecture for multimodal biomedical image segmentation. arXiv preprint arXiv:1902.04049 .
[18]
Iglovikov, V., Shvets, A., 2018. Ternausnet: U-net with vgg11 encoder pre-trained on imagenet for image segmentation. arXiv preprint arXiv:1801.05746 .
[19]
Isard, M., Blake, A., 1998. CondensationâATconditional density propagation for visual tracking. International Journal of Computer Vision 29, 5-28.
[20]
Krizhevsky, A., Sutskever, I., Hinton, G.E., 2012. Imagenet classification with deep convolutional neural networks, in: Conference and Workshop on Neural Information Processing Systems.
[21]
Kuwatsuru, R., Shames, D.M., Mühler, A., Mintorovitch, J., Vexler, V., Mann, J.S., Cohn, F., Price, D., Huberty, J., Brasch, R.C., 1993. Quantification of tissue plasma volume in the rat by contrast-enhanced magnetic resonance imaging. Magnetic Resonance in Medicine 30, 76-81.
[22]
Lavallee, S., 1996. Registration for computer-integrated surgery: methodology. Computer-Integrated Surgery: Technology and Clinical Applications, 77–98.
[23]
Li, X., Chen, H., Qi, X., Dou, Q., Fu, C.W., Heng, P.A., 2018. H-denseunet: hybrid densely connected unet for liver and tumor segmentation from ct volumes. IEEE transactions on Medical Imaging 37, 2663–2674.
[24]
Long, J., Shelhamer, E., Darrell, T., 2015. Fully convolutional net- works for semantic segmentation, in: IEEE Conference on Computer Vision and Pattern Recognition.
[25]
Ma, J., Chen, Y., Chen, Y., Wan, F., Xue, S., Li, Z., Feng, S., 2019. Delving deep into liver focal lesion detection: A preliminary study. arXiv preprint arXiv:1907.10346 .
[26]
Marchetti, M.A., Codella, N.C., Dusza, S.W., Gutman, D.A., Helba, B., Kalloo, A., Mishra, N., Carrera, C., Celebi, M.E., DeFazio, J.L., 2018. Results of the 2016 international skin imaging collaboration international symposium on biomedical imaging challenge: Comparison of the accuracy of computer algorithms to dermatologists for the diagnosis of melanoma from dermoscopic images. Journal of the American Academy of Dermatology 78, 270–277.
[27]
Meltzer, C., Zubieta, J., Brandt, J., Tune, L., Mayberg, H., Frost, J., 1996. Regional hypometabolism in alzheimer's disease as measured by positron emission tomography after correction for effects of partial volume averaging. Neurology 47, 454-461.
[28]
Mezer, A., Yeatman, J.D., Stikov, N., Kay, K.N., Cho, N.J., Dougherty, R.F., Perry, M.L., Parvizi, J., Hua, L.H., Butts-Pauly, K., 2013. Quantifying the local tissue volume and composition in individual brains with magnetic resonance imaging. Nature Medicine 19, 1667.
[29]
Müller-Gärtner, H.W., Links, J.M., Prince, J.L., Bryan, R.N., McVeigh, E., Leal, J.P., Davatzikos, C., Frost, J.J., 1992. Measure- ment of radiotracer concentration in brain gray matter using positron emission tomography: Mri-based correction for partial volume effects. Journal of Cerebral Blood Flow & Metabolism 12, 571–583.
[30]
Ning, F., Delhomme, D., LeCun, Y., Piano, F., Bottou, L., Barbano, P.E., 2005. Toward automatic phenotyping of developing embryos from videos. IEEE Transactions on Image Processing 14, 1360-1371.
[31]
Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., Mis- awa, K., Mori, K., McDonagh, S., Hammerla, N.Y., Kainz, B., 2018. Attention u-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 .
[32]
Pal, N.R., Pal, S.K., 1993. A review on image segmentation techniques. Pattern Recognition 26, 1277–1294.
[33]
Pauly, O., Glocker, B., Criminisi, A., Mateus, D., Möller, A.M., Nekolla, S., Navab, N., 2011. Fast multiple organ detection and localization in whole-body mr dixon sequences, in: International Conference on Medical Image Computing and Computer-Assisted Intervention.
[34]
Rafiei, S., Nasr-Esfahani, E., Najarian, K., Karimi, N., Samavi, S., Soroushmehr, S.R., 2018. Liver segmentation in ct images using three dimensional to two dimensional fully convolutional network, in: IEEE International Conference on Image Processing.
[35]
Redmon, J., Divvala, S., Girshick, R., Farhadi, A., 2016. You only look once: Unified, real-time object detection, in: IEEE Conference on Computer Vision and Pattern Recognition.
[36]
Ren, S., He, K., Girshick, R., Sun, J., 2015. Faster r-cnn: Towards real-time object detection with region proposal networks, in: Conference and Workshop on Neural Information Processing Systems.
[37]
Ronneberger, O., Fischer, P., Brox, T., 2015. U-net: Convolutional networks for biomedical image segmentation. arXiv preprint arXiv:1505.04597 .
[38]
Ross, D.A., Lim, J., Lin, R.S., Yang, M.H., 2008. Incremental learning for robust visual tracking. International Journal of Computer Vision 77, 125–141.
[39]
Sharma, N., Aggarwal, L.M., 2010. Automated medical image segmentation techniques. Journal of medical physics/Association of Medical Physicists of India 35, 3–14.
[40]
Simonyan, K., Zisserman, A., 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 .
[41]
Sudre, C.H., Li, W., Vercauteren, T., Ourselin, S., Cardoso, M.J., 2017. Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. arXiv preprint arXiv:1707.03237.
[42]
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z., 2016. Rethinking the inception architecture for computer vision, in: IEEE Conference on Computer Vision and Pattern Recognition.
[43]
Taylor, R.H., Menciassi, A., Fichtinger, G., Fiorini, P., Dario, P., 2016. Medical robotics and computer-integrated surgery, 1657-1684.
[44]
Vincent, L., Soille, P., 1991. Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Transactions on Pattern Analysis and Machine Intelligence 13, 583-598.
[45]
Xiao, X., Lian, S., Luo, Z., Li, S., 2018. Weighted resunet for high- quality retina vessel segmentation, in: International Conference on Information Technology in Medicine and Education.
[46]
Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J., 2017. Pyramid scene parsing network, in: IEEE Conference on Computer Vision and Pattern Recognition.
[47]
Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., Liang, J., 2018. Unet++: A nested u-net architecture for medical image segmentation. arXiv preprint arXiv:1807.10165.

Cited By

View all
  • (2023)MSKD: Structured knowledge distillation for efficient medical image segmentationComputers in Biology and Medicine10.1016/j.compbiomed.2023.107284164(107284)Online publication date: Sep-2023

Index Terms

  1. Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image ACM Other conferences
          ICRAI '20: Proceedings of the 6th International Conference on Robotics and Artificial Intelligence
          November 2020
          288 pages
          ISBN:9781450388597
          DOI:10.1145/3449301
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 09 June 2021

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Biomedical image
          2. Convolutional neural networks
          3. Encoder-decoder
          4. Segmentation

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Funding Sources

          • Beijing Municipal Science & Technology Commission

          Conference

          ICRAI 2020

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)5
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 25 Nov 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2023)MSKD: Structured knowledge distillation for efficient medical image segmentationComputers in Biology and Medicine10.1016/j.compbiomed.2023.107284164(107284)Online publication date: Sep-2023

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media