research-article

Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation

Authors:

Huiling ZhangAuthors Info & Claims

ICRAI '20: Proceedings of the 6th International Conference on Robotics and Artificial Intelligence

Pages 105 - 113

https://doi.org/10.1145/3449301.3449320

Published: 09 June 2021 Publication History

Abstract

Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now.

With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics.

We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.

References

[1]

Alom, M.Z., Hasan, M., Yakopcic, C., Taha, T.M., Asari, V.K., 2018. Recurrent residual convolutional neural network based on u-net (r2u-net) for medical image segmentation. arXiv preprint arXiv:1802.06955.

[2]

Barghout, L., Sheynin, J., 2013. Real-world scene perception and perceptual organization: Lessons from computer vision. Journal of Vision 13, 709-709.

[3]

Christ, P.F., Ettlinger, F., Grün, F., Elshaera, M.E.A., Lipkova, J., Schlecht, S., Ahmaddy, F., Tatavarty, S., Bickel, M., Bilic, P., 2017. Automatic liver and tumor segmentation of ct and mri volumes using cascaded fully convolutional neural networks. arXiv preprint arXiv:1702.05970.

[4]

Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O., 2016. 3d u-net: learning dense volumetric segmentation from sparse annotation. arXiv preprint arXiv:1606.06650 .

[5]

Ciresan, D., Giusti, A., Gambardella, L.M., Schmidhuber, J., 2012. Deep neural networks segment neuronal membranes in electron microscopy images, in: Conference and Workshop on Neural Information Processing Systems.

[6]

Girshick, R., 2015. Fast r-cnn. arXiv preprint arXiv:1504.08083 .

[7]

Girshick, R., Donahue, J., Darrell, T., Malik, J., 2014. Rich feature hierarchies for accurate object detection and semantic segmentation, in: IEEE Conference on Computer Vision and Pattern Recognition.

Digital Library

[8]

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y., 2014. Generative adversarial nets, in: Conference and Workshop on Neural Information Processing Systems.

[9]

Gupta, S., Girshick, R., Arbeláez, P., Malik, J., 2014. Learning rich features from RGB-D images for object detection and segmentation, in: European Conference on Computer Vision.

[10]

Han, Z., Wei, B., Mercado, A., Leung, S., Li, S., 2018. Spine-gan: Semantic segmentation of multiple spinal structures. Medical Image Analysis 50, 23–35.

[11]

Hariharan, B., Arbeláez, P., Bourdev, L., Maji, S., Malik, J., 2011. Se- mantic contours from inverse detectors, in: IEEE International Conference on Computer Vision.

[12]

He, K., Zhang, X., Ren, S., Sun, J., 2016. Deep residual learning for image recognition, in: IEEE Conference on Computer Vision and Pattern Recognition.

[13]

Hong, S., You, T., Kwak, S., Han, B., 2015. Online tracking by learning discriminative saliency map with convolutional neural network, in: International Conference on Machine Learning.

[14]

Hu, H.H., Chen, J., Shen, W., 2016. Segmentation and quantification of adipose tissue by magnetic resonance imaging. Magnetic Resonance Materials in Physics, Biology and Medicine 29, 259–276.

[15]

Hu, J., Shen, L., Sun, G., 2018. Squeeze-and-excitation networks, in: IEEE Conference on Computer Vision and Pattern Recognition.

[16]

Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q., 2017. Densely connected convolutional networks, in: IEEE Conference on Computer Vision and Pattern Recognition.

[17]

Ibtehaz, N., Rahman, M.S., 2019. Multiresunet: Rethinking the u-net architecture for multimodal biomedical image segmentation. arXiv preprint arXiv:1902.04049 .

[18]

Iglovikov, V., Shvets, A., 2018. Ternausnet: U-net with vgg11 encoder pre-trained on imagenet for image segmentation. arXiv preprint arXiv:1801.05746 .

[19]

Isard, M., Blake, A., 1998. CondensationâATconditional density propagation for visual tracking. International Journal of Computer Vision 29, 5-28.

Digital Library

[20]

Krizhevsky, A., Sutskever, I., Hinton, G.E., 2012. Imagenet classification with deep convolutional neural networks, in: Conference and Workshop on Neural Information Processing Systems.

[21]

Kuwatsuru, R., Shames, D.M., Mühler, A., Mintorovitch, J., Vexler, V., Mann, J.S., Cohn, F., Price, D., Huberty, J., Brasch, R.C., 1993. Quantification of tissue plasma volume in the rat by contrast-enhanced magnetic resonance imaging. Magnetic Resonance in Medicine 30, 76-81.

[22]

Lavallee, S., 1996. Registration for computer-integrated surgery: methodology. Computer-Integrated Surgery: Technology and Clinical Applications, 77–98.

[23]

Li, X., Chen, H., Qi, X., Dou, Q., Fu, C.W., Heng, P.A., 2018. H-denseunet: hybrid densely connected unet for liver and tumor segmentation from ct volumes. IEEE transactions on Medical Imaging 37, 2663–2674.

[24]

Long, J., Shelhamer, E., Darrell, T., 2015. Fully convolutional net- works for semantic segmentation, in: IEEE Conference on Computer Vision and Pattern Recognition.

[25]

Ma, J., Chen, Y., Chen, Y., Wan, F., Xue, S., Li, Z., Feng, S., 2019. Delving deep into liver focal lesion detection: A preliminary study. arXiv preprint arXiv:1907.10346 .

[26]

Marchetti, M.A., Codella, N.C., Dusza, S.W., Gutman, D.A., Helba, B., Kalloo, A., Mishra, N., Carrera, C., Celebi, M.E., DeFazio, J.L., 2018. Results of the 2016 international skin imaging collaboration international symposium on biomedical imaging challenge: Comparison of the accuracy of computer algorithms to dermatologists for the diagnosis of melanoma from dermoscopic images. Journal of the American Academy of Dermatology 78, 270–277.

[27]

Meltzer, C., Zubieta, J., Brandt, J., Tune, L., Mayberg, H., Frost, J., 1996. Regional hypometabolism in alzheimer's disease as measured by positron emission tomography after correction for effects of partial volume averaging. Neurology 47, 454-461.

[28]

Mezer, A., Yeatman, J.D., Stikov, N., Kay, K.N., Cho, N.J., Dougherty, R.F., Perry, M.L., Parvizi, J., Hua, L.H., Butts-Pauly, K., 2013. Quantifying the local tissue volume and composition in individual brains with magnetic resonance imaging. Nature Medicine 19, 1667.

[29]

Müller-Gärtner, H.W., Links, J.M., Prince, J.L., Bryan, R.N., McVeigh, E., Leal, J.P., Davatzikos, C., Frost, J.J., 1992. Measure- ment of radiotracer concentration in brain gray matter using positron emission tomography: Mri-based correction for partial volume effects. Journal of Cerebral Blood Flow & Metabolism 12, 571–583.

[30]

Ning, F., Delhomme, D., LeCun, Y., Piano, F., Bottou, L., Barbano, P.E., 2005. Toward automatic phenotyping of developing embryos from videos. IEEE Transactions on Image Processing 14, 1360-1371.

Digital Library

[31]

Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., Mis- awa, K., Mori, K., McDonagh, S., Hammerla, N.Y., Kainz, B., 2018. Attention u-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 .

[32]

Pal, N.R., Pal, S.K., 1993. A review on image segmentation techniques. Pattern Recognition 26, 1277–1294.

[33]

Pauly, O., Glocker, B., Criminisi, A., Mateus, D., Möller, A.M., Nekolla, S., Navab, N., 2011. Fast multiple organ detection and localization in whole-body mr dixon sequences, in: International Conference on Medical Image Computing and Computer-Assisted Intervention.

[34]

Rafiei, S., Nasr-Esfahani, E., Najarian, K., Karimi, N., Samavi, S., Soroushmehr, S.R., 2018. Liver segmentation in ct images using three dimensional to two dimensional fully convolutional network, in: IEEE International Conference on Image Processing.

[35]

Redmon, J., Divvala, S., Girshick, R., Farhadi, A., 2016. You only look once: Unified, real-time object detection, in: IEEE Conference on Computer Vision and Pattern Recognition.

[36]

Ren, S., He, K., Girshick, R., Sun, J., 2015. Faster r-cnn: Towards real-time object detection with region proposal networks, in: Conference and Workshop on Neural Information Processing Systems.

[37]

Ronneberger, O., Fischer, P., Brox, T., 2015. U-net: Convolutional networks for biomedical image segmentation. arXiv preprint arXiv:1505.04597 .

[38]

Ross, D.A., Lim, J., Lin, R.S., Yang, M.H., 2008. Incremental learning for robust visual tracking. International Journal of Computer Vision 77, 125–141.

Digital Library

[39]

Sharma, N., Aggarwal, L.M., 2010. Automated medical image segmentation techniques. Journal of medical physics/Association of Medical Physicists of India 35, 3–14.

[40]

Simonyan, K., Zisserman, A., 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 .

[41]

Sudre, C.H., Li, W., Vercauteren, T., Ourselin, S., Cardoso, M.J., 2017. Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. arXiv preprint arXiv:1707.03237.

[42]

Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z., 2016. Rethinking the inception architecture for computer vision, in: IEEE Conference on Computer Vision and Pattern Recognition.

[43]

Taylor, R.H., Menciassi, A., Fichtinger, G., Fiorini, P., Dario, P., 2016. Medical robotics and computer-integrated surgery, 1657-1684.

[44]

Vincent, L., Soille, P., 1991. Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Transactions on Pattern Analysis and Machine Intelligence 13, 583-598.

Digital Library

[45]

Xiao, X., Lian, S., Luo, Z., Li, S., 2018. Weighted resunet for high- quality retina vessel segmentation, in: International Conference on Information Technology in Medicine and Education.

[46]

Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J., 2017. Pyramid scene parsing network, in: IEEE Conference on Computer Vision and Pattern Recognition.

[47]

Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., Liang, J., 2018. Unet++: A nested u-net architecture for medical image segmentation. arXiv preprint arXiv:1807.10165.

Cited By

Zhao LQian XGuo YSong JHou JGong J(2023)MSKD: Structured knowledge distillation for efficient medical image segmentationComputers in Biology and Medicine10.1016/j.compbiomed.2023.107284164(107284)Online publication date: Sep-2023
https://doi.org/10.1016/j.compbiomed.2023.107284

Index Terms

Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation
1. Applied computing
  1. Life and medical sciences
    1. Health care information systems
2. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Image segmentation
        Video segmentation
  2. Machine learning

Index terms have been assigned to the content through auto-classification.

Recommendations

Biomedical Image Segmentation Based on Classification Supervision
ICBBT '21: Proceedings of the 2021 13th International Conference on Bioinformatics and Biomedical Technology

Convolutional neural networks (CNN) has been widely used in the biomedical image segmentation (BIS) for their remarkable feature representation capability. However, there are often segmentation errors and missing segmentation problems in biomedical ...
Deep 2D Encoder-Decoder Convolutional Neural Network for Multiple Sclerosis Lesion Segmentation in Brain MRI
Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries
Abstract
In this paper, we propose an automated segmentation approach based on a deep two-dimensional fully convolutional neural network to segment brain multiple sclerosis lesions from multimodal magnetic resonance images. The proposed model is made as a ...
Fully Convolutional Encoder-Decoder Architecture (FCEDA) for Skin Lesions Segmentation
Computational Collective Intelligence
Abstract
Segmentation which is identification of regions of interest (ROIs) in medical images is a very important step for image analysis in computer-aided diagnosis systems. Accurate segmentation of skin lesions images plays a vital role in efficient ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICRAI '20: Proceedings of the 6th International Conference on Robotics and Artificial Intelligence

November 2020

288 pages

ISBN:9781450388597

DOI:10.1145/3449301

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

Beijing Municipal Science & Technology Commission

Conference

ICRAI 2020

ICRAI 2020: 2020 6th International Conference on Robotics and Artificial Intelligence

November 20 - 22, 2020

Singapore, Singapore

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
48
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 25 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Zhao LQian XGuo YSong JHou JGong J(2023)MSKD: Structured knowledge distillation for efficient medical image segmentationComputers in Biology and Medicine10.1016/j.compbiomed.2023.107284164(107284)Online publication date: Sep-2023
https://doi.org/10.1016/j.compbiomed.2023.107284

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents