Abstract
Automatic segmentation of blood cells is crucial in medical diagnosis and research, significantly improving the accuracy and efficiency of diagnosing blood disorders. Traditional segmentation methods involving manual segmentation are time-consuming, labor-intensive, and prone to errors. In recent years, advancements in deep learning have provided new solutions for automated segmentation. This paper proposes BCNet, a blood cell segmentation algorithm combining UNet and Transformer. Specifically, BCNet utilizes UNet’s Encoder-Decoder architecture as the backbone for extracting multi-scale features. A Spatial Reduction Transformer (SRT) Module is introduced for capturing long-range dependencies in the deepest downsampling layers to enhance sensitivity to local features. Additionally, coordinate attention is employed instead of skip connections for multi-scale feature fusion, enriching semantic information in deep features. Experimental results demonstrate that BCNet achieves superior Dice and IoU metrics compared to classical medical image segmentation models, facilitating automated analysis and medical diagnosis of blood cells.
Similar content being viewed by others
Data availability statement
This work was supported in Tianjin Research Innovation Project for Postgraduate Students (2022SKY126).
References
Das, P.K., Sahoo, B., Meher, S.: An efficient detection and classification of acute leukemia using transfer learning and orthogonal softmax layer-based model. IEEE/ACM Trans. Comput. Biol. Bioinf. 20(3), 1817–1828 (2022)
Khadidos, A., Sanchez, V., Li, C.-T.: Weighted level set evolution based on local edge features for medical image segmentation. IEEE Trans. Image Process. 26(4), 1979–1991 (2017)
Sahu, A., Das, P.K., Meher, S.: An efficient deep learning scheme to detect breast cancer using mammogram and ultrasound breast images. Biomed. Signal Process. Control 87, 105377 (2024)
Ostu, N.: A threshold selection method from gray-level histograms. IEEE Trans. SMC 9, 62 (1979)
Heimann, T., Meinzer, H.-P.: Statistical shape models for 3d medical image segmentation: a review. Med. Image Anal. 13(4), 543–563 (2009)
Yi, F., Moon, I.: Image segmentation: A survey of graph-cut methods. In: 2012 International Conference on Systems and Informatics (ICSAI2012), IEEE, pp. 1936–1941 (2012)
Zhang, Z., Wu, H., Zhao, H., Shi, Y., Wang, J., Bai, H., Sun, B.: A novel deep learning model for medical image segmentation with convolutional neural network and transformer. Interdiscipl. Sci. Comput. Life Sci. 15(4), 663–677 (2023)
Zhang, Z., Miao, Y., Wu, J., Zhang, X., Ma, Q., Bai, H., Gao, Q.: Deep learning and radiomics-based approach to meningioma grading: exploring the potential value of peritumoral edema regions, Phys. Med. Biol. 69 (2024)
Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18, Springer, pp. 234–241 (2015)
Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., Liang, J.: Unet++: A nested u-net architecture for medical image segmentation. In: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4, Springer, pp. 3–11 (2018)
Diakogiannis, F.I., Waldner, F., Caccetta, P., Wu, C.: Resunet-a: A deep learning framework for semantic segmentation of remotely sensed data. ISPRS J. Photogramm. Remote. Sens. 162, 94–114 (2020)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017)
Chen, J., Lu, Y., Yu, Q., Luo, X., Adeli, E., Wang, Y., Lu, L., Yuille, A.L., Zhou, Y.: Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 (2021)
Cao, H., Wang, Y., Chen, J., Jiang, D., Zhang, X., Tian, Q., Wang, M.: Swin-unet: Unet-like pure transformer for medical image segmentation. In: European Conference on Computer Vision, Springer, pp. 205–218 (2022)
Valanarasu, J.M.J., Oza, P., Hacihaliloglu, I., Patel, V.M.: Medical transformer: Gated axial-attention for medical image segmentation. In: Medical image computing and computer assisted intervention–MICCAI 2021: 24th international conference, Strasbourg, France, September 27–October 1, 2021, proceedings, part I 24, Springer, pp. 36–46 (2021)
Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)
Das, P.K., Meher, S.: Awolse: Adaptive weight optimized level set evolution-based blood cell segmentation. IEEE Trans. Instrum. Meas. (2023)
Özcan, Ş.N., Uyar, T., Karayeğen, G.: Comprehensive data analysis of white blood cells with classification and segmentation by using deep learning approaches. Cytometry Part A (2024)
Tong, B., Wen, T., Du, Y., Pan, T.: Cell image instance segmentation based on polarmask using weak labels. Comput. Methods Programs Biomed. 231, 107426 (2023)
Lan, K., Cheng, J., Jiang, J., Jiang, X., Zhang, Q.: Modified unet++ with atrous spatial pyramid pooling for blood cell image segmentation. Math. Biosci. Eng. MBE 20(1), 1420–1433 (2023)
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2017)
Wang, H., Cao, P., Wang, J., Zaiane, O.R.: Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36, pp. 2441–2449 (2022)
Xiao, L., Pan, Z., Du, X., Chen, W., Qu, W., Bai, Y., Xu, T.: Weighted skip-connection feature fusion: a method for augmenting uav oriented rice panicle image segmentation. Comput. Electron. Agric. (2023)
Qian, L., Wen, C., Li, Y., Hu, Z., Zhou, X., Xia, X., Kim, S.-H.: Multi-scale context unet-like network with redesigned skip connections for medical image segmentation. Comput. Methods Programs Biomed. 243, 107885 (2024)
Zioulis, N., Albanis, G., Drakoulis, P., Alvarez, F., Zarpalas, D., Daras, P.: Hybrid skip: A biologically inspired skip connection for the unet architecture. IEEE Access 10, 53928–53939 (2022)
Wang, W., Xie, E., Li, X., Fan, D.-P., Song, K., Liang, D., Lu, T., Luo, P., Shao, L.: Pyramid vision transformer: A versatile backbone for dense prediction without convolutions. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 568–578 (2021)
Depto, D.S., Rahman, S., Hosen, M.M., Akter, M.S., Reme, T.R., Rahman, A., Zunai, H., Mahdy, M.R.C., Rahman, M.S., Lahiri, J.B.: Blood cell segmentation dataset (2023). https://doi.org/10.34740/KAGGLE/DSV/6107556. https://www.kaggle.com/dsv/6107556
Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980 (2014)
Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890 (2017)
Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., Liu, W.: Ccnet: Criss-cross attention for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 603–612 (2019)
Ruan, J., Xiang, S.: Vm-unet: Vision mamba unet for medical image segmentation, arXiv preprint arXiv:2402.02491 (2024)
Liu, M., Dan, J., Lu, Z., Yu, Y., Li, Y., Li, X.: Cm-unet: Hybrid cnn-mamba unet for remote sensing image semantic segmentation, arXiv preprint arXiv:2405.10530 (2024)
Song, Y., Zheng, J., Lei, L., Ni, Z., Zhao, B., Hu, Y.: Ct2us: Cross-modal transfer learning for kidney segmentation in ultrasound images with synthesized data. Ultrasonics 122, 106706 (2022)
Al-Dhabyani, W., Gomaa, M., Khaled, H., Fahmy, A.: Dataset of breast ultrasound images. Data Brief 28, 104863 (2020)
Funding
This work was supported by JSPS KAKENHI Grant Numbers JP21H05052, JP21K11881, JP23H00479, JP24K02938, and JST, CREST Grant Number JPMJCR22M1, Japan. This work was also supported by DIGIT Aarhus University Centre for Digitalisation, Big Data and Data Analytics, and Digital Research Centre Denmark (DIREC) under the Privacy and Machine Learning project, Denmark.
Author information
Authors and Affiliations
Contributions
Conceptualization, Z.Z. and Y.J.; methodology, H.B.; software, Y.J.; validation, Y.L., H.B. and Z.Z.; formal analysis, S.W.; investigation, M.Y.; resources, Q.X.; data curation, Z.Z.; writing-original draft preparation, Y.J.; writing-review and editing, H.B. and Y.L.; visualization, Y.J.; supervision, Z.Z.; project administration, H.B.; funding acquisition, H.B. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no Conflict of interest.
Ethical Approval
The study utilized publicly available datasets, and therefore, ethical review and approval were not required in accordance with the local legislation and institutional requirements.
Consent to Participate/Consent to Publish
The study utilized publicly available datasets, and therefore, Consent to Participate and Consent to Publish were not required in accordance with the local legislation and institutional requirements.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jiang, Y., Wang, S., Yao, M. et al. BCNet: integrating UNet and transformer for blood cell segmentation. SIViP 19, 14 (2025). https://doi.org/10.1007/s11760-024-03568-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-024-03568-5