
Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

Authors:

Heng Ji,

Srinivasan Parthasarathy,

Rajiv Ramnath

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 1348 - 1359

https://doi.org/10.1145/3637528.3672069

Published: 24 August 2024


Abstract

Modern machine-learning-based imaging methods face challenges in medical applications because datasets are costly to construct and labeled training data is therefore limited. Additionally, once deployed, these methods typically process a large volume of data every day, imposing a high maintenance cost on medical facilities. In this paper, we introduce a new neural network architecture, termed LoGoNet, with a tailored self-supervised learning (SSL) method to mitigate these challenges. LoGoNet integrates a novel feature extractor within a U-shaped architecture, leveraging Large Kernel Attention (LKA) and a dual encoding strategy to capture both long-range and short-range feature dependencies. This is in contrast to existing methods that rely on increasing network capacity to enhance feature extraction. This combination of techniques is especially beneficial in medical image segmentation, given the difficulty of learning intricate and often irregular body organ shapes, such as the spleen. To complement the architecture, we propose a novel SSL method tailored for 3D images to compensate for the lack of large labeled datasets. Our method combines masking and contrastive learning techniques within a multi-task learning framework and is compatible with both Vision Transformer (ViT) and CNN-based models. We demonstrate the efficacy of our methods in numerous tasks across two standard datasets (i.e., BTCV and MSD). Benchmark comparisons with eight state-of-the-art models highlight LoGoNet's superior performance in both inference time and accuracy. Code is available at: https://github.com/aminK8/Masked-LoGoNet.
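The abstract mentions masking of 3D images as one ingredient of the SSL method but does not spell out the procedure. As a rough illustration only, the sketch below zeroes out a random subset of non-overlapping cubic patches in a small volume, which is one common way to build the input for a masked-reconstruction pretext task. The function name `mask_volume`, the patch size, the masking ratio, and the zero fill value are all assumptions made for this example and are not taken from the paper or its repository.

```python
import random


def mask_volume(volume, patch=2, ratio=0.5, seed=0):
    """Zero out a random subset of non-overlapping cubic patches.

    `volume` is a nested list indexed as volume[z][y][x]; every dimension
    is assumed to be divisible by `patch` (a simplification for this sketch).
    Returns the masked copy and the set of masked patch origins.
    """
    depth, height, width = len(volume), len(volume[0]), len(volume[0][0])

    # Enumerate the (z, y, x) origin of every patch in the grid.
    origins = [
        (z, y, x)
        for z in range(0, depth, patch)
        for y in range(0, height, patch)
        for x in range(0, width, patch)
    ]

    # Pick which patches to hide, reproducibly for a given seed.
    rng = random.Random(seed)
    n_masked = int(round(ratio * len(origins)))
    chosen = set(rng.sample(origins, n_masked))

    # Copy the volume so the caller's input is left untouched.
    masked = [[row[:] for row in plane] for plane in volume]
    for z0, y0, x0 in chosen:
        for z in range(z0, z0 + patch):
            for y in range(y0, y0 + patch):
                for x in range(x0, x0 + patch):
                    masked[z][y][x] = 0.0  # assumed fill value
    return masked, chosen


# Toy 4x4x4 volume of ones: 8 patches of size 2x2x2, half of them masked.
volume = [[[1.0] * 4 for _ in range(4)] for _ in range(4)]
masked, chosen = mask_volume(volume, patch=2, ratio=0.5, seed=0)
```

In a masked-reconstruction setup, a model would receive `masked` and be trained to predict the original voxel values at the positions listed in `chosen`. How the paper combines this with the contrastive objective is not described in the abstract and is not modeled here.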

Supplemental Material

MP4 File - the Ohio State University

Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

29.04 MB


