Computer Science > Computer Vision and Pattern Recognition
[Submitted on 13 Sep 2023 (v1), last revised 8 Jul 2024 (this version, v2)]
Title: Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting
Abstract: End-to-end medical image segmentation is of great value for computer-aided diagnosis, a field currently dominated by task-specific models that usually suffer from poor generalization. Following the recent breakthrough of the segment anything model (SAM) in universal image segmentation, extensive efforts have been made to adapt SAM to medical imaging, yet two major issues remain: 1) severe performance degradation and limited generalization without proper adaptation, and 2) semi-automatic segmentation that relies on accurate manual prompts for interaction. In this work, we propose SAMUS as a universal model tailored for ultrasound image segmentation and further extend it to work in an end-to-end manner, denoted AutoSAMUS. Specifically, in SAMUS, a parallel CNN branch is introduced to supplement local information through cross-branch attention, and a feature adapter and a position adapter are jointly used to adapt SAM from the natural-image to the ultrasound domain while reducing training complexity. AutoSAMUS is realized by replacing the manual prompt encoder of SAMUS with an auto prompt generator (APG) that automatically generates prompt embeddings. A comprehensive ultrasound dataset, comprising about 30k images and 69k masks and covering six object categories, is collected for verification. Extensive comparison experiments demonstrate the superiority of SAMUS and AutoSAMUS over state-of-the-art task-specific models and SAM-based foundation models. We believe the auto-prompted SAM-based model has the potential to become a new paradigm for end-to-end medical image segmentation and deserves more exploration. Code and data are available at this https URL.
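The sketch below is a minimal, illustrative rendering of the components named in the abstract: a lightweight feature adapter on the (frozen) ViT features, cross-branch attention that lets ViT tokens pull local detail from a parallel CNN branch, and an auto prompt generator (APG) that produces prompt embeddings without manual interaction. It is not the authors' implementation; all class names, dimensions, and fusion details are hypothetical assumptions for illustration only.

# Minimal sketch (assumed design, not the released SAMUS/AutoSAMUS code).
import torch
import torch.nn as nn


class FeatureAdapter(nn.Module):
    """Bottleneck adapter applied to frozen SAM ViT token features."""
    def __init__(self, dim: int, reduction: int = 4):
        super().__init__()
        self.down = nn.Linear(dim, dim // reduction)
        self.up = nn.Linear(dim // reduction, dim)
        self.act = nn.GELU()

    def forward(self, x):                      # x: (B, N, C) token features
        return x + self.up(self.act(self.down(x)))


class CrossBranchAttention(nn.Module):
    """ViT tokens attend to CNN-branch tokens to supplement local information."""
    def __init__(self, dim: int, heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, vit_tokens, cnn_tokens):
        fused, _ = self.attn(vit_tokens, cnn_tokens, cnn_tokens)
        return vit_tokens + fused


class AutoPromptGenerator(nn.Module):
    """Learned queries attend to image features to predict prompt embeddings,
    standing in for SAM's manual prompt encoder."""
    def __init__(self, dim: int, num_prompts: int = 4, heads: int = 8):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(num_prompts, dim))
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, image_tokens):           # (B, N, C) -> (B, num_prompts, C)
        q = self.queries.unsqueeze(0).expand(image_tokens.size(0), -1, -1)
        prompts, _ = self.attn(q, image_tokens, image_tokens)
        return prompts


if __name__ == "__main__":
    B, N, C = 2, 256, 256                      # batch, tokens, channels (illustrative)
    vit_tokens = torch.randn(B, N, C)          # stand-in for frozen SAM encoder output
    cnn_tokens = torch.randn(B, N, C)          # stand-in for the parallel CNN branch
    tokens = FeatureAdapter(C)(vit_tokens)
    tokens = CrossBranchAttention(C)(tokens, cnn_tokens)
    prompts = AutoPromptGenerator(C)(tokens)
    print(prompts.shape)                       # (2, 4, 256), then fed to the mask decoder

In this reading, only the adapters, CNN branch, and APG would be trained while the SAM backbone stays frozen, which is consistent with the abstract's goal of reducing training complexity while removing the need for manual prompts.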
Submission history
From: Zengqiang Yan
[v1] Wed, 13 Sep 2023 09:15:20 UTC (6,284 KB)
[v2] Mon, 8 Jul 2024 03:24:35 UTC (205 KB)