Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.12295 (cs)

[Submitted on 18 Apr 2024]

Title: When Medical Imaging Met Self-Attention: A Love Story That Didn't Quite Work Out

Authors: Tristan Piater, Niklas Penzel, Gideon Stein, Joachim Denzler

Abstract: A substantial body of research has focused on developing systems that assist medical professionals during labor-intensive early screening processes, many based on convolutional deep-learning architectures. Recently, multiple studies have explored the application of so-called self-attention mechanisms in the vision domain. These studies often report empirical improvements over fully convolutional approaches on various datasets and tasks. To evaluate this trend for medical imaging, we extend two widely adopted convolutional architectures with different self-attention variants and evaluate them on two different medical datasets. With this, we aim to specifically evaluate the possible advantages of additional self-attention. We compare our models with similarly sized convolutional and attention-based baselines and evaluate performance gains statistically. Additionally, we investigate how including such layers changes the features learned by these models during training. Following a hyperparameter search, and contrary to our expectations, we observe no significant improvement in balanced accuracy over fully convolutional models. We also find that important features, such as dermoscopic structures in skin lesion images, are still not learned when self-attention is employed. Finally, analyzing local explanations, we confirm biased feature usage. We conclude that merely incorporating attention is insufficient to surpass the performance of existing fully convolutional methods.
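
The abstract reports its headline result in balanced accuracy without defining the metric. As a minimal sketch, assuming the common definition (the unweighted mean of per-class recall), the following shows why the metric matters on imbalanced screening data. The function name `balanced_accuracy` and the toy labels are illustrative assumptions and are not taken from the paper.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall over the classes present in y_true.

    Assumed standard definition; the paper's exact evaluation code is not shown here.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # number of samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1

    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical toy data: 8 negatives, 2 positives, and a model that
# always predicts the majority class.
y_true = [0] * 8 + [1] * 2
y_pred = [0] * 10

plain_accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(plain_accuracy)                     # 0.8
print(balanced_accuracy(y_true, y_pred))  # 0.5
```

In this toy case a classifier that never predicts the minority class still reaches 0.8 plain accuracy but only 0.5 balanced accuracy, which is why the latter is often preferred for imbalanced medical datasets.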

Comments:	10 pages, 2 figures, 5 tables, presented at VISAPP 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.12295 [cs.CV]
	(or arXiv:2404.12295v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.12295
Journal reference:	Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP (2024), ISBN 978-989-758-679-8, ISSN 2184-4321, SciTePress, pages 149-158
Related DOI:	https://doi.org/10.5220/0012382600003660

Submission history

From: Niklas Penzel
[v1] Thu, 18 Apr 2024 16:18:41 UTC (1,003 KB)
