Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.08473 (cs)

[Submitted on 17 May 2022 (v1), last revised 7 Jun 2022 (this version, v3)]

Title:ColonFormer: An Efficient Transformer based Method for Colon Polyp Segmentation

Authors:Nguyen Thanh Duc, Nguyen Thi Oanh, Nguyen Thi Thuy, Tran Minh Triet, Dinh Viet Sang


Abstract:Identifying polyps is challenging for automatic analysis of endoscopic images in computer-aided clinical support systems. Models based on convolutional neural networks (CNNs), transformers, and their combinations have been proposed to segment polyps with promising results. However, those approaches are limited either because they model only the local appearance of polyps or because they lack multi-level features for spatial dependency in the decoding process. This paper proposes a novel network, namely ColonFormer, to address these limitations. ColonFormer is an encoder-decoder architecture capable of modeling long-range semantic information in both the encoder and decoder branches. The encoder is a lightweight transformer-based architecture for modeling global semantic relations at multiple scales. The decoder is a hierarchical network structure designed to learn multi-level features that enrich the feature representation. In addition, a refinement module with a new skip connection technique refines the boundaries of polyp objects in the global map for accurate segmentation. Extensive experiments have been conducted on five popular benchmark datasets for polyp segmentation: Kvasir, CVC-Clinic DB, CVC-ColonDB, CVC-T, and ETIS-Larib. Experimental results show that ColonFormer outperforms other state-of-the-art methods on all benchmark datasets.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.08473 [cs.CV]
	(or arXiv:2205.08473v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.08473

Submission history

From: Sang Dinh
[v1] Tue, 17 May 2022 16:34:04 UTC (8,015 KB)
[v2] Tue, 31 May 2022 17:14:29 UTC (8,205 KB)
[v3] Tue, 7 Jun 2022 14:23:55 UTC (8,752 KB)
