Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.03518 (cs)

[Submitted on 6 Nov 2023 (v1), last revised 7 Dec 2023 (this version, v2)]

Title: High-resolution power equipment recognition based on improved self-attention

Authors: Siyi Zhang, Cheng Liu, Xiang Li, Xin Zhai, Zhen Wei, Sizhe Li, Xun Ma

Abstract: The trend toward automated substation inspection has sparked a surge of interest in transformer image recognition. However, because existing models are constrained in their number of parameters, high-resolution images cannot be applied directly, leaving significant room to improve recognition accuracy. To address this challenge, the paper introduces an improvement to deep self-attention networks tailored to the problem. The proposed model comprises four key components: a foundational network, a region proposal network, a module for extracting and segmenting target areas, and a final prediction network. The approach is distinguished by decoupling part localization from recognition: low-resolution images are used first for localization, and high-resolution images are then used for recognition. Moreover, the deep self-attention network's prediction mechanism incorporates the semantic context of images, resulting in substantially improved recognition performance. Comparative experiments show that this method outperforms two other prevalent target recognition models, offering a new perspective for automating electrical equipment inspections.
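
The decoupling described in the abstract (localize on a low-resolution image, then recognize on the high-resolution one) implies a coordinate mapping between the two resolutions. The following is a minimal sketch of that mapping only, not the authors' implementation: the names `scale_box` and `crop`, the `(x0, y0, x1, y1)` box convention, and the outward rounding are all assumptions made for illustration, and the region proposal and prediction networks are left out entirely.

```python
import math
from typing import List, Tuple

# Hypothetical box convention: (x0, y0, x1, y1) with exclusive right/bottom edges.
Box = Tuple[int, int, int, int]


def scale_box(box: Box, low_size: Tuple[int, int], high_size: Tuple[int, int]) -> Box:
    """Map a box found on the low-resolution image onto the high-resolution image.

    Sizes are given as (width, height).
    """
    low_w, low_h = low_size
    high_w, high_h = high_size
    sx, sy = high_w / low_w, high_h / low_h
    x0, y0, x1, y1 = box
    # Round outward and clamp so the crop keeps the whole localized region
    # without leaving the image bounds.
    return (
        max(0, math.floor(x0 * sx)),
        max(0, math.floor(y0 * sy)),
        min(high_w, math.ceil(x1 * sx)),
        min(high_h, math.ceil(y1 * sy)),
    )


def crop(image: List[List[int]], box: Box) -> List[List[int]]:
    """Cut the boxed region out of an image stored as rows of pixel values."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


# Toy usage: a 16x8 "high-resolution" image and a box found on its 4x2 downsample.
high_res = [[y * 16 + x for x in range(16)] for y in range(8)]
low_res_box = (1, 1, 3, 2)  # stand-in for a region proposal
high_res_box = scale_box(low_res_box, low_size=(4, 2), high_size=(16, 8))
patch = crop(high_res, high_res_box)  # this patch would go to the recognizer
```

In this sketch only the cropped patch is passed on at full resolution, which is one plausible reading of how the localization stage keeps its input small while the recognition stage still sees full detail.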

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.03518 [cs.CV]
	(or arXiv:2311.03518v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.03518

Submission history

From: Sizhe Li
[v1] Mon, 6 Nov 2023 20:51:37 UTC (391 KB)
[v2] Thu, 7 Dec 2023 00:45:21 UTC (549 KB)
