Computer Science > Computer Vision and Pattern Recognition

arXiv:2209.08516 (cs)

[Submitted on 18 Sep 2022]

Title:VisTaNet: Attention Guided Deep Fusion for Surface Roughness Classification

Authors:Prasanna Kumar Routray, Aditya Sanjiv Kanade, Jay Bhanushali, Manivannan Muniyandi

View PDF

Abstract:Human texture perception is a weighted average of multi-sensory inputs: visual and tactile. While the visual sensing mechanism extracts global features, the tactile mechanism complements it by extracting local features. The lack of coupled visuotactile datasets in the literature is a challenge for studying multimodal fusion strategies analogous to human texture perception. This paper presents a visual dataset that augments an existing tactile dataset. We propose a novel deep fusion architecture that fuses visual and tactile data using four types of fusion strategies: summation, concatenation, max-pooling, and attention. Our model shows significant performance improvements (97.22%) in surface roughness classification accuracy over tactile only (SVM - 92.60%) and visual only (FENet-50 - 85.01%) architectures. Among the several fusion techniques, attention-guided architecture results in better classification accuracy. Our study shows that analogous to human texture perception, the proposed model chooses a weighted combination of the two modalities (visual and tactile), thus resulting in higher surface roughness classification accuracy; and it chooses to maximize the weightage of the tactile modality where the visual modality fails and vice-versa.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2209.08516 [cs.CV]
	(or arXiv:2209.08516v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2209.08516

Submission history

From: Prasanna Kumar Routray [view email]
[v1] Sun, 18 Sep 2022 09:37:06 UTC (7,226 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VisTaNet: Attention Guided Deep Fusion for Surface Roughness Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VisTaNet: Attention Guided Deep Fusion for Surface Roughness Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators