Abstract
Extracting feature points and their descriptors from images is a fundamental technique in computer vision, with applications such as geometric fitting and camera calibration, and several deep learning models have been proposed for this task. However, existing feature descriptor networks have been developed with the aim of improving accuracy, while consideration for practical networks that can run on embedded devices has largely been deferred. The objective of this study is therefore to devise lightweight feature descriptor networks. To this end, we employ lightweight convolution operations developed for image classification networks (e.g., SqueezeNet and MobileNet) to replace the standard convolution operators in the state-of-the-art feature descriptor network RF-Net. Experimental results show that the model size of the detector can be reduced by up to 80% of the original, with at worst an 11% performance degradation in our final lightweight detector model on image matching tasks. Our study indicates that modern convolution techniques originally proposed for small image classification models can be effectively extended to designing tiny models for the feature descriptor extraction and matching portions of deep local feature learning networks.
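The parameter savings behind the approach sketched in the abstract come from depthwise separable convolution, the building block of MobileNet: a per-channel spatial convolution followed by a 1×1 pointwise convolution. A quick parameter count illustrates the reduction; the channel sizes below are illustrative examples, not layer sizes taken from RF-Net.

```python
def conv_params(c_in, c_out, k):
    """Parameters in a standard k x k convolution (bias ignored)."""
    return c_in * c_out * k * k

def dw_separable_params(c_in, c_out, k):
    """Depthwise k x k convolution (one k x k filter per input channel)
    followed by a 1x1 pointwise convolution mapping c_in -> c_out channels."""
    return c_in * k * k + c_in * c_out

# Example layer: 64 input channels, 128 output channels, 3x3 kernel.
c_in, c_out, k = 64, 128, 3
std = conv_params(c_in, c_out, k)          # 64 * 128 * 9 = 73728
sep = dw_separable_params(c_in, c_out, k)  # 64 * 9 + 64 * 128 = 8768
print(std, sep, round(1 - sep / std, 3))   # roughly 88% fewer parameters
```

This roughly (1/c_out + 1/k²)-fold reduction per layer is what makes the ~80% overall model-size reduction reported above plausible when such blocks replace standard convolutions throughout a network.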
References
Balntas, V., Lenc, K., Vedaldi, A., Mikolajczyk, K.: HPatches: a benchmark and evaluation of handcrafted and learned local descriptors. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2017)
Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861 (2017)
Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5 MB model size. arXiv:1602.07360 (2016)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004). https://doi.org/10.1023/B:VISI.0000029664.99615.94
Ono, Y., Trulls, E., Fua, P., Yi, K.M.: LF-Net: learning local features from images. In: Advances in Neural Information Processing Systems (NIPS) (2018)
Sandler, M., Chu, G., Chen, L.-C., et al.: Searching for MobileNetV3. In: IEEE/CVF International Conference on Computer Vision (ICCV) (2019)
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C.: MobileNetV2: inverted residuals and linear bottlenecks. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2018)
Shen, X., et al.: RF-Net: an end-to-end image matching network based on receptive field. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019)
Zitnick, C.L., Ramnath, K.: Edge foci interest points. In: IEEE International Conference on Computer Vision (ICCV) (2011)
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Wang, Y.R., Kanemura, A. (2021). Designing Lightweight Feature Descriptor Networks with Depthwise Separable Convolution. In: Yada, K., et al. Advances in Artificial Intelligence. JSAI 2020. Advances in Intelligent Systems and Computing, vol 1357. Springer, Cham. https://doi.org/10.1007/978-3-030-73113-7_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73112-0
Online ISBN: 978-3-030-73113-7
eBook Packages: Intelligent Technologies and Robotics (R0)