Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3579142.3594294acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

DAFTA: Distributed Architecture for Fusion-Transformer training Acceleration

Published: 18 June 2023 Publication History

Abstract

Multi-modal data fusion transformer is a deep learning model that integrates information from multiple modalities, such as text, image, audio, etc., to improve performance in various tasks, especially in the remote sensing domain. Recent efforts leverage hyperspectral imaging (HSI) and LiDAR sensors and their complementary information about the target. Remote sensing image classification is inherently a transductive learning problem, requiring repeated model training (e.g., environment monitoring applications where changes occur on a daily or weekly basis). Hyperspectral data is typically high dimensional and massive, and model training can take an impractically long time (several days to weeks). By reducing training time, it becomes possible to process larger amounts of data and develop more efficient and accurate models. In this paper, we introduce a novel distributed training architecture, DAFTA, which addresses the challenges associated with the processing of large multi-modal remote sensing data. DAFTA is enabled to handle any combination of remote sensing modalities. Additionally, we leverage the similarity of the feature space to optimize the training process and achieve the training with a reduced data set which is equivalent to a complete data set. The proposed approach provides a systematic and efficient method for managing large remote sensing data and enables accurate and timely insights for various applications such as agriculture and infrastructure development. We conduct experiments on two real-world data sets of remotely-sensed images. Our results validate the efficacy of the proposed distributed training approach by achieving a speed-up in the range of 5x without compromising the accuracy, thus making it more practical for real-world applications.

References

[1]
2013. 2013 IEEE GRSS Data Fusion Contest. http://www.grss-ieee.org/community/technical-committees/data-fusion/
[2]
2020. Trento multimodal fusion data. https://github.com/danfenghong/IEEE_TGRS_MDL-RS/tree/master/MDL-RS_CNNs/HSI_LiDAR_CNN
[3]
Tal Ben-Nun and Torsten Hoefler. 2019. Demystifying parallel and distributed deep learning: An in-depth concurrency analysis. ACM Computing Surveys (CSUR) 52, 4 (2019), 1--43.
[4]
Karanbir Singh Chahal, Manraj Singh Grover, Kuntal Dey, and Rajiv Ratn Shah. 2020. A hitchhiker's guide on distributed training of deep neural networks. J. Parallel and Distrib. Comput. 137 (2020), 65--76.
[5]
Yushi Chen, Zhouhan Lin, Xing Zhao, Gang Wang, and Yanfeng Gu. 2014. Deep learning-based classification of hyperspectral data. IEEE Journal of Selected topics in applied earth observations and remote sensing 7 (2014), 2094--2107. Issue 6.
[6]
Yushi Chen, Xing Zhao, and Xiuping Jia. 2015. Spectral-spatial classification of hyperspectral data based on deep belief network. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 8 (2015), 2381--2392. Issue 6.
[7]
Mingmin Chi, Antonio Plaza, Jon Atli Benediktsson, Zhongyi Sun, Jinsheng Shen, and Yangyong Zhu. 2016. Big Data for Remote Sensing: Challenges and Opportunities. Proc. IEEE 104 (2016). Issue 11.
[8]
Shailesh Deshpande and Arun Inamdar. 2023. Classification of urban land use and land cover. Vol. in press. Taylor & Francis.
[9]
Shailesh Deshpande and Arun Inamdar. 2023. Introduction to hyperspectral imaging. Vol. in press. Taylor & Francis.
[10]
Shailesh Deshpande, Arun Inamdar, and Harrick Vin. 2013. Overview of hyperspectral remote sensing of impervious surfaces in urban environment. Communications in Computer and Information Science 398 PART I.
[11]
David Griffiths and Jan Boehm. 2019. A review on deep learning techniques for 3D sensed data classification. Remote Sensing 11, 12 (2019), 1499.
[12]
Shruti Kunde, Amey Pandit, Mayank Mishra, and Rekha Singhal. 2021. Distributed training for accelerating metalearning algorithms. In Proceedings of the International Workshop on Big Data in Emergent Distributed Environments. 1--6.
[13]
Shen Li, Yanli Zhao, Rohan Varma, Omkar Salpekar, Pieter Noordhuis, Teng Li, Adam Paszke, Jeff Smith, Brian Vaughan, Pritam Damania, et al. 2020. Pytorch distributed: Experiences on accelerating data parallel training. arXiv preprint arXiv:2006.15704 (2020).
[14]
Thomas M. Lillesand, Ralph W. Kiefer, and Jonathan W. Chipman. 2018. Remote Sensing and Image Interpretation Seventh Edition. Vol. 53. Issue 9.
[15]
Mengru Ma, Wenping Ma, Licheng Jiao, Xu Liu, Lingling Li, Zhixi Feng, Shuyuan Yang, et al. 2023. A multimodal hyper-fusion transformer for remote sensing image classification. Information Fusion (2023).
[16]
M E Paoletti, J M Haut, J Plaza, and A Plaza. 2019. Deep learning classifiers for hyperspectral imaging: A review. ISPRS Journal of Photogrammetry and Remote Sensing 158 (2019), 279--317.
[17]
Varad Pimpalkhute, Shruti Kunde, Rekha Singhal, Surya Palepu, Dheeraj Chahal, and Amey Pandit. 2022. MetaFaaS: learning-to-learn on serverless. In Proceedings of The International Workshop on Big Data in Emergent Distributed Environments. 1--9.
[18]
Swalpa Kumar Roy, Ankur Deria, Danfeng Hong, Behnood Rasti, Antonio Plaza, and Jocelyn Chanussot. 2022. Multimodal fusion transformer for remote sensing image classification. arXiv preprint arXiv:2203.16952 (2022).
[19]
Jia Song, Shaohua Gao, Yunqiang Zhu, and Chenyan Ma. 2019. A survey of remote sensing image classification based on CNNs. Big earth data 3, 3 (2019), 232--254.
[20]
Xin Wu, Danfeng Hong, and Jocelyn Chanussot. 2021. Convolutional neural networks for multimodal remote sensing data classification. IEEE Transactions on Geoscience and Remote Sensing 60 (2021), 1--10.

Cited By

View all

Index Terms

  1. DAFTA: Distributed Architecture for Fusion-Transformer training Acceleration

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    BiDEDE '23: Proceedings of the International Workshop on Big Data in Emergent Distributed Environments
    June 2023
    56 pages
    ISBN:9798400700934
    DOI:10.1145/3579142
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 June 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. remote sensing
    2. multi-modal
    3. fusion transformers
    4. distributed training

    Qualifiers

    • Research-article

    Conference

    BiDEDE '23
    Sponsor:

    Acceptance Rates

    BiDEDE '23 Paper Acceptance Rate 7 of 15 submissions, 47%;
    Overall Acceptance Rate 25 of 47 submissions, 53%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 81
      Total Downloads
    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media