Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A novel meta-transfer learning approach via convolutional multi-head self-attention network for few-shot fault diagnosis

Published: 18 October 2024 Publication History

Abstract

In practical industrial applications, it is crucial to train a robust fault diagnosis (FD) model that can quickly adapt to new working conditions or fault modes using a few labeled fault samples. Therefore, a novel convolutional multi-head self-attention network-based meta-transfer learning approach (CMS-MTL) for few-shot fault diagnosis (FSFD) is proposed. Firstly, a convolutional multi-head self-attention network (CMHSAN) is designed, which ingeniously combines the multi-head self-attention (MHSA) blocks and convolution blocks. The local and global feature information of the input time–frequency images are fully considered through the mutual cooperation of MHSA and convolution, so as to fully extract the discriminative features among various fault classes. Secondly, a three-stage CMHSAN-based meta-transfer learning (MTL) scheme is proposed, which provides a good initialization state for the meta-training of the CMHSAN model through the pre-training stage, updates the pre-trained model with the scaling and shifting parameters in the meta-training stage, and fine-tunes the updated model in the meta-testing stage, so as to quickly adapt to new FSFD tasks from the target domain. Thirdly, aiming at the fault classes that are difficult to be diagnosed during meta-training, a meta-task re-training (MTRT) strategy is designed to learn more valuable transferable knowledge in the meta-training stage, thereby improving the adaptability of the CMHSAN model to hard FSFD tasks. Finally, extensive experiments are conducted under different FSFD scenarios to verify the effectiveness of the proposed approach. The results prove that the approach can quickly adapt to new FSFD tasks through the learned meta-knowledge and achieve high diagnosis accuracies.

Highlights

Design a convolutional multi-head self-attention network to extract fault features.
Develop a three-stage CMHSAN-based MTL scheme to adapt fast to new diagnosis tasks.
Design a meta-task re-training strategy to better diagnose difficult fault classes.
Proposed CMS-MTL approach greatly improves the few-shot fault diagnosis accuracy.

References

[1]
Chen X., Yang R., Xue Y., Huang M., Ferrero R., Wang Z., Deep transfer learning for bearing fault diagnosis: A systematic review since 2016, IEEE Trans. Instrum. Meas. 72 (2023).
[2]
Tama B.A., Vania M., Lee S., Lim S., Recent advances in the application of deep learning for fault diagnosis of rotating machinery using vibration signals, Artif. Intell. Rev. 56 (5) (2023) 4667–4709.
[3]
Ruan D., Wang J., Yan J., Gühmann C., CNN parameter design based on fault signal analysis and its application in bearing fault diagnosis, Adv. Eng. Inform. 55 (2023).
[4]
Hou Y., Wang J., Chen Z., Ma J., Li T., Diagnosisformer: An efficient rolling bearing fault diagnosis method based on improved Transformer, Eng. Appl. Artif. Intell. 124 (2023).
[5]
Tong J., Tang S., Wu Y., Pan H., Zheng J., A fault diagnosis method of rolling bearing based on improved deep residual shrinkage networks, Measurement 206 (2023).
[6]
Han T., Xie W., Pei Z., Semi-supervised adversarial discriminative learning approach for intelligent fault diagnosis of wind turbine, Inform. Sci. 648 (2023).
[7]
Chen Y., Rao M., Feng K., Zuo M.J., Physics-Informed LSTM hyperparameters selection for gearbox fault detection, Mech. Syst. Signal Process. 171 (2022).
[8]
Yao Y., Han T., Yu J., Xie M., Uncertainty-aware deep learning for reliable health monitoring in safety-critical energy systems, Energy 291 (2024).
[9]
Zhang T., Chen J., Li F., Zhang K., Lv H., He S., Xu E., Intelligent fault diagnosis of machines with small & imbalanced data: A state-of-the-art review and possible extensions, ISA Trans. 119 (2022) 152–171.
[10]
Wei J., Huang H., Yao L., Hu Y., Fan Q., Huang D., New imbalanced fault diagnosis framework based on Cluster-MWMOTE and MFO-optimized LS-SVM using limited and complex bearing data, Eng. Appl. Artif. Intell. 96 (2020).
[11]
Li J., Zhu Q., Wu Q., Fan Z., A novel oversampling technique for class-imbalanced learning based on SMOTE and natural neighbors, Inform. Sci. 565 (2021) 438–455.
[12]
Zhou F., Yang S., Fujita H., Chen D., Wen C., Deep learning fault diagnosis method based on global optimization GAN for unbalanced data, Knowl.-Based Syst. 187 (2020).
[13]
Liu J., Zhang C., Jiang X., Imbalanced fault diagnosis of rolling bearing using improved MsR-GAN and feature enhancement-driven CapsNet, Mech. Syst. Signal Process. 168 (2022).
[14]
Li W., Huang R., Li J., Liao Y., Chen Z., He G., Yan R., Gryllias K., A perspective survey on deep transfer learning for fault diagnosis in industrial scenarios: Theories, applications and challenges, Mech. Syst. Signal Process. 167 (2022).
[15]
Li Y., Song Y., Jia L., Gao S., Li Q., Qiu M., Intelligent fault diagnosis by fusing domain adversarial training and maximum mean discrepancy via ensemble learning, IEEE Trans. Ind. Inform. 17 (4) (2021) 2833–2841.
[16]
Tan Y., Guo L., Gao H., Zhang L., Deep coupled joint distribution adaptation network: A method for intelligent fault diagnosis between artificial and real damages, IEEE Trans. Instrum. Meas. 70 (2021).
[17]
Li W., Chen Z., He G., A novel weighted adversarial transfer network for partial domain fault diagnosis of machinery, IEEE Trans. Ind. Inform. 17 (3) (2021) 1753–1762.
[18]
Wan L., Li Y., Chen K., Gong K., Li C., A novel deep convolution multi-adversarial domain adaptation model for rolling bearing fault diagnosis, Measurement 191 (2022).
[19]
Chen Z., Gryllias K., Li W., Intelligent fault diagnosis for rotary machinery using transferable convolutional neural network, IEEE Trans. Ind. Inform. 16 (1) (2020) 339–349.
[20]
Feng Y., Chen J., Xie J., Zhang T., Lv H., Pan T., Meta-learning as a promising approach for few-shot cross-domain fault diagnosis: Algorithms, applications, and prospects, Knowl.-Based Syst. 235 (2022).
[21]
Zhang S., Ye F., Wang B., Habetler T.G., Few-shot bearing fault diagnosis based on model-agnostic meta-learning, IEEE Trans. Ind. Appl. 57 (5) (2021) 4754–4764.
[22]
Yang T., Tang T., Wang J., Qiu C., Chen M., A novel cross-domain fault diagnosis method based on model agnostic meta-learning, Measurement 199 (2022).
[23]
Feng Y., Chen J., Zhang T., He S., Xu E., Zhou Z., Semi-supervised meta-learning networks with squeeze-and-excitation attention for few-shot fault diagnosis, ISA Trans. 120 (2022) 383–401.
[24]
Ma R., Han T., Lei W., Cross-domain meta learning fault diagnosis based on multi-scale dilated convolution and adaptive relation module, Knowl.-Based Syst. 261 (2023).
[25]
Lin J., Shao H., Zhou X., Cai B., Liu B., Generalized MAML for few-shot cross-domain fault diagnosis of bearing driven by heterogeneous signals, Expert Syst. Appl. 230 (2023).
[26]
Sun Q., Liu Y., Chen Z., Chua T.-S., Schiele B., Meta-transfer learning through hard tasks, IEEE Trans. Pattern Anal. Mach. Intell. 44 (3) (2022) 1443–1456.
[27]
Li C., Li S., Wang H., Gu F., Ball A.D., Attention-based deep meta-transfer learning for few-shot fine-grained fault diagnosis, Knowl.-Based Syst. 264 (2023).
[28]
Ma L., Jiang B., Xiao L., Lu N., Digital twin-assisted enhanced meta-transfer learning for rolling bearing fault diagnosis, Mech. Syst. Signal Process. 200 (2023).
[29]
Lei Z., Zhang P., Chen Y., Feng K., Wen G., Liu Z., Yan R., Chen X., Yang C., Prior knowledge-embedded meta-transfer learning for few-shot fault diagnosis under variable operating conditions, Mech. Syst. Signal Process. 200 (2023).
[30]
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, I. Polosukhin, Attention is all you need, in: Proc. 31st Int. Conf. Neural Inf. Process. Syst., NIPS, Long Beach, CA, USA, 2017, pp. 6000–6010.
[31]
Wang W., Xie E., Li X., Fan D.-P., Song K., Liang D., Lu T., Luo P., Shao L., PVT v2: Improved baselines with Pyramid vision transformer, Comput. Vis. Media 8 (3) (2022) 415–424.
[32]
A. Smith W., B. Randall R., Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study, Mech. Syst. Signal Process. 64 (2015) 100–131.
[33]
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, Going deeper with convolutions, in: Proc. IEEE Conf. Comput. Vis. Pattern Recognit., CVPR, Boston, MA, USA, 2015, pp. 1–9.
[34]
C. Lessmeier, J.K. Kimotho, D. Zimmer, W. Sextro, Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: A benchmark data set for data-driven classification, in: Proc. Eur. Conf. PHM Soc., PHME16, Vol. 3, Bilbao, Bizkaia, Spain, 2016, pp. 1–17.
[35]
Zhang B., Li W., Li X.-L., Ng S.-K., Intelligent fault diagnosis under varying working conditions based on domain adaptive convolutional neural networks, IEEE Access 6 (2018) 66367–66384.
[36]
Antoniou A., Edwards H., Storkey A., How to train your MAML, 2018,. arXiv:1810.09502.

Index Terms

  1. A novel meta-transfer learning approach via convolutional multi-head self-attention network for few-shot fault diagnosis
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Knowledge-Based Systems
    Knowledge-Based Systems  Volume 299, Issue C
    Sep 2024
    1543 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 18 October 2024

    Author Tags

    1. Fault diagnosis
    2. Few-shot
    3. Meta-transfer learning
    4. Multi-head self-attention mechanism

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media