Abstract
Relation classification (RC) is an essential task in information extraction. The distance supervision (DS) method can use many unlabeled data and solve the lack of training data on the RC task. However, the DS method has the problems of long tails and noise. Intuitively, people can solve these problems using few-shot learning (FSL). Our work aims to improve the accuracy and rapidity of convergence on the few-shot RC task. We believe that entity pairs have an essential role in the few-shot RC task. We propose a new context encoder, which is improved based on the bidirectional encoder representations from transformers (BERT) model to fuse entity pairs and their dependence information in instances. At the same time, we design hybrid attention, which includes support instance-level and query instance-level attention. The support instance level dynamically assigns the weight of each instance in the support set. It makes up for the insufficiency of prototypical networks, which distribute weights to sentences equally. Query instance-level attention is dynamically assigned weights to query instances by similarity with the prototype. The ablation study shows the effectiveness of our proposed method. In addition, a fusion network is designed to replace the Euclidean distance method of previous works when class matching is performed, improving the convergence’s rapidity. This makes our model more suitable for industrial applications. The experimental results show that the proposed model’s accuracy is better than that of several other models.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Data openly available in a public repository.
References
Wang, L., Cao, Z., De Melo, G., Liu, Z.: Relation classification via multi-level attention cnns. In: 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7, 2016 - August 12, 2016. 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, vol. 3, pp. 1298–1307. Association for Computational Linguistics (ACL). 10.18653/v1/p16-1123
Chen T, Wang N, Wang H, Zhan H (2021) Distant supervision for relation extraction with sentence selection and interaction representation. Wireless Communications and Mobile Computing 2021:1–16. https://doi.org/10.1155/2021/8889075
Feng RW, Zheng XS, Gao TX, Chen JT, Wang WZ, Chen DZ, Wu J (2021) Interactive few-shot learning: Limited supervision, better medical image segmentation. IEEE Transactions on Medical Imaging 40(10):2575–2588. https://doi.org/10.1109/tmi.2021.3060551
Ye HJ, Hu HX, Zhan DC (2021) Learning adaptive classifiers synthesis for generalized few-shot learning. International Journal of Computer Vision 129(6):1930–1953. https://doi.org/10.1007/s11263-020-01381-4
Gao, T., Han, X., Zhu, H., Liu, Z., Li, P., Sun, M., Zhou, J.: Fewrel 2.0: Towards more challenging few-shot relation classification. In: 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, November 3, 2019 - November 7, 2019. EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference, pp. 6250–6255. Association for Computational Linguistics. 10.18653/v1/D19-1649
Han, X., Zhu, H., Yu, P., Wang, Z., Yao, Y., Liu, Z., Sun, M.: Fewrel: A large-scale supervised few-shot relation classification dataset with state-of-the-art evaluation. In: 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, October 31, 2018 - November 4, 2018. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, pp. 4803–4809. Association for Computational Linguistics. 10.18653/v1/D18-1514
Chen YS, Chiang SW, Wu ML (2022) A few-shot transfer learning approach using text-label embedding with legal attributes for law article prediction. Applied Intelligence 52(3):2884–2902. https://doi.org/10.1007/s10489-021-02516-x
Gao, T., Han, X., Liu, Z., Sun, M.: Hybrid attention-based prototypical networks for noisy few-shot relation classification. In: 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Annual Conference on Innovative Applications of Artificial Intelligence, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, January 27, 2019 - February 1, 2019. 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, pp. 6407–6414. AAAI Press. 10.1609/aaai.v33i01.33016407
Xie Y, Wang H, Yu B, Zhang C (2020) Secure collaborative few-shot learning. Knowledge-Based Systems 203:10. https://doi.org/10.1016/j.knosys.2020.106157
Xu H, Wang JX, Li H, Ouyang DQ, Shao J (2021) Unsupervised meta-learning for few-shot learning. Pattern Recognition 116:10. https://doi.org/10.1016/j.patcog.2021.107951
Li DW, Tian YJ (2018) Survey and experimental study on metric learning methods. Neural Networks 105:447–462. https://doi.org/10.1016/j.neunet.2018.06.003
Snell, J., Swersky, K., Zemel, R.: Prototypical networks for few-shot learning. In: 31st Annual Conference on Neural Information Processing Systems, NIPS 2017, December 4, 2017 - December 9, 2017. Advances in Neural Information Processing Systems, vol. 2017-December, pp. 4078–4088. Neural information processing systems foundation
Li LQ, Wang JB, Li JC, Ma QL, Wei J (2019) Relation classification via keyword-attentive sentence mechanism and synthetic stimulation loss. IEEE-ACM Transactions on Audio Speech and Language Processing 27(9):1392–1404. https://doi.org/10.1109/taslp.2019.2921726
Sun HY, Grishman R (2022) Lexicalized dependency paths based supervised learning for relation extraction. Computer Systems Science and Engineering 43(3):861–870. https://doi.org/10.32604/csse.2022.030759
Shi Y, Xiao Y, Quan P, Lei ML, Niu LF (2021) Distant supervision relation extraction via adaptive dependency-path and additional knowledge graph supervision. Neural Networks 134:42–53. https://doi.org/10.1016/j.neunet.2020.10.012
Liu Y, Li SJ, Wei FR, Ji H (2016) Relation classification via modeling augmented dependency paths. IEEE-ACM Transactions on Audio Speech and Language Processing 24(9):1589–1598. https://doi.org/10.1109/taslp.2016.2573050
Ma, Y.H., Zhu, J., Liu, J.: Enhanced semantic representation learning for implicit discourse relation classification. Applied Intelligence (2021). 10.1007/s10489-021-02785-6
Runyan Z, Fanrong M, Yong Z, Bing L (2018) Relation classification via recurrent neural network with attention and tensor layers. Big Data Mining and Analytics 1(3):234–244. https://doi.org/10.26599/bdma.2018.9020022
Mintz, M., Bills, S., Snow, R., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: ACL 2009, Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the AFNLP, 2-7 August 2009, Singapore. 10.3115/1690219.1690287
Zhang, N., Deng, S., Sun, Z., Wang, G., Chen, X., Zhang, W., Chen, H.: Long-tail relation extraction via knowledge graph embeddings and graph convolution networks. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 3016–3025. Association for Computational Linguistics. 10.18653/v1/N19-1306
Khan MS, Lohani QMD (2022) Topological analysis of intuitionistic fuzzy distance measures with applications in classification and clustering. Engineering Applications of Artificial Intelligence 116. https://doi.org/10.1016/j.engappai.2022.105415
Hallajian B, Motameni H, Akbari E (2022) Ensemble feature selection using distance-based supervised and unsupervised methods in binary classification. Expert Systems with Applications 200:18. https://doi.org/10.1016/j.eswa.2022.116794
Jiang W, Huang K, Geng J, Deng XY (2021) Multi-scale metric learning for few-shot learning. IEEE Transactions on Circuits and Systems for Video Technology 31(3):1091–1102. https://doi.org/10.1109/tcsvt.2020.2995754
Lake BM, Salakhutdinov R, Tenenbaum JB (2015) Human-level concept learning through probabilistic program induction. Science 350(6266):1332–1338. https://doi.org/10.1126/science.aab3050
Li, W.B., Xu, J.L., Huo, J., Wang, L., Gao, Y., Luo, J.B., Aaai: Distribution consistency based covariance metric networks for few-shot learning. In: 33rd AAAI Conference on Artificial Intelligence / 31st Innovative Applications of Artificial Intelligence Conference / 9th AAAI Symposium on Educational Advances in Artificial Intelligence, pp. 8642–8649. Assoc Advancement Artificial Intelligence, PALO ALTO (2019). 10.1609/aaai.v33i01.33018642
Xie, Y.X., Xu, H., Yang, C.C., Gao, K., Assoc Advancement Artificial, I.: Multi-channel convolutional neural networks with adversarial training for few-shot relation classification. In: 34th AAAI Conference on Artificial Intelligence / 32nd Innovative Applications of Artificial Intelligence Conference / 10th AAAI Symposium on Educational Advances in Artificial Intelligence. AAAI Conference on Artificial Intelligence, vol. 34, pp. 13967–13968. Assoc Advancement Artificial Intelligence, PALO ALTO (2020). 10.1609/aaai.v34i10.7256
Xie Y, Xu H, Li J, Yang C, Gao K (2020) Heterogeneous graph neural networks for noisy few-shot relation classification. Knowledge-Based Systems 194. https://doi.org/10.1016/j.knosys.2020.105548
Ye, Z.X., Ling, Z.H.: Multi-level matching and aggregation network for few-shot relation classification. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 10.18653/v1/P19-1277
Gao, T.Y., Han, X., Xie, R.B., Liu, Z.Y., Lin, F., Lin, L.Y., Sun, M.S., Assoc Advancement Artificial, I.: Neural snowball for few-shot relation learning. In: 34th AAAI Conference on Artificial Intelligence / 32nd Innovative Applications of Artificial Intelligence Conference / 10th AAAI Symposium on Educational Advances in Artificial Intelligence. AAAI Conference on Artificial Intelligence, vol. 34, pp. 7772–7779. Assoc Advancement Artificial Intelligence, PALO ALTO (2020). 10.1609/aaai.v34i05.6281
Pang N, Tan Z, Xu H, Xiao WD (2020) Boosting knowledge base automatically via few-shot relation classification. Frontiers in Neurorobotics 14. https://doi.org/10.3389/fnbot.2020.584192
Xiao, Y., Jin, Y., Hao, K.: Adaptive prototypical networks with label words and joint representation learning for few-shot relation classification. IEEE transactions on neural networks and learning systems PP (2021). 10.1109/tnnls.2021.3105377
Wu JY, Zhao ZB, Sun C, Yan RQ, Chen XF (2020) Few-shot transfer learning for intelligent fault diagnosis of machine. Measurement 166. https://doi.org/10.1016/j.measurement.2020.108202
Hou, Y.T., Lai, Y.K., Wu, Y.S., Che, W.X., Liu, T., Assoc Advancement Artificial, I.: Few-shot learning for multi-label intent detection. In: 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence. AAAI Conference on Artificial Intelligence, vol. 35, pp. 13036–13044. Assoc Advancement Artificial Intelligence, PALO ALTO (2021)
Bao, Y., Wu, M., Chang, S., Barzilay, R.: Few-shot text classification with distributional signatures. In: International Conference on Learning Representations (2020). https://doi.org/10.1007/978-981-33-4859-2_14
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: 31st Annual Conference on Neural Information Processing Systems, NIPS 2017, December 4, 2017 - December 9, 2017. Advances in Neural Information Processing Systems, vol. 2017-December, pp. 5999–6009. Neural information processing systems foundation
Chen XF, Wang GH, Ren HP, Cai Y, Leung HF, Wang T (2022) Task-adaptive feature fusion for generalized few-shot relation classification in an open world environment. IEEE-ACM Transactions on Audio Speech and Language Processing 30:1003–1015. https://doi.org/10.1109/taslp.2022.3153254
Wang, W., Yan, M., Wu, C.: Multi-granularity hierarchical attention fusion networks for reading comprehension and question answering. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1705–1714. Association for Computational Linguistics. 10.18653/v1/P18-1158
Acknowledgements
This paper was supported by the Exercise and Health Laboratory of the Institute of Intelligent Machinery, Chinese Academy of Sciences. Thanks to Tsinghua University for developing the FewRel dataset. We would like to thank the anonymous reviewers for their helpful comments.
Funding
This research was supported by the National Key Research and Development Program of China (2022YFC2010200, 2020YFC2005603), the National Natural Science Foundation of China (NSFC) (grant numbers 61701482), the Key projects of the National Natural Science Foundation of universities in Anhui Province (grant number KJ2020A0112),the Major Special Projects of Anhui Province(grant number 202103a07020004), the Natural Science Foundation of Anhui Province, China (grant number 1808085MF191),the Education Research Project of Anhui Province, China(2020jyxm1573) and the High-level Talents Research Start-up Fund of Hefei Normal University (grant number 2020rcjj45). In addition, we would like to thank the anonymous reviewers who have helped to improve the paper.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work. There is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of the manuscript entitled.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, Y., Ding, Z., Ma, Z. et al. Few-shot relation classification based on the BERT model, hybrid attention and fusion networks. Appl Intell 53, 21448–21464 (2023). https://doi.org/10.1007/s10489-023-04634-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-023-04634-0