Abstract
Currently, the knowledge graph construction from the text mainly relies on document-level relation extraction models. However, these models have lower accuracy due to the limitations in capturing relations across sentences and documents. To address this problem, this paper proposes a novel approach to continuously construct a knowledge graph based on a reinforcement learning framework, which leverages a question generation model, question answering model, and sentence-level relation extraction model to mine cross-document domain knowledge for effective knowledge updating. Three joint rewards are designed to optimize the question generation model, making it generate high-quality questions that facilitate the knowledge graph construction. Automatic evaluation combined with manual evaluation is conducted based on the SQuAD dataset, to assess the quality of the questions and generated knowledge graph respectively. The experiments and analyses demonstrate the effectiveness of our approach in improving the quality of the knowledge graph construction.
J. Luo, Y. Zhang and Y. Wang—The authors contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ye, H., Zhang, N., Chen, H., Chen, H.: Generative knowledge graph construction: a review. CoRR (2022). https://doi.org/10.48550/arXiv.2210.12714
Ji, S., Pan, S., Cambria, E., Marttinen, P., Philip, S.Y.: A survey on knowledge graphs: representation, acquisition, and applications. IEEE Trans. Neural Netw. Learn. Syst. 33(2), 494–514 (2021)
Xu, B., Wang, Q., Lyu, Y., Zhu, Y., Mao, Z.: Entity structure within and throughout: modeling mention dependencies for document-level relation extraction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 14149–14157 (2021)
Krishna, R., Lee, D., Fei-Fei, L., Bernstein, M.S.: Socially situated artificial intelligence enables learning from human interaction. Proc. Natl. Acad. Sci. 119(39), e2115730119 (2022)
Vaswani, A., et al.: Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6000–6010 (2017)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
Radford, A., Narasimhan, K., Salimans, T., Sutskever, I., et al.: Improving language understanding by generative pre-training (2018)
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I.: Language models are unsupervised multitask learners (2019)
Roberts, A., Raffel, C., Shazeer, N.M.: How much knowledge can you pack into the parameters of a language model? In: Conference on Empirical Methods in Natural Language Processing (2020)
Shin, T., Razeghi, Y., Logan IV, R.L., Wallace, E., Singh, S.: Autoprompt: eliciting knowledge from language models with automatically generated prompts. arXiv preprint arXiv:2010.15980 (2020)
Dessì, D., Osborne, F., Reforgiato Recupero, D., Buscaldi, D., Motta, E., Sack, H.: AI-KG: an automatically generated knowledge graph of artificial intelligence. In: Pan, J.Z., et al. (eds.) ISWC 2020. LNCS, vol. 12507, pp. 127–143. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62466-8_9
Rincon-Yanez, D., Senatore, S.: FAIR knowledge graph construction from text, an approach applied to fictional novels. In: Proceedings of the 1st International Workshop on Knowledge Graph Generation From Text and the 1st International Workshop on Modular Knowledge co-located with 19th Extended Semantic Conference (ESWC 2022), pp. 94–108. CEUR-WS, Hersonissos, Greece (2022)
Melnyk, I., Dognin, P., Das, P.: Knowledge graph generation from text. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP) (2022)
Xie, Y., Pan, L., Wang, D., Kan, M.Y., Feng, Y.: Exploring question-specific rewards for generating deep questions. In: The 28th International Conference on Computational Linguistics (COLING 2020) (2020)
Joshi, M., Chen, D., Liu, Y., Weld, D.S., Zettlemoyer, L., Levy, O.: SpanBERT: improving pre-training by representing and predicting spans. arXiv preprint arXiv:1907.10529 (2019)
Span-based joint entity and relation extraction with transformer pre-training. In: 24th European Conference on Artificial Intelligence (2020)
Li, M., Zhu, Y., Shen, Y., Angelova, M.: Clustering-enhanced stock price prediction using deep learning. World Wide Web 26(1), 207–232 (2023)
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318 (2002)
Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004)
Banerjee, S., Lavie, A.: Meteor: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65–72 (2005)
Islam, M.S.: Knowurenvironment: an automated knowledge graph for climate change and environmental issues. In: AAAI 2022 Fall Symposium: The Role of AI in Responding to Climate Challenges (2022)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Luo, J. et al. (2023). A Reinforcement Learning-Based Approach for Continuous Knowledge Graph Construction. In: Jin, Z., Jiang, Y., Buchmann, R.A., Bi, Y., Ghiran, AM., Ma, W. (eds) Knowledge Science, Engineering and Management. KSEM 2023. Lecture Notes in Computer Science(), vol 14120. Springer, Cham. https://doi.org/10.1007/978-3-031-40292-0_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-40292-0_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40291-3
Online ISBN: 978-3-031-40292-0
eBook Packages: Computer ScienceComputer Science (R0)