Nothing Special   »   [go: up one dir, main page]

Skip to main content

Reinforced Negative Sampling for Knowledge Graph Embedding

  • Conference paper
  • First Online:
Database Systems for Advanced Applications (DASFAA 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14853))

Included in the following conference series:

  • 187 Accesses

Abstract

As a key part of knowledge graph embedding (KGE), negative sampling can mine hard negative examples required for model training to improve the accuracy and effectiveness of the KGE models. However, static negative sampling can’t adapt to the dynamic distribution of hard negative examples, causing the model to miss some important negative examples during the model training. Dynamic negative sampling, although it can adapt to the changes in the data, is unable to mine and select enough hard negative examples. So most existing negative samplings affect the model’s comprehension and representation of the data. In this paper, we propose a negative example sampling method by mixup (NESI), which can generate a large number of synthetic examples at the feature-level to solve the problems of lack of hard negative examples and gradient vanishing. Our method utilizes two data mixing strategies and four evaluation functions to rapidly expand the size of the negative candidates, and effectively improve the quality of them. Therefore, NESI can efficiently select enough high-quality hard negative examples. Compared with the existing negative samplings, NESI improves the model performance on MRR by 10.58% and greatly accelerates the convergence speed of the KGE models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Adler, J., Lunz, S.: Banach wasserstein GAN. Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018 pp. 6755–6764 (2018)

    Google Scholar 

  2. Ahrabian, K., Feizi, A., Salehi, Y., Hamilton, W.L., Bose, A.J.: Structure aware negative sampling in knowledge graphs. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. pp. 6093–6101. Association for Computational Linguistics (2020)

    Google Scholar 

  3. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: Dbpedia: A nucleus for a web of open data. In: The Semantic Web, 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference. vol. 4825, pp. 722–735. Springer (2007)

    Google Scholar 

  4. Bollacker, K., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: A collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data. pp. 1247–1250. ACM (2008)

    Google Scholar 

  5. Bordes, A., Glorot, X., Weston, J., Bengio, Y.: A semantic matching energy function for learning with multi-relational data. Mach. Learn. 94(2), 233–259 (2014)

    Article  MathSciNet  Google Scholar 

  6. Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. Advances in neural information processing systems 26: Annual Conference on Neural Information Processing Systems 2013 pp. 2787–2795 (2013)

    Google Scholar 

  7. Cai, L., Wang, W.Y.: Kbgan: Adversarial learning for knowledge graph embeddings. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). pp. 1470–1480. Association for Computational Linguistics (2018)

    Google Scholar 

  8. Che, F., Yang, G., Shao, P., Zhang, D., Tao, J.: Mixkg: Mixing for harder negative samples in knowledge graph. CoRR abs/2202.09606 (2022), https://arxiv.org/abs/2202.09606

  9. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014. pp. 2672–2680 (2014)

    Google Scholar 

  10. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial networks. Commun. ACM 63(11), 139–144 (2020)

    Article  MathSciNet  Google Scholar 

  11. Gu, Z., Wang, L., Chen, X., Tang, Y., Wang, X., Du, X., Guizani,M., Tian, Z.: Epidemic risk assessment by a novel communication station based method. IEEE Trans. Netw. Sci. Eng. 9(1), 332–344 (2022). https://doi.org/10.1109/TNSE.2021.3058762, https://doi.org/10.1109/TNSE.2021.3058762

  12. Guo, Q., Zhuang, F., Qin, C., Zhu, H., Xie, X., Xiong, H., He, Q.: A survey on knowledge graph-based recommender systems. IEEE Trans. Knowl. Data Eng. 34(8), 3549–3568 (2022)

    Article  Google Scholar 

  13. Hao, Y., Zhang, Y., Liu, K., He, S., Liu, Z., Wu, H., Zhao, J.: An end-to-end model for question answering over knowledge base with cross-attention combining global knowledge. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). pp. 221–231. Association for Computational Linguistics (2017)

    Google Scholar 

  14. Islam, M.K., Aridhi, S., Smail-Tabbone, M.: Negative sampling and rule mining for explainable link prediction in knowledge graphs. Knowl.-Based Syst. 250, 109083 (2022)

    Article  Google Scholar 

  15. Kalantidis, Y., Sariyildiz, M.B., Pion, N., Weinzaepfel, P., Larlus, D.: Hard negative mixing for contrastive learning. Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 33, 21798–21809 (2020)

    Google Scholar 

  16. Li, L., Zhang, X., Ma, Y., Gao, C., Wang, J., Yu, Y., Yuan, Z., Ma, Q.: A knowledge graph completion model based on contrastive learning and relation enhancement method. Knowl.-Based Syst. 256, 109889 (2022)

    Article  Google Scholar 

  17. Nickel, M., Tresp, V., Kriegel, H.P.: A three-way model for collective learning on multi-relational data. In: Proceedings of the 28th International Conference on Machine Learning. pp. 809–816. Omnipress (2011)

    Google Scholar 

  18. Nickel, M., Murphy, K., Tresp, V., Gabrilovich, E.: A review of relational machine learning for knowledge graphs. Proc. IEEE 104(1), 11–33 (2016)

    Article  Google Scholar 

  19. Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., Lerer, A.: Automatic differentiation in pytorch. In: Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017 (2017), https://openreview.net/forum?id=BJJsrmfCZ

  20. Shen, Z., Liu, Z., Liu, Z., Savvides, M., Darrell, T., Xing, E.: Un-mix: Rethinking image mixtures for unsupervised visual representation learning. In: Proceedings of the 36th AAAI Conference on Artificial Intelligence. pp. 2216–2224. AAAI Press (2022)

    Google Scholar 

  21. Sun, Z., Deng, Z.H., Nie, J.Y., Tang, J.: Rotate: Knowledge graph embedding by relational rotation in complex space. In: Proceedings of the 7th International Conference on Learning Representations (2019), https://openreview.net/forum?id=HkgEQnRqYQ

  22. Trouillon, T., Dance, C., Gaussier, E., Welbl, J., Riedel, S., Bouchard, G.: Knowledge graph completion via complex tensor factorization. J. Mach. Learn. Res. 18(130), 1–38 (2017)

    MathSciNet  Google Scholar 

  23. Verma, V., Lamb, A., Beckham, C., Najafi, A., Mitliagkas, I., Lopez-Paz, D., Bengio, Y.: Manifold mixup: Better representations by interpolating hidden states. In: Proceedings of the 36th International Conference on Machine Learning. vol. 97, pp. 6438–6447. PMLR (2019)

    Google Scholar 

  24. Wang, Q., Mao, Z., Wang, B., Guo, L.: Knowledge graph embedding: A survey of approaches and applications. IEEE Trans. Knowl. Data Eng. 29(12), 2724–2743 (2017)

    Article  Google Scholar 

  25. Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the 28th AAAI conference on artificial intelligence. pp. 1112–1119. AAAI Press (2014)

    Google Scholar 

  26. Xiong, C., Power, R., Callan, J.: Explicit semantic ranking for academic search via knowledge graph embedding. In: Proceedings of the 26th international conference on world wide web. pp. 1271–1279. ACM (2017)

    Google Scholar 

  27. Yang, B., Yih, S.W.t., He, X., Gao, J., Deng, L.: Embedding entities and relations for learning and inference in knowledge bases. In: Proceedings of the 3rd International Conference on Learning Representations (2015), http://arxiv.org/abs/1412.6575

  28. Yao, X., Van Durme, B.: Information extraction over structured data: Question answering with freebase. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). pp. 956–966. The Association for Computer Linguistics (2014)

    Google Scholar 

  29. Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: Beyond empirical risk minimization. In: Proceedings of the 6th International Conference on Learning Representations (2018), https://openreview.net/forum?id=r1Ddp1-Rb

  30. Zhang, Y., Yao, Q., Shao, Y., Chen, L.: Nscaching: Simple and efficient negative sampling for knowledge graph embedding. In: Proceedings of the 35th IEEE International Conference on Data Engineering. pp. 614–625. IEEE (2019)

    Google Scholar 

Download references

Acknowledgments

This work is supported in part by the Major Key Project of PCL (PCL2022A03), and National Natural Science Foundation of China (Grant No. 62372137).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhaoquan Gu .

Editor information

Editors and Affiliations

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 230 KB)

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xie, Y., Wang, H., Wang, L., Luo, L., Li, J., Gu, Z. (2024). Reinforced Negative Sampling for Knowledge Graph Embedding. In: Onizuka, M., et al. Database Systems for Advanced Applications. DASFAA 2024. Lecture Notes in Computer Science, vol 14853. Springer, Singapore. https://doi.org/10.1007/978-981-97-5562-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-5562-2_23

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-5561-5

  • Online ISBN: 978-981-97-5562-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics