Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Path-based multi-hop reasoning over knowledge graph for answering questions via adversarial reinforcement learning

Published: 27 September 2023 Publication History

Abstract

Multi-hop knowledge graph question answering targets at pinpointing the answer entities by inferring across multiple triples in knowledge graphs. To enhance model interpretability, path-based methods are proposed. Specifically, with the advances of deep reinforcement learning (DRL), this paper explores to extend the line of RL-based approaches. However, existing solutions suffer from the issue of spurious paths. A major reason lies in that the agent takes an opportunistic way to explicitly pursue the predictive accuracy of answer entities instead of considering correct reasoning paths. To overcome this challenge, our idea is inspired by adversarial learning and we expect that a discriminator could effectively distinguish whether the reasoning chain is correct or not. To this end, we propose an interpretable reasoning method based on adversarial reinforcement learning for multi-hop KGQA, namely Adversarial Reinforcement Reasoning Network (AR2N). AR2N consists of two crucial components: an answer generator (i.e., policy network of RL) and a path discriminator. By alternately updating two components in an adversarial manner, the generator is able to infer answer entities by following the correct reasoning chain, the discriminator is capable of evaluating the plausibility of reasoning paths. Extensive experiments conducted over three benchmark datasets well demonstrate the effectiveness of our method. For reproducibility, we publicly release the code at https://github.com/iDylanCui/AR2N.

Highlights

The agent takes an opportunistic way to explicitly pursue predictive accuracy.
The generator infers answer entities by following the correct reasoning chain.
The discriminator is capable of evaluating the plausibility of reasoning paths.
RL-based models enhance the interpretability of reasoning process.

References

[1]
Sukhbaatar S., Szlam A., Weston J., Fergus R., End-to-end memory networks, in: NIPS, 2015, pp. 2440–2448.
[2]
Miller A.H., Fisch A., Dodge J., Karimi A., Bordes A., Weston J., Key-value memory networks for directly reading documents, in: EMNLP, 2016, pp. 1400–1409.
[3]
Xu K., Lai Y., Feng Y., Wang Z., Enhancing key-value memory neural networks for knowledge based question answering, in: NAACL-HLT, 2019, pp. 2937–2947.
[4]
Li X., Alazab M., Li Q., Yu K., Yin Q., Question-aware memory network for multi-hop question answering in human-robot interaction, 2021, arXiv abs/2104.13173.
[5]
Saxena A., Tripathi A., Talukdar P.P., Improving multi-hop question answering over knowledge graphs using knowledge base embeddings, in: ACL, 2020, pp. 4498–4507.
[6]
Lin X.V., Socher R., Xiong C., Multi-Hop knowledge graph reasoning with reward shaping, in: EMNLP, 2018, pp. 3243–3253.
[7]
Zhou M., Huang M., Zhu X., An interpretable reasoning network for multi-relation question answering, in: COLING, 2018, pp. 2010–2022.
[8]
Chen Z., Chang C., Chen Y., Nayak J., Ku L., UHop: An unrestricted-hop relation extraction framework for knowledge-based question answering, in: NAACL, 2019, pp. 345–356.
[9]
Lan Y., Wang S., Jiang J., Multi-hop knowledge base question answering with an iterative sequence matching model, in: ICDM, 2019, pp. 359–368.
[10]
Qiu Y., Wang Y., Jin X., Zhang K., Stepwise reasoning for multi-relation question answering over knowledge graph with weak supervision, in: WSDM, 2020, pp. 474–482.
[11]
Lv X., Han X., Hou L., Li J., Liu Z., Zhang W., Zhang Y., Kong H., Wu S., Dynamic anticipation and completion for multi-Hop reasoning over sparse knowledge graph, in: EMNLP, 2020, pp. 5694–5703.
[12]
He G., Lan Y., Jiang J., Zhao W.X., Wen J., Improving multi-hop knowledge base question answering by learning intermediate supervision signals, in: WSDM, 2021, pp. 553–561.
[13]
Goodfellow I.J., Pouget-Abadie J., Mirza M., Xu B., Warde-Farley D., Ozair S., Courville A.C., Bengio Y., Generative adversarial nets, in: NIPS, 2014, pp. 2672–2680.
[14]
Yu L., Zhang W., Wang J., Yu Y., SeqGAN: Sequence generative adversarial nets with policy gradient, in: AAAI, 2017, pp. 2852–2858.
[15]
Sutton R.S., McAllester D.A., Singh S.P., Mansour Y., Policy gradient methods for reinforcement learning with function approximation, in: NIPS, 1999, pp. 1057–1063.
[16]
Lan Y., He G., Jiang J., Jiang J., Zhao W.X., Wen J., Complex knowledge base question answering: A survey, 2021, arXiv abs/2108.06688.
[17]
Ren H., Dai H., Dai B., Chen X., Yasunaga M., Sun H., Schuurmans D., Leskovec J., Zhou D., LEGO: Latent execution-guided reasoning for multi-hop question answering on knowledge graphs, in: ICML, Vol. 139, 2021, pp. 8959–8970.
[18]
Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., Kaiser L., Polosukhin I., Attention is all you need, in: NIPS, 2017, pp. 5998–6008.
[19]
Dettmers T., Minervini P., Stenetorp P., Riedel S., Convolutional 2D knowledge graph embeddings, in: AAAI, 2018, pp. 1811–1818.
[20]
See A., Liu P.J., Manning C.D., Get to the point: Summarization with pointer-generator networks, in: ACL, 2017, pp. 1073–1083.
[21]
Bahdanau D., Cho K., Bengio Y., Neural machine translation by jointly learning to align and translate, in: ICLR, 2015.
[22]
Li J., Monroe W., Shi T., Jean S., Ritter A., Jurafsky D., Adversarial learning for neural dialogue generation, in: EMNLP, 2017, pp. 2157–2169.
[23]
Tuan Y., Lee H., Improving conditional sequence generative adversarial networks by stepwise evaluation, IEEE ACM Trans. Audio Speech Lang. Process. 27 (4) (2019) 788–798.
[24]
Watkins C.J.C.H., Dayan P., Technical note Q-learning, Mach. Learn. 8 (1992) 279–292.
[25]
Zhang Q., Weng X., Zhou G., Zhang Y., Huang J.X., ARL: An adaptive reinforcement learning framework for complex question answering over knowledge base, Inf. Process. Manage. 59 (3) (2022).
[26]
Zhang J., Zhang L., Hui B., Tian L., Improving complex knowledge base question answering via structural information learning, Knowl.-Based Syst. 242 (2022).
[27]
Cui H., Peng T., Xiao F., Han J., Han R., Liu L., Incorporating anticipation embedding into reinforcement learning framework for multi-hop knowledge graph question answering, Inform. Sci. 619 (2023) 745–761.
[28]
Cui H., Peng T., Han R., Zhu B., Bi H., Liu L., Reinforcemant learning with dynamic completion for answering multi-hop questions over knowledge graph, Inf. Process. Manage. 60 (3) (2023).
[29]
Zhang X., Tan Z., Zhang X., Cao Y., Yan R., Adaptively multi-objective adversarial training for dialogue generation, in: IJCAI, 2020, pp. 2872–2878.
[30]
Y. Huang, X. Wu, W. Hu, J. Feng, C. Deng, State-Aware Adversarial Training for Utterance-Level Dialogue Generation, in: Proceedings of the Towards Semi-Supervised and Reinforced Task-Oriented Dialog Systems (SereTOD), 2022, pp. 62–74.
[31]
Zhang Y., Dai H., Kozareva Z., Smola A.J., Song L., Variational reasoning for question answering with knowledge graph, in: AAAI, 2018, pp. 6069–6076.
[32]
Pennington J., Socher R., Manning C.D., Glove: Global vectors for word representation, in: EMNLP, 2014, pp. 1532–1543.
[33]
Kingma D.P., Ba J., Adam: A method for stochastic optimization, in: ICLR, 2015.
[34]
Cui H., Peng T., Bao T., Han R., Han J., Liu L., Stepwise relation prediction with dynamic reasoning network for multi-hop knowledge graph question answering, Appl. Intell. (2022),.
[35]
Sun H., Dhingra B., Zaheer M., Mazaitis K., Salakhutdinov R., Cohen W.W., Open domain question answering using early fusion of knowledge bases and text, in: EMNLP, 2018, pp. 4231–4242.
[36]
Sun H., Bedrax-Weiss T., Cohen W.W., PullNet: Open domain question answering with iterative retrieval on knowledge bases and text, in: EMNLP, 2019, pp. 2380–2390.
[37]
Xiong W., Yu M., Chang S., Guo X., Wang W.Y., Improving question answering over incomplete KBs with knowledge-aware reader, in: ACL, 2019, pp. 4258–4264.
[38]
Han J., Cheng B., Wang X., Two-phase hypergraph based reasoning with dynamic relations for multi-hop KBQA, in: IJCAI, 2020, pp. 3615–3621.
[39]
Wang X., Zhao S., Han J., Cheng B., Yang H., Ao J., Li Z., Modelling long-distance node relations for KBQA with global dynamic graph, in: COLING, 2020, pp. 2572–2582.
[40]
Heo Y., Kim E., Choi W.S., Zhang B., Hypergraph transformer: Weakly-supervised multi-hop reasoning for knowledge-based visual question answering, in: ACL, 2022, pp. 373–390.
[41]
Wang X., Zhao S., Cheng B., Yin Y., Yang H., Explore modeling relation information and direction information in KBQA, Neurocomputing 471 (2022) 139–148.
[42]
Das R., Dhuliawala S., Zaheer M., Vilnis L., Durugkar I., Krishnamurthy A., Smola A., McCallum A., Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning, in: ICLR, 2018.
[43]
Liu Y., Ott M., Goyal N., Du J., Joshi M., Chen D., Levy O., Lewis M., Zettlemoyer L., Stoyanov V., RoBERTa: A robustly optimized BERT pretraining approach, 2019, arXiv abs/1907.11692.
[44]
Lin B.Y., Chen X., Chen J., Ren X., KagNet: Knowledge-aware graph networks for commonsense reasoning, in: EMNLP, 2019, pp. 2829–2839.
[45]
Guu K., Pasupat P., Liu E.Z., Liang P., From language to programs: Bridging reinforcement learning and maximum marginal likelihood, in: ACL, 2017, pp. 1051–1062.
[46]
Hao Y., Liu H., He S., Liu K., Zhao J., Pattern-revising enhanced simple question answering over knowledge bases, in: COLING, 2018, pp. 3272–3282.
[47]
Cui H., Peng T., Feng L., Bao T., Liu L., Simple question answering over knowledge graph enhanced by question pattern classification, Knowl. Inf. Syst. 63 (2021) 2741–2761.
[48]
Zhou G., Xie Z., Yu Z., Huang J.X., DFM: A parameter-shared deep fused model for knowledge base question answering, Inform. Sci. 547 (2021) 103–118.
[49]
Hu S., Zou L., Yu J.X., Wang H., Zhao D., Answering natural language questions by subgraph matching over knowledge graphs, Trans. Knowl. Data Eng. 30 (5) (2018) 824–837.
[50]
Bhutani N., Zheng X., Qian K., Li Y., Jagadish H., Answering complex questions by combining information from curated and extracted knowledge bases, in: ACL, 2020, pp. 1–10.
[51]
Huang X., Zhang J., Li D., Li P., Knowledge graph embedding based question answering, in: ACM, 2019, pp. 105–113.
[52]
Xiong W., Hoang T., Wang W.Y., DeepPath: A reinforcement learning method for knowledge graph reasoning, in: EMNLP, 2017, pp. 564–573.
[53]
Williams R.J., Simple statistical gradient-following algorithms for connectionist reinforcement learning, Mach. Learn. 8 (1992) 229–256.
[54]
Shen Y., Chen J., Huang P., Guo Y., Gao J., M-Walk: Learning to walk over graphs using Monte Carlo tree search, in: NeurIPS, 2018, pp. 6787–6798.
[55]
Wang Q., Hao Y., Cao J., ADRL: An attention-based deep reinforcement learning framework for knowledge graph reasoning, Knowl.-Based Syst. 197 (2020).
[56]
Wang Q., Ji Y., Hao Y., Cao J., GRL: Knowledge graph completion with GAN-based reinforcement learning, Knowl.-Based Syst. 209 (2020).
[57]
Swaminathan A., Zhang H., Mahata D., Gosangi R., Shah R.R., Stent A., A preliminary exploration of GANs for keyphrase generation, in: EMNLP, 2020, pp. 8021–8030.
[58]
He G., Li J., Zhao W.X., Liu P., Wen J., Mining implicit entity preference from user-item interaction data for knowledge graph completion via adversarial learning, in: WWW, 2020, pp. 740–751.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Knowledge-Based Systems
Knowledge-Based Systems  Volume 276, Issue C
Sep 2023
664 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 27 September 2023

Author Tags

  1. Knowledge graph
  2. Question answering
  3. Path-based reasoning
  4. Adversarial learning
  5. Reinforcement learning
  6. Spurious paths

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media