
BERTDAN: Question-Answer Dual Attention Fusion Networks with Pre-trained Models for Answer Selection

  • Conference paper
Neural Information Processing (ICONIP 2021)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13110)

Included in the following conference series: International Conference on Neural Information Processing (ICONIP)

Abstract

Community question answering (CQA) has recently become increasingly popular in both academia and industry. However, large numbers of answers often accumulate in question-answering communities, making it practically impossible for users to review every answer and select the most relevant one. Answer selection has therefore become a significant subtask of CQA. We propose question-answer dual attention fusion networks with a pre-trained model (BERTDAN) for the task of answer selection. Specifically, we apply the BERT model, whose deep Transformer architecture has achieved strong results on the GLUE leaderboard, as the encoder layer, fine-tuning it on question subjects, question bodies, and answers, respectively. A cross attention mechanism then extracts the interactive information between question subject and answer and, in the same way, between question body and answer. Finally, we apply dual attention fusion to filter the noise introduced by the question-answer pairs. Experiments show that BERTDAN achieves strong performance on two datasets, SemEval-2015 and SemEval-2017, outperforming all baseline models.
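The paper page itself carries no code, but the pipeline the abstract describes (separate BERT encodings of question subject, question body, and answer; cross attention from the answer to each question view; dual-view fusion into a relevance score) can be sketched as follows. This is a minimal illustration assuming PyTorch and Hugging Face Transformers; every class name, the shared attention parameters, and the exact fusion form are assumptions, not the authors' released implementation.

```python
# Minimal sketch of the pipeline described in the abstract, assuming PyTorch
# and Hugging Face Transformers. All module names, dimensions, and the exact
# fusion form are illustrative assumptions, not the authors' code.
import torch
import torch.nn as nn
from transformers import BertModel


class DualAttentionFusionSketch(nn.Module):
    """Encodes question subject, question body, and answer with BERT,
    cross-attends the answer to each question view, and fuses the two
    attended representations into a relevance score."""

    def __init__(self, hidden: int = 768, heads: int = 8):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        # One shared cross-attention module for both views (an assumption;
        # the paper may use separate parameters per view).
        self.cross_attn = nn.MultiheadAttention(hidden, heads, batch_first=True)
        self.fuse = nn.Linear(2 * hidden, hidden)  # dual-view fusion
        self.score = nn.Linear(hidden, 1)

    def encode(self, input_ids, attention_mask):
        return self.bert(input_ids=input_ids,
                         attention_mask=attention_mask).last_hidden_state

    def forward(self, subject, body, answer):
        # Each argument is a dict of tokenizer outputs:
        # {"input_ids": LongTensor, "attention_mask": LongTensor}.
        s = self.encode(**subject)  # (batch, len_s, hidden)
        b = self.encode(**body)     # (batch, len_b, hidden)
        a = self.encode(**answer)   # (batch, len_a, hidden)
        # Cross attention: answer tokens query the subject / body tokens.
        a_subj, _ = self.cross_attn(query=a, key=s, value=s)
        a_body, _ = self.cross_attn(query=a, key=b, value=b)
        # Fuse the two attended views, then mean-pool to a scalar score.
        fused = torch.tanh(self.fuse(torch.cat([a_subj, a_body], dim=-1)))
        return self.score(fused.mean(dim=1)).squeeze(-1)
```

Under these assumptions, each candidate answer for a question would be scored by a forward pass and the highest-scoring candidate returned, matching the ranking formulation of answer selection.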



Author information


Correspondence to Haitian Yang or Yan Wang.


Copyright information

© 2021 Springer Nature Switzerland AG

About this paper


Cite this paper

Yang, H. et al. (2021). BERTDAN: Question-Answer Dual Attention Fusion Networks with Pre-trained Models for Answer Selection. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds) Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science, vol 13110. Springer, Cham. https://doi.org/10.1007/978-3-030-92238-2_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-92238-2_43

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-92237-5

  • Online ISBN: 978-3-030-92238-2

  • eBook Packages: Computer Science, Computer Science (R0)
