A Multi-level Attention Model for Text Matching

Qiang Sun¹⁸ &
Yue Wu¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11139))

Included in the following conference series:

International Conference on Artificial Neural Networks

7597 Accesses

Abstract

Text matching based on deep learning models often suffer from the limitation of query term coverage problems. Inspired by the success of attention based models in machine translation, which the models can automatically search for parts of a sentence that are relevant to a target word, we propose a multi-level attention model with maximum matching matrix rank to simulate what human does when finding a good answer for a query question. Firstly, we apply a multi-attention mechanism to choose the high effect document words for every query words. Then an approach we called reciprocal relative standard deviation (RRSD) will calculate the matching coverage score for all query words. Experiments on both question-answer task and learning to rank task have achieved state-of-the-art results compared to traditional statistical methods and deep neural network methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

MatchACNN: A Multi-Granularity Deep Matching Model

Article 12 October 2022

Asymmetry Sensitive Architecture for Neural Text Matching

Multi-granularity Chinese Text Matching Model Combined with Bidirectional Attention

References

Yu, L., Hermann, K.M., Blunsom, P., et al.: Deep learning for answer sentence selection. Comput. Sci. (2014)
Google Scholar
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. Comput. Sci. (2014)
Google Scholar
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks, 4, 3104–3112 (2014)
Google Scholar
Guo J, Fan Y, Ai Q, et al.: A deep relevance matching model for Ad-hoc retrieval. In: ACM International on Conference on Information and Knowledge Management, pp. 55–64. ACM (2016)
Google Scholar
Reed, G.F., Lynn, F., Meade, B.D.: Use of coefficient of variation in assessing variability of quantitative assays. Clin. Diagn. Lab. Immunol. 9(6), 1235–1239 (2002)
Google Scholar
Pang, L., Lan, Y., Guo, J., et al.: Text matching as image recognition (2016)
Google Scholar
Liu, T.Y.: Learning to rank for information retrieval. Acm Sigir. Forum 41(2), 904 (2010)
Google Scholar
Yang, Y., Yih, W.T., Meek, C.: WikiQA: a challenge dataset for open-domain question answering. In: Conference on Empirical Methods in Natural Language Processing, pp. 2013–2018 (2015)
Google Scholar
Qin, T., Liu, T.Y.: Introducing LETOR 4.0 datasets. Comput. Sci. (2013)
Google Scholar
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to Ad Hoc information retrieval. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 334–342. ACM (2001)
Google Scholar
Huang, P.S., He, X., Gao, J., et al.: Learning deep structured semantic models for web search using clickthrough data. In: ACM International Conference on Conference on Information & Knowledge Management, pp. 2333–2338. ACM (2013)
Google Scholar
Shen, Y., He, X., Gao, J., et al.: Learning semantic representations using convolutional neural networks for web search. In: International Conference on World Wide Web, pp. 373–374. ACM (2014)
Google Scholar
Hu, B., Lu, Z., Li, H., et al.: Convolutional neural network architectures for matching natural language sentences. In: International Conference on Neural Information Processing Systems. MIT Press, pp. 2042–2050 (2014)
Google Scholar
Xiong, C., Dai, Z., Callan, J., et al.: End-to-end neural Ad-hoc ranking with Kernel pooling, pp. 55–64 (2017)
Google Scholar
Wan, S., Lan, Y., Guo, J., et al.: A deep architecture for semantic matching with multiple positional sentence representations, pp. 2835–2841 (2015)
Google Scholar
Yang, L., Ai, Q., Guo, J., et al.: aNMM: ranking short answer texts with attention-based neural matching model. In: ACM International on Conference on Information and Knowledge Management, pp. 287–296. ACM (2016)
Google Scholar
Pang, L., Lan, Y., Guo, J., et al.: DeepRank: a new deep architecture for relevance ranking in information retrieval (2017)
Google Scholar
Pang, L., Lan, Y., Guo, J., et al.: A deep investigation of deep IR models (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering and Science, Shanghai University, Shanghai, China
Qiang Sun & Yue Wu

Authors

Qiang Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yue Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qiang Sun .

Editor information

Editors and Affiliations

Czech Academy of Sciences, Prague 8, Czech Republic
Věra Kůrková
Open University of Cyprus, Latsia, Cyprus
Yannis Manolopoulos
CITEC Bielefeld University, Bielefeld, Germany
Barbara Hammer
Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
University of Piraeus, Piraeus, Greece
Ilias Maglogiannis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sun, Q., Wu, Y. (2018). A Multi-level Attention Model for Text Matching. In: Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I. (eds) Artificial Neural Networks and Machine Learning – ICANN 2018. ICANN 2018. Lecture Notes in Computer Science(), vol 11139. Springer, Cham. https://doi.org/10.1007/978-3-030-01418-6_15

Download citation

DOI: https://doi.org/10.1007/978-3-030-01418-6_15
Published: 27 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01417-9
Online ISBN: 978-3-030-01418-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics