Abstract
We propose a probabilistic document retrieval model based on Bayesian networks. The network is used to compute the posterior probabilities of relevance of the documents in the collection given a query. These computations can be carried out efficiently, because of the specific network topology and conditional probability tables being considered, which allow the use of a fast and exact probabilities propagation algorithm. In the initial model, only direct relationships between the terms in the glossary and the documents that contain them are considered, giving rise to a Bayesian network with two layers. Next, we consider an extended model that also includes direct relationships between documents, using a network topology with three layers. We also report the results of a set of experiments with the two models, using several standard document collections.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
L.M. de Campos, J.M. Fernández-Luna, and J.F. Huete. Building Bayesian network-based information retrieval systems. In 2nd Workshop on Logical and Uncertainty Models for Information Systems (LUMIS), 543–552, 2000.
J.M. Fernández-Luna. Modelos de Recuperación de Información Basados en Redes de Creencia (in Spanish). Ph.D. Thesis, Universidad de Granada, 2001.
R. Fung and B.D. Favero. Applying Bayesian networks to information retrieval. Communications of the ACM, 38(2):42–57, 1995.
M.E. Maron and J.L. Kuhns. On relevance, probabilistic indexing and information retrieval. Journal of the ACM, 7:216–244, 1960.
J. Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan and Kaufmann, San Mateo, 1988.
I. Reis Silva. Bayesian Networks for Information Retrieval Systems. Ph.D. Thesis, Universidad Federal de Minas Gerais, 2000.
B.A. Ribeiro-Neto and R.R. Muntz. A belief network model for IR. In Proceedings of the 19th ACM-SIGIR Conference, H. Frei, D. Harman, P. Schäble and R. Wilkinson, eds., 253–260, 1996.
C.J. van Rijsbergen. Information Retrieval. Second Edition. Butter Worths, London, 1979.
S.E. Robertson and K. Sparck Jones. Relevance weighting of search terms. Journal of the American Society for Information Science, 27:129–146, 1976.
G. Salton and M.J. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, Inc., 1983.
I. Silva, B. Ribeiro-Neto, P. Calado, E. Moura, and N. Ziviani. Link-based and content-based evidential information in a belief network model. In Proceedings of the 23th ACM-SIGIR Conference, 96–103, 2000.
K. Sparck Jones, S. Walker, and S.E. Robertson. A probabilistic model of information retrieval: development and comparative experiments Part 1. Information Processing and Management, 36:779–808, 2000.
H.R. Turtle, Inference Networks for Document Retrieval, Ph.D. Thesis, Computer and Information Science Dpt., University of Massachusetts, 1990.
H.R. Turtle and W.B. Croft. Inference networks for document retrieval. In Proceedings of the 13th ACM-SIGIR Conference, J.-L. Vidick, ed., 1–24, 1990.
H.R. Turtle and W.B. Croft. Efficient probabilistic inference for text retrieval. In Proceedings of the RIA0’91 Conference, 644–661, 1991.
H. R. Turtle and W. B. Croft. Evaluation of an inference network-based retrieval model. Information Systems, 9(3):187–222, 1991.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Campos, L.M., Fernández-Luna, J.M., Huete, J.F. (2002). A Layered Bayesian Network Model for Document Retrieval. In: Crestani, F., Girolami, M., van Rijsbergen, C.J. (eds) Advances in Information Retrieval. ECIR 2002. Lecture Notes in Computer Science, vol 2291. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45886-7_12
Download citation
DOI: https://doi.org/10.1007/3-540-45886-7_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43343-9
Online ISBN: 978-3-540-45886-9
eBook Packages: Springer Book Archive