Abstract
This paper presents an approach for extending the vector space model (VSM) to perform XML retrieval. The model is extended to support important aspects of XML structural and semantic information, such as element nesting level, matching of tag names between the query and the collection, and the relation between the tag name and the content of an element. We also show the model's potential for heterogeneous collections as well as for unstructured collections. Compared with the standard vector space model, our model obtains a gain for both unstructured and structured queries. For unstructured collections, the effectiveness of the vector space model is preserved.
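The abstract does not give the scoring formula, so the following is only an illustrative sketch of how structural factors could be layered on a vector-space score; it is not the authors' model. The function names, the multiplicative combination, and the parameters `depth_decay` and `tag_bonus` are all assumptions made for this example.

```python
import math
from collections import Counter


def cosine(query_terms, element_terms):
    """Plain vector-space cosine similarity over raw term frequencies."""
    q, e = Counter(query_terms), Counter(element_terms)
    dot = sum(q[t] * e[t] for t in q)
    q_norm = math.sqrt(sum(v * v for v in q.values()))
    e_norm = math.sqrt(sum(v * v for v in e.values()))
    return dot / (q_norm * e_norm) if q_norm and e_norm else 0.0


def structural_score(query_terms, query_tag, element_terms, element_path,
                     depth_decay=0.9, tag_bonus=1.5):
    """Hypothetical structure-aware score (assumed form, not from the paper).

    element_path is the list of tag names from the document root down to
    the element, e.g. ["article", "sec", "title"].
    """
    base = cosine(query_terms, element_terms)
    # Assumed nesting factor: each extra level of nesting scales the score.
    depth = max(len(element_path), 1)
    nesting = depth_decay ** (depth - 1)
    # Assumed tag-name factor: reward an exact match between the tag named
    # in a structured query and the tag of the candidate element.
    tag_matches = (query_tag is not None and bool(element_path)
                   and element_path[-1] == query_tag)
    return base * nesting * (tag_bonus if tag_matches else 1.0)
```

With no query tag and a flat, single-level element, both assumed factors equal 1 and the score reduces to the plain cosine, which mirrors the abstract's claim that effectiveness on unstructured collections is preserved.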
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Azevedo, M.I.M., Amorim, L.P., Ziviani, N. (2005). A Universal Model for XML Information Retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds) Advances in XML Information Retrieval. INEX 2004. Lecture Notes in Computer Science, vol 3493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424550_25
DOI: https://doi.org/10.1007/11424550_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26166-7
Online ISBN: 978-3-540-32053-1
eBook Packages: Computer Science, Computer Science (R0)