Abstract
The WWW provides an overwhelming amount of information, which – spatially indexed – can be a valuable additional data source for location-based applications. By manually building a spatial index, only a fraction of the available resources can be covered. This paper introduces a system for the automatic mapping of web pages to geographical locations. Our web robot uses several sets of domain specific keywords, lexical context rules, that are automatically learned, and a hierarchical catalogue of geographical locations that provides exact geographical coordinates for locations. Spatially indexed web pages are used to construct Geographical Web Portals, which can be accessed by different location-based applications. In addition, we present experimental results demonstrating the quantity and the quality of automatically indexed web pages.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Berners-Lee, T., Fischetti, M.: Weaving the web. 1. paperback ed., HarperCollins (2000)
Califf, M.E., Mooney, R.J.: Relational Learning of Pattern-Match Rules for Information Extraction. In: Proceedings of AAAI 1998 Spring Symposium on Applying Machine Learning to Discourse Processing, March 23-25 (1998)
Ding, J., Gravano, L., Shivakumar, N.: Computing Geographical Scopes of Web Resources. In: 26th International Conference on Very Large Databases (VLDB), September 10-14 (2000)
Dublin Core Metadata Element Set, http://www.dublincore.org/documents/dces/
The Getty Thesaurus of Geographic Names, http://www.getty.edu/research/~conducting_research/vocabularies/tgn/
Google Search by Location, http://labs.google.com/location
Leonhardi, U.K., Rothermel, K.: Virtual Information Towers – A metaphor for intuitive, location-aware information access in a mobile environment. In: Proc. of third International Symposium on Wearable Computers, San Francisco, CA (1999)
Markowetz, T.B., Seeger, B.: Geographic Information Retrieval. In: 3rd International Workshop on Web Dynamics (2004)
Nicklas, D., Großmann, M., Schwarz, T., Volz, S., Mitschang, B.: A Model-Based, Open Architecture for Mobile, Spatially Aware Applications. In: 7th International Symposium on Spatial and Temporal Databases (SSTD), Redondo Beach, CA, USA (2001)
Nicklas, D., Mitschang, B.: On building location aware applications using an open platform based on the Nexus Augmented World Model. Software and Systems Modeling 3(4) (2004)
Sütö, M.: Ortsbasierter Web-Zugriff (In German) University of Stuttgart (2002)
W3C: Resource Description Framework (RDF), http://w3.org/RDF/
W3C: Web Ontology Language (OWL), http://w3.org/2004/OWL/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jakob, M., Grossmann, M., Nicklas, D., Mitschang, B. (2005). DCbot: Finding Spatial Information on the Web. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_71
Download citation
DOI: https://doi.org/10.1007/11408079_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25334-1
Online ISBN: 978-3-540-32005-0
eBook Packages: Computer ScienceComputer Science (R0)