Abstract
Most of the textual information posted on the Web are in documents which conform to the HTML [12] or recently emerging XML [4] specification. In the past, a number of query languages have been proposed for querying data in Web documents. We notice that these query languages are incapable of inferring hierarchically structured data from linked Web documents as well as within a Web document itself. In this paper, we propose a logic-based query language, called SemiLog, for retrieving data in Web documents that are hierarchically structured. SemiLog is capable of handling recursive queries, which infer data that are not explicitly presented in hierarchically structured Web documents, and processing partial knowledge of data in Web documents with irregular structure to answer a given query.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Abiteboul, S., Quass, D., McHugh, J., Widom, J., Wiener, J.: The Lorel Query Language for Semistructured Data. Journal on Digital Libraries 1(1), 68–88 (1997)
Abiteboul, S., Cluet, S., Christophides, V., Milo, T., Moerkotte, G., Simeon, J.: Querying Documents in Object Databases. Int. J. on Digital Libraries, 5–19 (1997)
Arocena, G.O., Mendelzon, A.O.: WebOQL: Restructuring Documents, Databases and Webs. In: Proceedings of the 14th Intl. Conf. on Data Engineering, pp. 24–33 (1998)
Bray, T., Paoli, J., Sperberg-McQueen, C.: Extensible Markup Language (XML) 1.0 W3C Recommendation (February 10 1998), http://www.w3.org/TR/1998/RECxml-19980210
Florescu, D., Levy, A., Mendelzon, A.O.: Database Techniques for the World-Wide Web: A Survey. SIGMOD Record 27(3), 59–74 (1998)
Lakshmanan, L.V.S., Sadri, F., Subramanian, I.N.: A Declarative Language for Querying and Restructuring the Web. In: Post-ICDE IEEE Workshop on Research Issues in Data Engineering (February 1996)
Lim, S.-J., Ng, Y.-K.: WebView: A Tool for Retrieving Internal Structures and Extracting Information from HTML Documents. In: Proceedings of the 6th International Conference on Database Systems for Advanced Applications, pp. 71–80 (April 1999).
Lim, S.-J., Ng, Y.-K.: SemiLog: A Logic-Based Query Language for Hierarchical Data in Web Documents, http://lunar.cs.byu.edu/papers.html/semi.ps
Lloyd, J.W.: Foundations of Logic Programming, 2nd edn. Springer, New York (1993) (extended edition)
Mendelzon, A.O., Mihaila, G., Milo, T.: Querying the World Wide Web. In: Proceedings of the Conf. on Parallel and Distributed Information Systems, pp. 80–91 (1996)
Papakonstantinou, Y., Abiteboul, S., Garcia-Molina, H.: Object Fusion in Mediator Systems. In: Proceedings of the 22nd Intl. Conf. on VLDB, pp. 413–424 (1996)
Raggett, D., Hors, A.L., Jacobs, I.: HTML 4.0 Specification - W3C Recommendation (April 1998), http://www.w3.org/TR/REC-html40
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lim, SJ., Ng, YK. (1999). SemiLog: A Logic-Based Query Language for Hierarchical Data in Web Documents. In: Hui, L.C.K., Lee, DL. (eds) Internet Applications. ICSC 1999. Lecture Notes in Computer Science, vol 1749. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-46652-9_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-46652-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66903-6
Online ISBN: 978-3-540-46652-9
eBook Packages: Springer Book Archive