Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1559795.1559832acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

XML with incomplete information: models, properties, and query answering

Published: 29 June 2009 Publication History

Abstract

We study models of incomplete information for XML, their computational properties, and query answering. While our approach is motivated by the study of relational incompleteness, incomplete information in XML documents may appear not only as null values but also as missing structural information. Our goal is to provide a classification of incomplete descriptions of XML documents, and separate features - or groups of features - that lead to hard computational problems from those that admit efficient algorithms. Our classification of incomplete information is based on the combination of null values with partial structural descriptions of documents. The key computational problems we consider are consistency of partial descriptions, representability of complete documents by incomplete ones, and query answering. We show how factors such as schema information, the presence of node ids, and missing structural information affect the complexity of these main computational problems, and find robust classes of incomplete XML descriptions that permit tractable query evaluation.

References

[1]
S. Abiteboul, O. Duschka. Complexity of answering queries using materialized views. In PODS 1998, pages 254--263.
[2]
S. Abiteboul, P. Kanellakis, G. Grahne. On the representation and querying of sets of possible worlds. TCS 78 (1991), 158--187.
[3]
S. Abiteboul, L. Segoufin, V. Vianu. Representing and querying XML with incomplete information. ACM TODS, 31 (2006), 208--254.
[4]
S. Abiteboul, R. Hull and V. Vianu. Foundations of Databases, Addison Wesley, 1995.
[5]
M. Arenas, W. Fan, L. Libkin. On the complexity of verifying consistency of XML specifications. SIAM J. Comput. 38 (2008), 841--880.
[6]
M. Arenas, L. Libkin. XML data exchange: consistency and query answering. J. ACM 55(2): (2008).
[7]
M. Benedikt, W. Fan, F. Geerts. XPath satisfiability in the presence of DTDs. J. ACM 55(2): (2008).
[8]
H. Bjorklund, W. Martens, T. Schwentick. Conjunctive query containment over trees. DBPL'07, pages 66--80.
[9]
H. Bjorklund, W. Martens, T. Schwentick. Optimizing conjunctive queries over trees using schema information. MFCS'08, pages 132--143.
[10]
A. Cali, D. Lembo, R. Rosati. On the decidability and complexity of query answering over inconsistent and incomplete databases. PODS'03, pages 260--271.
[11]
D. Calvanese, G. De Giacomo, M. Lenzerini. Semi-structured data with constraints and incomplete information. Description Logics, 1998.
[12]
D. Calvanese, G. De Giacomo, M. Lenzerini. Representing and reasoning on XML documents: a description logic approach. J. Log. Comput. 9 (1999), 295--318.
[13]
S. Cohen, B. Kimelfeld, Y. Sagiv. Incorporating constraints in probabilistic XML. In PODS'08, pages 109--118.
[14]
C. David. Complexity of data tree patterns over XML documents. In MFCS'08, pages 278--289.
[15]
A. Deutsch, V. Tannen. Reformulation of XML queries and constraints. In ICDT'03, pages 225--241.
[16]
Document Object Model (DOM). W3C Recommendation, April 2004. http://www.w3.org/TR/DOM--Level--3--Core.
[17]
R. Fagin, Ph. Kolaitis, R. Miller, L. Popa. Data exchange: semantics and query answering. TCS 336(1): 89--124 (2005).
[18]
P. Gardner, G. Smith, M. Wheelhouse, U. Zarfaty. Local Hoare reasoning about DOM. In PODS'08, pages 261--270.
[19]
G. Gottlob, C. Koch, K. Schulz. Conjunctive queries over trees. J. ACM 53 (2006), 238--272.
[20]
T. Imielinski, W. Lipski. Incomplete information in relational databases. J. ACM 31 (1984), 761--791.
[21]
Y. Kanza, W. Nutt, Y. Sagiv. Querying incomplete information in semistructured data. JCSS 64 (2002), 655--693.
[22]
Ph. Kolaitis and M. Vardi. A logical approach to constraint satisfaction. In Finite Model Theory and its Applications, Springer 2007, pages 339--370.
[23]
M. Lenzerini. Data integration: a theoretical perspective. In PODS'02, pages 233--246.
[24]
D. Olteanu, C. Koch, L. Antova. World-set decompositions: expressiveness and efficient algorithms. TCS 403 (2008), 265--284.
[25]
T. Schwentick. A little bit infinite? On adding data to finitely labelled structures. In STACS'08, pages 17--18.
[26]
L. Segoufin. Automata and logics for words and trees over an infinite alphabet. In CSL'06, pages 41--57.
[27]
P. Senellart, S. Abiteboul. On the complexity of managing probabilistic XML data. In PODS'07, pages 283--292.
[28]
M. Vardi. Querying logical databases. JCSS 33 (1986), 142--160.

Cited By

View all

Index Terms

  1. XML with incomplete information: models, properties, and query answering

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    PODS '09: Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
    June 2009
    298 pages
    ISBN:9781605585536
    DOI:10.1145/1559795
    • General Chair:
    • Jan Paredaens,
    • Program Chair:
    • Jianwen Su
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 June 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. certain answers
    2. consistency
    3. incomplete information
    4. membership
    5. query answering
    6. xml

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS '09
    SIGMOD/PODS '09: International Conference on Management of Data
    June 29 - July 1, 2009
    Rhode Island, Providence, USA

    Acceptance Rates

    PODS '09 Paper Acceptance Rate 26 of 97 submissions, 27%;
    Overall Acceptance Rate 642 of 2,707 submissions, 24%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Incomplete data managementFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-016-6195-x12:1(4-25)Online publication date: 1-Feb-2018
    • (2017)Object-stackInformation Systems Frontiers10.1007/s10796-017-9748-019:3(669-697)Online publication date: 1-Jun-2017
    • (2013)Modeling, Querying, and Mining Uncertain XML DataData Mining10.4018/978-1-4666-2455-9.ch034(669-691)Online publication date: 2013
    • (2013)Partial Instances via SubclassingSoftware Language Engineering10.1007/978-3-319-02654-1_19(344-364)Online publication date: 2013
    • (2012)Modeling, Querying, and Mining Uncertain XML DataXML Data Mining10.4018/978-1-61350-356-0.ch002(29-52)Online publication date: 2012
    • (2010)XML with incomplete informationJournal of the ACM10.1145/1870103.187010758:1(1-62)Online publication date: 21-Dec-2010
    • (2010)Certain answers for XML queriesProceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems10.1145/1807085.1807112(191-202)Online publication date: 6-Jun-2010
    • (2010)On the tradeoff between mapping and querying power in XML data exchangeProceedings of the 13th International Conference on Database Theory10.1145/1804669.1804689(155-164)Online publication date: 23-Mar-2010
    • (2010)On the coNP hardness of computing certain answers over locally specified incomplete DOM-treesInformation Processing Letters10.1016/j.ipl.2010.06.001110:17(753-756)Online publication date: 1-Aug-2010

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media