Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Reasoning about keys for XML

Published: 01 December 2003 Publication History

Abstract

We study absolute and relative keys for XML, and investigate their associated decision problems. We argue that these keys are important to many forms of hierarchically structured data including XML documents. In contrast to other proposals of keys for XML, we show that these keys are always (finitely) satisfiable, and their (finite) implication problem is finitely axiomatizable. Furthermore, we provide a polynomial time algorithm for determining (finite) implication in the size of keys. Our results also demonstrate, among other things, that the analysis of XML keys is far more intricate than its relational counterpart.

References

[1]
{1} T. Bray, J. Paoli, C.M. Sperberg-McQueen, Extensible Markup Language (XML) 1.0, World Wide Web Consortium (W3C), 1998, http://www.w3.org/TR/REC-xml.]]
[2]
{2} A. Layman, et al., XML-Data, W3C Note, 1998, http:// www.w3.org/TR/1998/NOTE-XML-data.]]
[3]
{3} H.S. Thompson, D. Beech, M. Maloney, N. Mendelsohn, XML Schema Part 1: Structures, W3C Working Draft, 2001, http://www.w3.org/TR/xmlschema-1/.]]
[4]
{4} P. Buneman, S. Davidson, W. Fan, C. Hara, W. Tan, Keys for XML, Computer Networks 39 (5) (2002) 473-487.]]
[5]
{5} P. Buneman, S. Davidson, W. Fan, C. Hara, W. Tan, Reasoning about keys for XML, in: G. Ghelli and G. Grahne (Eds.), Lecture Notes in Computer Science (DBPL'2001), Vol. 2397, 2001, pp. 133-148.]]
[6]
{6} H. Thompson, personal communication.]]
[7]
{7} V. Apparao, et al., Document Object Model (DOM) Level 1 Specification, W3C Recommendation, 1998, http://www.w3.org/TR/REC-DOM-Level-1/.]]
[8]
{8} J. Clark, XSL Transformations (XSLT), W3C Recommendation, 1999, http://www.w3.org/TR/xslt.]]
[9]
{9} P. Wadler, A formal semantics for patterns in XSL, Technical Report, Computing Sciences Research Center, Bell Labs, Lucent Technologies, 2000.]]
[10]
{10} J. Robie, J. Lapp, D. Schach, XML Query Language (XQL), Workshop on XML Query Languages, Boston, MA, 1998.]]
[11]
{11} D. Benson, et al., GenBank, Nucleic Acids Res. 28 (2000) 15-18.]]
[12]
{12} J. Sulston, et al., The C. elegans genome sequencing project: a beginning, Nature 356 (6364) (1992) 37-41.]]
[13]
{13} W. Baker, et al., The EMBL nucleotide sequence database, Nucleic Acids Res. 28 (2000) 19-23.]]
[14]
{14} A. Bairoch, R. Apweiler, The SWISS-PROT protein sequence database and its supplement TrEMBL, Nucleic Acids Res. 28 (2000) 45-48.]]
[15]
{15} M. Arenas, W. Fan, L. Libkin, On verifying consistency of XML specifications, in: Proceedings of ACM Symposium on Principles of Database Systems (PODS), Madison, WI, 2002.]]
[16]
{16} W. Fan, L. Libkin, On XML integrity constraints in the presence of DTDs, J. Assoc. Comput. Mach. 49 (3) (2002) 368-406.]]
[17]
{17} S. Abiteboul, R. Hull, V. Vianu, Foundations of Databases, Addison-Wesley, Reading, MA, 1995.]]
[18]
{18} R. Ramakrishnan, J. Gehrke, Database Management Systems, McGraw-Hill, New York, 2000.]]
[19]
{19} J. Clark, S. DeRose, XML Path Language (XPath), W3C Working Draft, 1999, http://www.w3.org/TR/xpath.]]
[20]
{20} G. Miklau, D. Suciu, Containment and equivalence for an XPath fragment, in: Proceedings of ACM Symposium on Principles of Database Systems (PODS), Madison, WI, 2002, pp. 65-76.]]
[21]
{21} F. Neven, T. Schwentick, XPath containment in the presence of disjunction, dtds, and variables, in: Proceedings of the International Conference on Database Theory, Siena, Italy, 2003.]]
[22]
{22} M. Benedikt, W. Fan, G. Kuper, Structural properties of XPath fragments, in: Proceedings of the International Conference on Database Theory, Siena, Italy, 2003.]]
[23]
{23} M. Arenas, W. Fan, L. Libkin, What's hard about XML Schema constraints?, in: Proceedings of 13th International Conference on Database and Expert Systems Applications (DEXA), Aix en Provence, 2002, pp. 269-278.]]
[24]
{24} W. Fan, J. Siméon, Integrity constraints for XML, in: Proceedings of ACM Symposium on Principles of Data-base Systems (PODS), Dallas, TX, 2000, pp. 23-34.]]
[25]
{25} S. Abiteboul, P. Buneman, D. Suciu, Data on the Web. From Relations to Semistructured Data and XML, Morgan Kaufmann, Los Altos, CA, 2000.]]
[26]
{26} S. Abiteboul, V. Vianu, Regular path queries with constraints, in: Proceedings of ACM Symposium on Principles of Database Systems (PODS), Tucson, AZ, 1997, pp. 122-133.]]
[27]
{27} P. Buneman, W. Fan, S. Weinstein, Interaction between path and type constraints, in: Proceedings of ACM Symposium on Principles of Database Systems (PODS), Philadelphia, PA, 1999, pp. 56-67.]]
[28]
{28} P. Buneman, W. Fan, S. Weinstein, Path constraints in semistructured databases, J. Comput. System Sci. 61 (2000) 146-193.]]
[29]
{29} M. Ito, G.E. Weddell, Implication problems for functional constraints on databases supporting complex objects, J. Comput. System Sci. 50 (1) (1995) 165-187.]]
[30]
{30} M.F. van Bommel, G.E. Weddell, Reasoning about equations and functional dependencies on complex objects, IEEE Trans. Data Knowledge Eng. 6 (3) (1994) 455-469.]]
[31]
{31} Z.M. Ozsoyoglu, L.-Y. Yuan, A new normal form for nested relations, ACM Trans. Database Systems 12 (1) (1987) 111-136.]]
[32]
{32} C.S. Hara, S.B. Davidson, Reasoning about nested functional dependencies, in: Proceedings of ACM Symposium on Principles of Database Systems (PODS), Philadelphia, Pennsylvania, 1999, pp. 91-100.]]
[33]
{33} P. Buneman, W. Fan, J. Siméon, S. Weinstein, Constraints for semistructured data and XML, SIGMOD Record 30 (1) (2001) 47-54.]]
[34]
{34} V. Vianu, A Web odyssey: from Codd to XML, in: Proceedings of ACM Symposium on Principles of Data-base Systems (PODS), Santa Barbara, CA, 2001, pp. 1-15.]]
[35]
{35} H. Hunt, D. Rosenkrantz, T. Szymanski, On the equivalence, containment, and covering problems for the regular and context-free languages, J. Comput. System Sci. 12 (1976) 222-268.]]
[36]
{36} J.E. Hopcroft, J.D. Ullman, Introduction to Automata Theory, Languages and Computation, Addison-Wesley, Reading, MA, 1979.]]
[37]
{37} H.B. Enderton, A Mathematical Introduction to Logic, Academic Press, New York, 1972.]]
[38]
{38} M.R. Garey, D.S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, Freeman, New York, 1979.]]
[39]
{39} T. Milo, D. Suciu, Index structures for path expressions, in: Proceedings of the International Conference on Database Theory, Jerusalem, 1999, pp. 277-295.]]
[40]
{40} Microsoft XML Parser 4.0(MSXML), available at: http:// msdn.microsoft.com.]]
[41]
{41} XML Schema Validator, available at: http://www.ltg.ed.ac.uk/ht/xsv-status.html.]]
[42]
{42} Y. Chen, S. Davidson, Y. Zheng, Validating constraints in XML, Technical Report, Computer and Information Science Department, University of Pennsylvania, 2002.]]
[43]
{43} Y.Chen, S. Davidson, Y. Zheng, Constraint preserving xml storage in relations, in: Proceedings of the International Workshop on the Web and Databases (WebDB), Madison, WI, 2002.]]
[44]
{44} S. Davidson, W. Fan, C. Hara, J. Qin, Propagating XML constraints to relations, in: Proceedings of the International Conference on Data Engineering (ICDE), 2003.]]

Cited By

View all
  • (2021)PG-Keys: Keys for Property GraphsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3457561(2423-2436)Online publication date: 9-Jun-2021
  • (2019)Dependencies for GraphsACM Transactions on Database Systems10.1145/328728544:2(1-40)Online publication date: 13-Feb-2019
  • (2018)Reasoning About XML Constraints Based on XML-to-Relational MappingsTheory of Computing Systems10.1007/s00224-018-9846-562:8(1826-1879)Online publication date: 1-Nov-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Information Systems
Information Systems  Volume 28, Issue 8
December 2003
177 pages

Publisher

Elsevier Science Ltd.

United Kingdom

Publication History

Published: 01 December 2003

Author Tags

  1. XML keys
  2. axiomatization
  3. logical implication
  4. satisfiability

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)PG-Keys: Keys for Property GraphsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3457561(2423-2436)Online publication date: 9-Jun-2021
  • (2019)Dependencies for GraphsACM Transactions on Database Systems10.1145/328728544:2(1-40)Online publication date: 13-Feb-2019
  • (2018)Reasoning About XML Constraints Based on XML-to-Relational MappingsTheory of Computing Systems10.1007/s00224-018-9846-562:8(1826-1879)Online publication date: 1-Nov-2018
  • (2017)Extended Tuple Constraint Type in Relational and XML Data ModelProceedings of the 8th Balkan Conference in Informatics10.1145/3136273.3136294(1-8)Online publication date: 20-Sep-2017
  • (2016)XML Type Dependency and XML Type Normal FormProceedings of the 20th International Database Engineering & Applications Symposium10.1145/2938503.2938536(272-277)Online publication date: 11-Jul-2016
  • (2015)Specification and Implementation of the Inverse Referential Integrity Constraint in XML DatabasesProceedings of the 7th Balkan Conference on Informatics Conference10.1145/2801081.2801111(1-8)Online publication date: 2-Sep-2015
  • (2015)Functional dependencies on extended relations defined by regular languagesAnnals of Mathematics and Artificial Intelligence10.1007/s10472-013-9352-z73:1-2(205-243)Online publication date: 1-Jan-2015
  • (2015)Profiling relational dataThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-015-0389-y24:4(557-581)Online publication date: 1-Aug-2015
  • (2014)Discovering XSD Keys from XML DataACM Transactions on Database Systems10.1145/263854739:4(1-49)Online publication date: 30-Dec-2014
  • (2013)Heuristic Methods for Inference of XML SchemasInformatica10.5555/2699244.269924924:4(577-602)Online publication date: 1-Oct-2013
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media