Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1410140.1410176acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
research-article

Improving query performance on XML documents: a workload-driven design approach

Published: 16 September 2008 Publication History

Abstract

As XML has emerged as a data representation format and as great quantities of data have been stored in the XML format, XML document design has become an important and evident issue in several application contexts. Methodologies based on conceptual modeling are being tightly applied for designing XML documents. However, the conversion of a conceptual schema to an XML schema is a complex process. In many cases, conceptual relationships cannot be represented in a hierarchy so that they have to be represented by reference relationships in the XML schema. The problem is that reference relationships generate a disconnected XML structure and, consequently, produce an overhead cost for query processing on XML documents.
This paper presents a design approach for generating XML schemas from conceptual schemas considering the expected workload of the XML applications. Query workload is used to produce XML schemas which minimize the impact of the reference relationships on query performance. We evaluate our approach through a case study where a set of XML documents are redesigned by our methodology. The results demonstrate that query performance is improved in terms of the number of accesses generated by the queries on the XML documents designed by our approach.

References

[1]
C. Batini, S. Ceri, and S. Navathe. Conceptual Database Design: An Entity-Relationship Approach. The Benjamin/Cummings Publishing Company, 1992.
[2]
S. Bechhofer, F. Harmelen, and J. Hendler. Owl web ontology language reference. 2002.
[3]
L. Bird, A. Goodchild, and T. A. Halpin. Object role modeling and xml-schema. In International Conference on Conceptual Modeling, pages 661--705. Springer Heidelberg, 2000.
[4]
T. Bray and J. P. et. al. Extensible markup language (xml) 1.0 w3c recommendation, 2000.
[5]
M. Choi, J. Lim, and K. Joo. Developing a unified design methodology based on extended entity-relationship model for xml. In International Conference on Computational Science, pages 920--929. Springer Heidelberg, 2003.
[6]
R. Conrad, D. Scheffner, and J. C. Freytag. Xml conceptual modeling using xml. In International Conference on Conceptual Modeling, pages 558--571. Springer, 2000.
[7]
R. Elmasri, J. Weeldreyer, and A. R. Hevner. The category concept: An extension to the entity-relationship model. In Data Knowledge Engineering, number 1, pages 75--116, 1985.
[8]
J. Fong and A. F. et. al. Translating relational schema with constraints into xml schema. In International Journal of Software Engineering and Knowledge Engineering, number 16, pages 201--244, 2006.
[9]
H. Jagadish and S. A.-K. et. al. Timber: A native xml database. In International Journal on Very Large Databases, volume 4, pages 274--291. Springer-Verlag New York, 2002.
[10]
C. Liu and J. Li. Designing quality xml schemas from e-r diagrams. In Advances in Web-Age Information Management, pages 508--519. Springer Heidelberg, 2006.
[11]
R. S. Mello and C. A. Heuser. Binxs: A process for integration of xml schemata. In International Conference on Advanced Information Systems Engineering, pages 151--166. Springer Heidelberg, 2005.
[12]
W. Y. Mok and D. W. Embley. Generating compact redundancy-free xml documents from conceptual-model hypergraphs. In IEEE Transactions on Knowledge and Data Engineering, number 18, pages 1082--1096, 2006.
[13]
P. Pigozzo and E. Quintarelli. An algorithm for generating xml schemas from er schemas. In Italian Symposium on Advanced Database Systems, pages 192--199, 2005.
[14]
N. Routledge, L. Bird, and A. Goodchild. Uml and xml schema. In Australian Database Conference, pages 157--166. IEEE, 2002.
[15]
H. Schöning. Tamino - a dbms designed for xml-schema. In International Conference on Data Engineering, pages 149--154. IEEE, 2001.
[16]
R. Schroeder and R. S. Mello. Conversion of generalization hierarchies and union types from extended entity-relationship model to an xml logical model. In ACM Symposium on Applied Computing, pages 1036--1037. ACM Press, 2008.
[17]
J. M. Smith and D. C. P. Smith. Database abstractions: Aggregation and generalization. In ACM Transactions on Database Systems, volume 2, pages 105--133. IEEE, 1977.
[18]
H. Thompson and D. B. et. al. Xml schema part 1: Structures w3c recommendation, 2004.
[19]
N. Wiwatwattana and H. J. et. al. Making designer schemas with colors. In International Conference on Data Engineering. IEEE, 2006.
[20]
Z. Xu and Z. G. et. al. Dynamic tuning of xml storage schema in vxmlr. In International Database Engineering and Applications Symposium, pages 76--86. IEEE, 2003.

Cited By

View all
  • (2012)On evaluating an approach for balancing the trade‐off on XML schema designInternational Journal of Web Information Systems10.1108/174400812112828748:4(371-389)Online publication date: 16-Nov-2012
  • (2011)A workload-aware approach for optimizing the XML schema design trade-offProceedings of the 13th International Conference on Information Integration and Web-based Applications and Services10.1145/2095536.2095542(12-19)Online publication date: 5-Dec-2011
  • (2009)Document engineering approaches toward scalable and structured multimedia, web and printable documentsMultimedia Tools and Applications10.1007/s11042-009-0288-643:3(195-202)Online publication date: 1-Jul-2009
  • Show More Cited By

Index Terms

  1. Improving query performance on XML documents: a workload-driven design approach

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DocEng '08: Proceedings of the eighth ACM symposium on Document engineering
    September 2008
    312 pages
    ISBN:9781605580814
    DOI:10.1145/1410140
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 September 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. XML schemas
    2. conceptual schemas
    3. query performance

    Qualifiers

    • Research-article

    Conference

    DocEng '08
    Sponsor:
    DocEng '08: ACM Symposium on Document Engineering
    September 16 - 19, 2008
    Sao Paulo, Brazil

    Acceptance Rates

    DocEng '08 Paper Acceptance Rate 21 of 62 submissions, 34%;
    Overall Acceptance Rate 194 of 564 submissions, 34%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2012)On evaluating an approach for balancing the trade‐off on XML schema designInternational Journal of Web Information Systems10.1108/174400812112828748:4(371-389)Online publication date: 16-Nov-2012
    • (2011)A workload-aware approach for optimizing the XML schema design trade-offProceedings of the 13th International Conference on Information Integration and Web-based Applications and Services10.1145/2095536.2095542(12-19)Online publication date: 5-Dec-2011
    • (2009)Document engineering approaches toward scalable and structured multimedia, web and printable documentsMultimedia Tools and Applications10.1007/s11042-009-0288-643:3(195-202)Online publication date: 1-Jul-2009
    • (2009)Designing XML documents from conceptual schemas and workload informationMultimedia Tools and Applications10.1007/s11042-009-0272-143:3(303-326)Online publication date: 1-Jul-2009

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media