Abstract
As web applications grow in number and complexity, there is a need for efficient mappings from XML schemas to flat relational tables so that the existing functions of relational database systems can be utilized. However, many existing mapping methods, such as the model-based and structure-based methods, do not exploit query history to improve query performance. In this paper, we propose the use of a genetic algorithm (GA) in a cost-based approach for converting an XML schema to relational tables. By formulating the mapping problem as a cost optimization task with respect to a set of weighted frequent queries, we obtain an efficient mapping that minimizes query execution time. Our experiments show that the mapping obtained by the GA is superior to those produced by non-cost-based approaches. In particular, the GA approach outperforms the greedy heuristic on browsing queries, where the accessed attributes are many and scattered.
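The abstract does not give the chromosome encoding, the cost model, or the GA operators, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a chromosome that assigns each attribute to a table, a toy cost that charges each weighted query for the width of the tables it touches plus a fixed penalty per join, and standard tournament selection, one-point crossover, and mutation. The attribute names, query workload, `JOIN_COST`, `WIDTH_COST`, `cost`, and `evolve` are all hypothetical.

```python
import random

# Hypothetical workload: each query is (weight, set of attribute indices it reads).
ATTRS = ["title", "year", "director", "actor", "genre", "plot"]
QUERIES = [
    (0.5, {0, 1}),
    (0.3, {0, 2, 3}),
    (0.2, {4, 5}),
]
JOIN_COST = 5.0   # assumed penalty for each extra table a query must join
WIDTH_COST = 1.0  # assumed cost per attribute stored in a table the query reads


def cost(chromosome, queries=QUERIES):
    """Weighted workload cost; chromosome[i] is the table holding attribute i."""
    width = {}
    for table in chromosome:
        width[table] = width.get(table, 0) + 1
    total = 0.0
    for weight, attrs in queries:
        tables = {chromosome[a] for a in attrs}
        scan = sum(width[t] for t in tables) * WIDTH_COST
        joins = (len(tables) - 1) * JOIN_COST
        total += weight * (scan + joins)
    return total


def evolve(n_attrs, generations=60, pop_size=30, mutation_rate=0.1, seed=0):
    """Search for a low-cost attribute-to-table assignment (needs n_attrs >= 2)."""
    rng = random.Random(seed)
    # Start from the two trivial mappings (one wide table, one table per
    # attribute) plus random assignments.
    population = [[0] * n_attrs, list(range(n_attrs))]
    while len(population) < pop_size:
        population.append([rng.randrange(n_attrs) for _ in range(n_attrs)])
    for _ in range(generations):
        population.sort(key=cost)
        next_gen = population[:2]  # elitism: carry over the two best mappings
        while len(next_gen) < pop_size:
            # Tournament selection of two parents.
            p1 = min(rng.sample(population, 3), key=cost)
            p2 = min(rng.sample(population, 3), key=cost)
            cut = rng.randrange(1, n_attrs)  # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Mutation: occasionally move an attribute to a random table.
            child = [
                rng.randrange(n_attrs) if rng.random() < mutation_rate else gene
                for gene in child
            ]
            next_gen.append(child)
        population = next_gen
    return min(population, key=cost)


if __name__ == "__main__":
    best = evolve(len(ATTRS))
    print(best, cost(best))
```

Because the two trivial mappings are in the initial population and the best individuals are always carried over, the returned mapping is never worse than either baseline under this toy cost. A realistic cost function would instead estimate query execution time from database statistics.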
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ng, V., Kong, C.C., Chan, S. (2004). Mapping XML Schema to Relations Using Genetic Algorithm. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science, vol 3215. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30134-9_33
DOI: https://doi.org/10.1007/978-3-540-30134-9_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23205-6
Online ISBN: 978-3-540-30134-9
eBook Packages: Springer Book Archive