Abstract
Many practitioners and researchers advocate that data warehouse data models should incorporate as much of the source data as possible, so that queries at the finest level of detail can be answered. The source data, in turn, are very likely to come from systems designed with ER diagrams. Much research has therefore been devoted to design methodologies that build multidimensional models from the corresponding source ER diagrams. However, to the best of our knowledge, no algorithm has been proposed that systematically translates an entire ER diagram into a multidimensional model with hierarchical snowflake structures. The algorithm proposed in this paper does so with two characteristics: grain preservation and minimal distance from each table to the fact table. Grain preservation guarantees that the translated multidimensional model has cohesive granularity among entities. Minimal distance guarantees that if an entity can be connected to the fact table in the derived model by more than one path, the path with the fewest hops is always chosen. The first characteristic is achieved by translating problematic relationships between entities using weight_factor attributes in bridging tables and by enhancing fact tables with unique primary keys. The second characteristic is achieved by including a revised shortest path algorithm in the translation algorithm, with distance calculated as the number of relationships required between entities.
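The minimal distance characteristic can be illustrated with a small sketch. The abstract does not spell out the revised shortest path algorithm, so the code below substitutes a plain breadth-first search, which finds fewest-hop paths when every relationship counts as one hop. The function name `fewest_hop_paths` and the entity names (Sales, Store, City, Region, and so on) are hypothetical and chosen only for illustration.

```python
from collections import deque


def fewest_hop_paths(relationships, fact):
    """For every entity reachable from `fact`, return one path that uses
    the fewest relationships (hops). Plain BFS, used here as a stand-in
    for the paper's revised shortest path algorithm."""
    # Treat each relationship as an undirected edge of length one.
    adjacency = {}
    for a, b in relationships:
        adjacency.setdefault(a, []).append(b)
        adjacency.setdefault(b, []).append(a)

    # BFS visits entities in order of increasing hop count, so the first
    # time an entity is reached, it is reached by a fewest-hop path.
    parent = {fact: None}
    queue = deque([fact])
    while queue:
        current = queue.popleft()
        for neighbour in adjacency.get(current, []):
            if neighbour not in parent:
                parent[neighbour] = current
                queue.append(neighbour)

    # Rebuild each path by walking parent links back to the fact table.
    paths = {}
    for entity in parent:
        path, node = [], entity
        while node is not None:
            path.append(node)
            node = parent[node]
        paths[entity] = path[::-1]
    return paths


# Hypothetical ER diagram: Region is reachable from Sales both through
# Store -> City -> Region (3 hops) and through Store -> Region (2 hops).
relationships = [
    ("Sales", "Product"),
    ("Sales", "Store"),
    ("Product", "Category"),
    ("Store", "City"),
    ("City", "Region"),
    ("Store", "Region"),
]
paths = fewest_hop_paths(relationships, "Sales")
print(paths["Region"])  # ['Sales', 'Store', 'Region']
```

In this example Region attaches to the fact table through Store directly, rather than through the longer Store, City, Region chain, which is the behaviour the minimal distance characteristic describes.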
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, YT., Hsu, PY. (2005). An Efficient and Grain Preservation Mapping Algorithm: From ER Diagram to Multidimensional Model. In: Ramos, F.F., Larios Rosillo, V., Unger, H. (eds) Advanced Distributed Systems. ISSADS 2005. Lecture Notes in Computer Science, vol 3563. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11533962_30
DOI: https://doi.org/10.1007/11533962_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28063-7
Online ISBN: 978-3-540-31674-9
eBook Packages: Computer Science, Computer Science (R0)