An Efficient and Grain Preservation Mapping Algorithm: From ER Diagram to Multidimensional Model

  • Conference paper
Advanced Distributed Systems (ISSADS 2005)

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 3563)

Abstract

Many practitioners and researchers advocate that data warehouse data models should incorporate as much of the source data as possible, so that queries at the finest level of detail can be answered. The source data, in turn, are very likely to come from systems designed with ER diagrams. Much research has therefore been devoted to design methodologies that build a multidimensional model from the corresponding source ER diagrams. However, to the best of our knowledge, no algorithm has been proposed that systematically translates an entire ER diagram into a multidimensional model with hierarchical snowflake structures. The algorithm proposed in this paper promises to do so with two characteristics: grain preservation and minimal distance from each table to the fact table. The grain preservation characteristic guarantees that the translated multidimensional model has cohesive granularity among entities. The minimal distance characteristic guarantees that if an entity can be connected to the fact table in the derived model by more than one path, the path with the fewest hops is always chosen. The first characteristic is achieved by translating problematic relationships between entities into bridging tables with weight_factor attributes and by enhancing fact tables with unique primary keys. The second characteristic is achieved by including a revised shortest path algorithm in the translation algorithm, with distance calculated as the number of relationships required between entities.
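The minimal distance characteristic can be illustrated with a small sketch. The abstract does not describe the revised shortest path algorithm itself, so the code below substitutes an ordinary breadth-first search, which finds fewest-hop paths when every relationship counts as one hop. The function name `min_hop_paths` and the entity names (`Sales`, `Store`, `Region`, and so on) are hypothetical and chosen only for illustration.

```python
from collections import deque


def min_hop_paths(relationships, fact):
    """Return {entity: fewest-hop path from the fact table to that entity}.

    Plain breadth-first search; an illustrative stand-in, not the paper's
    revised shortest path algorithm.
    """
    # Treat each ER relationship as an undirected edge of length one.
    adjacency = {}
    for a, b in relationships:
        adjacency.setdefault(a, []).append(b)
        adjacency.setdefault(b, []).append(a)

    paths = {fact: [fact]}
    queue = deque([fact])
    while queue:
        current = queue.popleft()
        for neighbour in adjacency.get(current, []):
            # The first time BFS reaches an entity, it has used the fewest hops.
            if neighbour not in paths:
                paths[neighbour] = paths[current] + [neighbour]
                queue.append(neighbour)
    return paths


# Hypothetical ER diagram: Country is reachable from Sales by two routes.
relationships = [
    ("Sales", "Product"),
    ("Sales", "Store"),
    ("Product", "Category"),
    ("Store", "Region"),
    ("Region", "Country"),
    ("Store", "Country"),
]

paths = min_hop_paths(relationships, "Sales")
print(paths["Country"])  # ['Sales', 'Store', 'Country']
```

In this example `Country` could also be reached through `Region` in three hops, but the two-hop route through `Store` is the one kept, which is the behaviour the abstract describes for entities with more than one path to the fact table.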

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, YT., Hsu, PY. (2005). An Efficient and Grain Preservation Mapping Algorithm: From ER Diagram to Multidimensional Model. In: Ramos, F.F., Larios Rosillo, V., Unger, H. (eds) Advanced Distributed Systems. ISSADS 2005. Lecture Notes in Computer Science, vol 3563. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11533962_30

  • DOI: https://doi.org/10.1007/11533962_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28063-7

  • Online ISBN: 978-3-540-31674-9

  • eBook Packages: Computer Science, Computer Science (R0)