Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Metrics for data warehouse conceptual models understandability

Published: 01 August 2007 Publication History

Abstract

Due to the principal role of Data warehouses (DW) in making strategy decisions, data warehouse quality is crucial for organizations. Therefore, we should use methods, models, techniques and tools to help us in designing and maintaining high quality DWs. In the last years, there have been several approaches to design DWs from the conceptual, logical and physical perspectives. However, from our point of view, none of them provides a set of empirically validated metrics (objective indicators) to help the designer in accomplishing an outstanding model that guarantees the quality of the DW. In this paper, we firstly summarise the set of metrics we have defined to measure the understandability (a quality subcharacteristic) of conceptual models for DWs, and present their theoretical validation to assure their correct definition. Then, we focus on deeply describing the empirical validation process we have carried out through a family of experiments performed by students, professionals and experts in DWs. This family of experiments is a very important aspect in the process of validating metrics as it is widely accepted that only after performing a family of experiments, it is possible to build up the cumulative knowledge to extract useful measurement conclusions to be applied in practice. Our whole empirical process showed us that several of the proposed metrics seems to be practical indicators of the understandability of conceptual models for DWs.

References

[1]
A. Abelló, J. Samos, F. Saltor, A framework for the classification and description of multidimensional data models, in: 12th International Conference on Database and Expert Systems Applications (DEXA' 01), Springer-Verlag, Munich (Germany), 2001.
[2]
A. Abelló, J. Samos, F. Saltor, YAM2 (Yet Another Multidimensional Model): An Extension of UML, in: International Database Engineering and Applications Symposium (IDEAS 2002), IEEE Computer Society, Edmonton (Canada), 2002, pp. 172-181.
[3]
Albrecht, A.J. and Gaffney, S.H., Source function, source lines of code and development effort prediction: a software science validation. IEEE Transactions on Software Engineering. v9. 639-648.
[4]
Basili, V., Shull, F. and Lanubile, F., Building knowledge through families of experiments. IEEE Transactions on Software Engineering. v25 i4. 435-437.
[5]
Basili, V. and Weiss, D., A Methodology for Collecting Valid Software Engineering Data. IEEE Transactions on Software Engineering. v10. 728-738.
[6]
M. Blaschka, C. Sappia, G. Höfling, B. Dinter, Finding your way through multidimensional data models, 9th International Conference on Database and Expert Systems Applications (DEXA '98), Springer-Verlag, Vienna (Austria), 1998, pp. 198-203.
[7]
L. Briand, K. El Emam, S. Morasca, Theoretical and empirical validation of software product measures, Technical Report ISERN-95-03, International Software Engineering Research Network, 1995.
[8]
Briand, L., Morasca, S. and Basili, V., Property-Based Software Engineering Measurement. IEEE Transactions on Software Engineering. v22 i1. 68-86.
[9]
Briand, L., Wüst, J. and Lounis, H., A Comprehensive Investigation of Quality Factors in Object-Oriented Designs: an Industrial Case Study. International Software Engineering Research Network.
[10]
F. Brito e Abreu, R. Carapuça, Object-Oriented Software Engineering: measuring and controlling the development process, in: 4th International Conference on Software Quality, McLean (USA), 1994.
[11]
Calero, C., . 2001. Computer Science, 2001.University of Castilla-La Mancha, Ciudad Real (Spain).
[12]
C. Calero, M. Piattini, M. Genero, Method for obtaining correct metrics, 3rd International Conference on Enterprise and Information Systems (ICEIS'2001), 2001, pp. 779-784.
[13]
Cantone, G. and Donzelli, P., Production and maintenance of software measurement models. Journal of Software Engineering and Knowledge Engineering. v5. 605-626.
[14]
Chidamber, S. and Kemerer, C., A Metrics Suite for Object Oriented Design. IEEE Transactions on Software Engineering. v20 i6. 476-493.
[15]
C. Eick, A methodology for the design and transformation of conceptual schemas, in: 17th International Conference on Very Large Data Bases, Barcelona (Spain), 1991, pp. 25-34.
[16]
English, L., . 1996. Brentwood, Information Impact International, Inc.
[17]
Fenton, N. and Pfleeger, S., Software Metrics: A Rigorous Approach. 1997. Chapman & Hall, London.
[18]
Flood, R.L. and Carson, E.R., Dealing with Complexity: An Introduction to the Theory and Application of Systems Science. 1993. Springer.
[19]
Genero, M., . 2002. Department of Computer Science, University of Castilla-La Mancha, Ciudad Real (Spain).
[20]
M. Genero, J. Olivas, M. Piattini, F. Romero, Using metrics to predict OO information systems maintainability, in: 13th International Conference Advanced Information Systems Engineering (CAiSE'01) (2001), pp. 388-401.
[21]
Golfarelli, M., Maio, D. and Rizzi, S., The Dimensional Fact Model: A Conceptual Model for Data Warehouses. International Journal of Cooperative Information Systems (IJCIS). v7. 215-247.
[22]
M. Golfarelli, S. Rizzi, A methodological framework for data warehouse design, in: 1st International Workshop on Data Warehousing and OLAP (DOLAP '98), Maryland (USA), 1998, pp. 3-9.
[23]
Gray, R., Carey, B., McGlynn, N. and Pengelly, A., Design metrics for database systems. BT Technology. v9.
[24]
Halstead, M., Elements of Software Science. 1977. Elsevier-North Holland, New York.
[25]
B. Husemann, J. Lechtenbörger, G. Vossen, Conceptual data warehouse design, in: 2nd, International Workshop on Design and Management of Data Warehouses (DMDW 2000), Stockholm (Sweeden), 2000, pp. 3-9.
[26]
IEEE, IEEE Std 1061-1998 IEEE Standard for a Software Quality Metrics Methodology, 1998.
[27]
Inmon, W.H., Building the Data Warehouse. 2003. John Wiley and Sons, USA.
[28]
ISO/IEC, ISO/IEC 15504 TR2:1998, Software Process Assessment - Part 2: A Reference Model for Processes and Process Capability, in: I.I. JTC1/SC7, (Ed.), International Organization for Standardization, 1998.
[29]
ISO/IEC, 9126-1: Software Engineering - Product quality - Part 1: Quality model., 2001.
[30]
ISO/IEC, ISO 15939: Software Engineering - Software Measurement Process, 2002.
[31]
ISO/IEC, ISO/IEC 90003, Software and Systems Engineering - Guidelines for the Application of ISO/IEC 9001:2000 to Computer Software, International Standards Organization, Geneva, Switzerland, 2004.
[32]
Jarke, M., Lenzerini, M., Vassiliou, Y. and Vassiliadis, P., Fundamentals of Data Warehouses. 2002. Springer-Verlag.
[33]
M. Jeusfeld, C. Quix, and M. Jarke, Design and Analysis of Quality Information for Data Warehouses, in: 17th International Conference on Conceptual Modeling (ER'98), Singapore, 1998.
[34]
Juristo, N. and Moreno, A., Basics of Software Engineering Experimentation. 2001. Kluwer Academic Publishers.
[35]
Kesh, S., Evaluating the Quality of Entity Relationship Models. Information and Software Technology. v37. 681-689.
[36]
Kimbal, R. and Ross, M., The Data Warehouse Toolkit. 2002. John Wiley and Sons.
[37]
Kitchenham, B., Pfleeger, S., Pickard, L., Jones, P., Hoaglin, D., El Emam, K. and Rosenberg, J., Preliminary Guidelines for Empirical Research in Software Engineering. IEEE Transactions on Software Engineering. v28. 721-734.
[38]
Klir, G.J. and Elias, D., Architecture of Systems Problem Solving. 2003. Prenum Publishing Corporation, New York.
[39]
Lechtenbörger, J. and Vossen, G., Multidimensional Normal Forms for Data Warehouse Design. Information Systems. v28. 415-434.
[40]
W. Lehner, J. Albretch, H. Wedekind, Normal forms for multidimensional databases, in: 10th International Conference on Scientific and Statistical Database Management (SSDBM), IEEE Press, 1998, pp. 63-72.
[41]
Lorenz, M. and Kidd, J., Object-Oriented Software Metrics: A Practical Guide. 1994. Prentice Hall, Englewood Cliffs (Nueva Jersey).
[42]
S. Luján-Mora, J. Trujillo, I.-Y. Song, Extending UML for multidimensional modeling, in: 5th International Conference on the Unified Modeling Language (UML 2002), LNCS 2460, Dresden (Germany), 2002, pp. 290-304.
[43]
M. Marchesi, OOA metrics for the unified modeling language, in: 2nd Euromicro Conference on Software Maintenance and Reengineering, 1998, pp. 67-73.
[44]
McCabe, T., A Software Complexity Measure. IEEE Transaction on Software Engineering. v2. 308-320.
[45]
McGarry, J., Card, D., Jones, C., Layman, B., Clark, E., Dean, J. and Hall, F., . 2002. Objective Information for Decision Makers, 2002.Wiley.
[46]
D. Moody, Metrics for evaluating the quality of entity relationship models, 17th International Conference on Conceptual Modelling (ER '98), Singapore, 1998, pp. 213-225.
[47]
OMG, OMG Unified Modeling Language Specification; versión 2.0, Object Management Group, 2005.
[48]
Pfleeger, S. and Kitchenham, B., Principles of Survey Research. Part 1: Turning Lemons into Lemonade. ACM Sigsoft. Software Engineering Notes. v26 i6. 16-18.
[49]
G. Poels, G. Dedene, DISTANCE: A Framework for Software Measure Construction, Research Report DTEW9937, Dept. Applied Economics Katholieke Universiteit Leuven, Belgium, 1999, p. 46.
[50]
C. Sapia, On modeling and predicting query behaviour in olap systems, International Workshop on Design and Management of Data Warehouses (DMDW '99), Heidelberg (Germany), 1999, pp. 1-10.
[51]
C. Sapia, M. Blaschka, G. Höfling, B. Dinter, Extending the E/R model for the multidimensional paradigm, 1st International Workshop on Data Warehouse and Data Mining (DWDM' 98), Springer-Verlag, Singapore, 1998, pp. 105-116.
[52]
SEI, Capability Maturity Model Integration (CMMI), version 1.1, 2002.
[53]
Serrano, M., Definition of a Set of Metrics for Assuring Data Warehouse Quality. 2004. Univeristy of Castilla, La Mancha (Spain).
[54]
Serrano, M., Calero, C. and Piattini, M., Validating metrics for data warehouses. IEE Proceedings SOFTWARE. v149. 161-166.
[55]
M. Serrano, C. Calero, J. Trujillo, S. Lujan, M. Piattini, Empirical validation of metrics for conceptual models of data warehouse, 16th International Conference on Advanced Information Systems Engineering (CAISE'04), Riga, Latvia, 2004, pp. 506-520.
[56]
M. Serrano, C. Calero, J. Trujillo, S. Lujan, M. Piattini, Empirical validation of metrics for data warehouses, 4th ASERC Workshop on Quantitative and Soft Computing Based Software Engineering (QSSE 2004), Banff, Alberta (Canada), 2004.
[57]
S. Si-Saı¿d, N. Prat, Multidimensional Schemas Quality: Assessing and Balancing Analyzability and Simplicity. in: M.A.a.P. Jeusfeld, O., (Ed.), ER 2003 Workshops, 2003, pp. 140-151.
[58]
Suppes, P., Krantz, M., Luce, R. and Tversky, A., Foundations of Measurement. 1989. Academic Press, New York.
[59]
Trujillo, J., Palomar, M., Gómez, J. and Song, I.-Y., Designing Data Warehouses with OO Conceptual Models. IEEE Computer. Special issue on Data Warehouses. v34. 66-75.
[60]
N. Tryfona, F. Busborg, J. Christiansen, starER: A Conceptual Model for Data Warehouse Design, ACM 2nd International Workshop on Data Warehousing and OLAP (DOLAP' 99), ACM, Missouri (USA), 1999, pp. 3-8.
[61]
Van Solingen, R. and Berghout, E., The Goal/Question/Metric Method: A Practical Guide for Quality Improvement of Software Development. 1999. McGraw-Hill.
[62]
Vassiliadis, P., Data Warehouse Modeling and Quality Issues. 2000. National Technical University of Athens, Athens(Greece).
[63]
Weyuker, E., Evaluating Software Complexity Measures. IEEE Transactions on Software Engineering. v14 i9. 1357-1365.
[64]
Whitmire, S., Object Oriented Design Measurement. 1997. John Wiley & Sons, Inc.
[65]
Wohlin, C., Runeson, P., Höst, M., Ohlson, M., Regnell, B. and Wesslén, A., Experimentation in Software Engineering: An Introduction. 2000. Kluwer Academic Publishers.
[66]
Zuse, H., A Framework of Software Measurement. 1998. Walter de Gruyter, Berlin.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Information and Software Technology
Information and Software Technology  Volume 49, Issue 8
August, 2007
130 pages

Publisher

Butterworth-Heinemann

United States

Publication History

Published: 01 August 2007

Author Tags

  1. Data warehouse conceptual modelling
  2. Data warehouse metrics
  3. Data warehouse quality
  4. Metric validation

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Multidimensional modeling driven from a domain languageAutomated Software Engineering10.1007/s10515-022-00375-530:1Online publication date: 26-Dec-2022
  • (2021)Requirements-driven data warehouse design based on enhanced pivot tablesRequirements Engineering10.1007/s00766-020-00331-326:1(43-65)Online publication date: 1-Mar-2021
  • (2020)A Semi-Automatic Design Methodology for (Big) Data Warehouse Transforming Facts into DimensionsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.292562133:1(28-42)Online publication date: 7-Dec-2020
  • (2020)The contribution of linked open data to augment a traditional data warehouseJournal of Intelligent Information Systems10.1007/s10844-020-00594-w55:3(397-421)Online publication date: 19-Feb-2020
  • (2019)Empirical investigation of dimension hierarchy sharing-based metrics for multidimensional schema understandabilityInternational Journal of Intelligent Engineering Informatics10.5555/3337636.33376387:2-3(141-163)Online publication date: 1-Jan-2019
  • (2018)A fuzzy-based automatic prediction system for quality evaluation of conceptual data warehouse modelsInternational Journal of Data Analysis Techniques and Strategies10.1504/IJDATS.2018.09413110:3(317-333)Online publication date: 1-Jan-2018
  • (2018)Investigating structural metrics for understandability prediction of data warehouse multidimensional schemas using machine learning techniquesInnovations in Systems and Software Engineering10.1007/s11334-017-0308-z14:1(59-80)Online publication date: 1-Mar-2018
  • (2017)Object-oriented dynamic complexity measures for software understandabilityInnovations in Systems and Software Engineering10.1007/s11334-017-0304-313:2-3(177-190)Online publication date: 1-Sep-2017
  • (2016)Evaluation of user satisfaction with OLAP recommender systemsInternational Journal of Business Information Systems10.1504/IJBIS.2016.07338421:1(117-136)Online publication date: 1-Dec-2016
  • (2014)Adding semantic modules to improve goal-oriented analysis of data warehouses using I-starJournal of Systems and Software10.5555/2747015.274719188:C(102-111)Online publication date: 1-Feb-2014
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media