Abstract
Data warehouse is an information provider that collects necessary data from individual source databases to support the analytical processing of decision-support functions. Recently, research about the indexing technologies of data warehousing has been proposed to help efficient on-line analytical processing (OLAP). In the past decades, some novel indexing technologies of data warehousing were proposed to retrieve the information precisely. However, the concept of similarity indexing technology in the increasingly larger data warehousing was seldom been discussed. In this paper, the performance issue of approximation indexing technology in the data warehousing is discussed and a new similarity indexing method, called bit-wise indexing method, and the corresponding efficient algorithms are proposed for retrieving the similar cases of a case-based reasoning system using a data warehouse to be the storage space. Some experiments are made for comparing the performance with two other methods and the results show the efficiency of the proposed method.
This work was supported by Ministry of Education and National Science Council of the Republic of China under Grand No. 89-E-FA04-1-4, High Confidence Information Systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gonzalez, A.J., Xu, L., Gupta, U.M.: Validation Techniques for Case-Based Reasoning Systems. IEEE Transactions on Systems, Man, and Cybernetic-Part A: Systems And Humans 28(4) (1998) 465–477
Gardingen, D., Watson, I.: A web based CBR system for heating ventilation and air conditioning systems sales support. Knowledge-Based Systems 12 1999 207–214.
Waston, I.: Case-based reasoning is a methodology not a technology. Knowledge-Based Systems 12 (1999) 303–308
Daengdej, J., Lukpse, D., Tsui, E., Beinat, P., Prophet, L.: Combining case-based reasoning and statistical method for proposing solution in RICAD. Knowledge-Based Systems, 1 (1997).153–159
Wu, K.L., Yu, P.S.: Range-Based Bitmap Indexing for High Cardinality Attributes with Skew. Proceedings of 22nd Annual International Conference on Computer Software and Applications (1998) 61–66.
Gupta, K.M., Montazemi, A.R.: Empirical Evaluation of Retrieval in Case-Based Reasoning Systems Using Modified Cosine Matching Function. IEEE Transactions on Systems, Man, and cybernetics-Part A: Systems and Humans 27(5) (1997) 601–612
Shin, K.S., Han, I.: Case-based reasoning supported by genetic algorithms for corporate bond rating. Expert Systems with applications 16 (1999) 85–95
Li, L.L.X.: Knowledge-based problem solving: an approach to health assessment. Expert Systems with Application 16 (1999) 33–42
Wu M.C., Buchmann, A.P.: Encoded Bitmap Indexing for Data Warehouses. Proceedings of IEEE ICDE (1998) 220–230
Suh, M.S., Jhee, W.C., Ko, Y.K., Lee, A.: A case-based expert system approach for quality design. Expert Systems With Applications 15 (1998) 181–190
Cercone, N., Aijun, A., Chan, C.: Rule-Induction and Case-Based Reasoning: Hybrid Architectures Appear Advantageous. IEEE Transactions. On Knowledge and Data Engineering 11(1) (1999), 166–174
O’Neil, P., Quass, D.: Improved Query Performance with Variant Indexes. Proceedings of ACM SIGMOD (1997)
Dutta, S., Wierenga B., Dalebout, A.: Case-Based Reasoning Systems: From Automation to Decision-Aiding and Stimulation. IEEE Transactions. On Knowledge and Data Engineering 9(6) 1997 911–922
Inmon, W.H., Kelley, C.: Rdb/VMS: Developing The Data Warehouse. QED Publishing Group, Boston, Massachusetts (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, WC., Tseng, SS., Chang, LP., Jiang, MF. (2001). A Similarity Indexing Method for the Data Warehousing - Bit-Wise Indexing Method. In: Cheung, D., Williams, G.J., Li, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2001. Lecture Notes in Computer Science(), vol 2035. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45357-1_56
Download citation
DOI: https://doi.org/10.1007/3-540-45357-1_56
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41910-5
Online ISBN: 978-3-540-45357-4
eBook Packages: Springer Book Archive