Abstract
A Classification Association Rule (CAR), a common type of mined knowledge in Data Mining, describes an implicative co-occurring relationship between a set of binary-valued data-attributes (items) and a pre-defined class, expressed in the form of an “antecedent \(\Rightarrow\) (consequent-class” rule. Classification Association Rule Mining (CARM) is a recent Classification Rule Mining (CRM) approach that builds an Association Rule Mining (ARM) based classifier using CARs. Regardless of which particular methodology is used to build it, a classifier is usually presented as an ordered CAR list, based on an applied rule ordering strategy. Five existing rule ordering mechanisms can be identified: (1) Confi-dence-Support-size_of_Antecedent (CSA), (2) size_of_Antecedent-Confidence-Support (ACS), (3) Weighted Relative Accuracy (WRA), (4) Laplace Accuracy, and (5) (χ 2 Testing. In this paper, we divide the above mechanisms into two groups: (i) pure “support-confidence” framework like, and (ii) additive score assigning like. We consequently propose a hybrid rule ordering approach by combining one approach taken from (i) and another approach taken from (ii). The experimental results show that the proposed rule ordering approach performs well with respect to the accuracy of classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In: Buneman, P., Jajodia, S. (eds.) SIGMOD 1993. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington, DC, May 1993, pp. 207–216. ACM Press, New York (1993)
Agrawal, R., Srikant, R.: Fast Algorithm for Mining Association Rules. In: Bocca, J.B., Jarke, M., Zaniolo, C. (eds.) VLDB 1994. Proceedings of the 20th International Conference on Very Large Data Bases, Santiago de Chile, Chile, September 1994, pp. 487–499. Morgan Kaufmann Publishers, San Francisco (1994)
Ali, K., Manganaris, S., Srikant, R.: Partial Classification using Association Rules. In: Heckerman, D., Mannila, H., Pregibon, D., Uthurusamy, R. (eds.) KDD 1997. Proceedings of the Third International conference on Knowledge Discovery and Data Mining, Newport Beach, California, August 1997, pp. 115–118. AAAI Press, Menlo Park (1997)
Blake, C.L., Merz, C.J.: UCI Repository of Machine Learning Databases. Department of Information and Computer Science, University of California, Irvine, CA, United States (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Clark, P., Boswell, R.: Rule Induction with CN2: Some Recent Improvement. In: Kodratoff, Y. (ed.) Machine Learning - EWSL-91. LNCS, vol. 482, pp. 111–116. Springer, Heidelberg (1991)
Coenen, F.: The LUCS-KDD Discretised/Normalised ARM and CARM Data Library. Department of Computer Science, The University of Liverpool, UK (2003), http://www.csc.liv.ac.uk/~frans/KDD/Software/LUCS-KDD-DN
Coenen, F., Leng, P.: An Evaluation of Approaches to Classification Rule Selection. In: ICDM 2004. Proceedings of the 4th IEEE International Conference on Data Mining, Brighton, November 2004, pp. 359–362. IEEE Computer Society Press, Los Alamitos (2004)
Coenen, F., Leng, P., Ahmed, S.: Data Structure for Association Rule Mining: T-trees and P-trees. IEEE Transactions on Knowledge and Data Engineering 16(6), 774–778 (2004)
Coenen, F., Leng, P., Zhang, L.: Threshold Tuning for Improved Classification Association Rule Mining. In: Ho, T.-B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 216–225. Springer, Heidelberg (2005)
Freitas, A.A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer, Heidelberg (2002)
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: Chen, W., Naughton, J.F., Bernstein, P.A. (eds.) SIGMOD 2000. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, TX, May 2000, pp. 1–12. ACM Press, New York (2000)
Lavrac, N., Flach, P., Zupan, B.: Rule Evaluation Measures: A Unifying View. In: Džeroski, S., Flach, P.A. (eds.) Inductive Logic Programming. LNCS (LNAI), vol. 1634, pp. 174–185. Springer, Heidelberg (1999)
Li, W., Han, J., Pei, J., CMAR,: CMAR: Accurate and Efficient Classification based on Multiple Class-association Rules. In: Cercone, N., Lin, T.Y., Wu, X. (eds.) ICDM 2001. Proceedings of the 2001 IEEE International Conference on Data Mining, San Jose, November 29 –December 2, 2001, pp. 369–376. IEEE Computer Society Press, Los Alamitos (2001)
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Agrawal, R., Stolorz, P.E., Piatetsky-Shapiro, G. (eds.) KDD 1998. Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, New York City, August 1998, pp. 80–86. AAAI Press, Menlo Park (1998)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
Quinlan, J.R., Cameron-Jones, R.M.: FOIL: A Midterm Report. In: Brazdil, P.B. (ed.) Machine Learning: ECML 1993. LNCS, vol. 667, pp. 3–20. Springer, Heidelberg (1993)
Yin, X., Han, J.: CPAR: Classification based on Predictive Association Rules. In: Barbara, D., Kamath, C. (eds.) SDM 2003. Proceedings of the Third SIAM International Conference on Data Mining, San Francisco, May 2003, pp. 331–335. SIAM, Philadelphia, PA (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, Y.J., Xin, Q., Coenen, F. (2007). A Novel Rule Ordering Approach in Classification Association Rule Mining. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2007. Lecture Notes in Computer Science(), vol 4571. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73499-4_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-73499-4_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73498-7
Online ISBN: 978-3-540-73499-4
eBook Packages: Computer ScienceComputer Science (R0)