Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Improving the interpretability of classification rules discovered by an ant colony algorithm: Extended results

Published: 01 September 2016 Publication History

Abstract

Most ant colony optimization ACO algorithms for inducing classification rules use a ACO-based procedure to create a rule in a one-at-a-time fashion. An improved search strategy has been proposed in the cAnt-Miner<inline-formula><inline-graphic xlink="EVCO_a_00155inline1.gif" xlink:type="simple"/></inline-formula> algorithm, where an ACO-based procedure is used to create a complete list of rules ordered rules, i.e., the ACO search is guided by the quality of a list of rules instead of an individual rule. In this paper we propose an extension of the cAnt-Miner<inline-formula><inline-graphic xlink="EVCO_a_00155inline2.gif" xlink:type="simple"/></inline-formula> algorithm to discover a set of rules unordered rules. The main motivations for this work are to improve the interpretation of individual rules by discovering a set of rules and to evaluate the impact on the predictive accuracy of the algorithm. We also propose a new measure to evaluate the interpretability of the discovered rules to mitigate the fact that the commonly used model size measure ignores how the rules are used to make a class prediction. Comparisons with state-of-the-art rule induction algorithms, support vector machines, and the cAnt-Miner<inline-formula><inline-graphic xlink="EVCO_a_00155inline3.gif" xlink:type="simple"/></inline-formula> producing ordered rules are also presented.

References

[1]
Bacardit, J., Burke, E., and Krasnogor, N. (2009). Improving the scalability of rule-based evolutionary learning. Memetic Computing, 1(1): 55-67.
[2]
Clark, P., and Boswell, R. (1991). Rule induction with CN2: Some recent improvements. In Proceedings of the European Conference on Machine Learning, pp. 151-163.
[3]
Cohen, W. (1995). Fast effective rule induction. In Proceedings of the International Conference on Machine Learning, pp. 115-123.
[4]
Dem¿ar, J. (2006). Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research, 7:1-30.
[5]
Fayyad, U., Piatetsky-Shapiro, G., and Smith, P. (1996). From data mining to knowledge discovery: An overview. In U. Fayyad, G. Piatetsky-Shapiro, P. Smith, and R. Uthurusamy (Eds.), Advances in knowledge discovery and data mining, pp. 1-34. Cambridge, MA: MIT Press.
[6]
Franco, M., Krasnogor, N., and Bacardit, J. (2012). Post-processing operators for decision lists. In Proceedings of the Conference on Genetic and Evolutionary Computation (GECCO), pp. 847-854.
[7]
Frank, E., and Witten, I. (1998). Generating accurate rule sets without global optimization. In Proceedings of the International Conference on Machine Learning, pp. 144-151.
[8]
Freitas, A. (2013). Comprehensible classification models: A position paper. ACM SIGKDD Explorations, 15(1): 1-10.
[9]
Freitas, A., Parpinelli, R., and Lopes, H. (2008). Ant colony algorithms for data classification. In Encyclopedia of information science and technology, 2nd ed., vol. 1, pp. 154-159. Hershey, PA: IGI Global.
[10]
García, S., and Herrera, F. (2008). An extension on "statistical comparisons of classifiers over multiple data sets" for all pairwise comparisons. Journal of Machine Learning Research, 9:2677-2694.
[11]
Holden, N., and Freitas, A. (2008). A hybrid PSO/ACO algorithm for discovering classification rules in data mining. Journal of Artificial Evolution and Applications, January, article 2.
[12]
Huysmans, J., Dejaeger, K., Mues, C., Vanthienen, J., and Baesens, B. (2011). An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models. Decision Support Systems, 51:141-154.
[13]
Lavesson, H.A.N. (2011). User-oriented assessment of classification model understandability. In Proceedings of the Scandinavian Conference on Artificial Intelligence, pp. 11-19.
[14]
Lavrac, N. (1999). Selected techniques for data mining in medicine. Artificial Intelligence in Medicine, 16:3-23.
[15]
Lichman, M. (2013). UCI machine learning repository. Irvine, CA: University of California, School of Information and Computer Science. http://archive.ics.uci.edu/ml
[16]
Martens, D., Baesens, B., and Fawcett, T. (2011). Editorial survey: Swarm intelligence for data mining. Machine Learning, 82(1): 1-42.
[17]
Medland, M., Otero, F., and Freitas, A. (2012). Improving the cAnt-MinerPB classification algorithm. In Proceedings of the International Conference on Swarm Intelligence, pp. 73-84.
[18]
Nalini, C., and Balasubramanie, P. (2008). Discovering unordered rule sets for mixed variables using an ant-miner algorithm. Data Science Journal, 7:76-87.
[19]
Olmo, J., Romero, J., and Ventura, S. (2011). Using ant programming guided by grammar for building rule-based classifiers. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 41:1585-1599.
[20]
Olmo, J., Romero, J., and Ventura, S. (2012). Classification rule mining using ant programming guided by grammar with multiple Pareto fronts. Soft Computing, 16:2143-2163.
[21]
Otero, F., and Freitas, A. (2013). Improving the interpretability of classification rules discovered by an ant colony algorithm. In Proceedings of the Conference on Genetic and Evolutionary Computation (GECCO), pp. 73-80.
[22]
Otero, F., Freitas, A., and Johnson, C. (2008). cAnt-Miner: An ant colony classification algorithm to cope with continuous attributes. In Proceedings of the International Conference on Swarm Intelligence, pp. 48-59.
[23]
Otero, F., Freitas, A., and Johnson, C. (2009). Handling continuous attributes in ant colony classification algorithms. In Proceedings of the IEEE Symposium on Computational Intelligence in Data Mining, pp. 225-231.
[24]
Otero, F., Freitas, A., and Johnson, C. (2013). A new sequential covering strategy for inducing classification rules with ant colony algorithms. IEEE Transactions on Evolutionary Computation, 17(1): 64-76.
[25]
Parpinelli, R., Lopes, H., and Freitas, A. (2002). Data mining with an ant colony optimization algorithm. IEEE Transactions on Evolutionary Computation, 6(4): 321-332.
[26]
Piatetsky-Shapiro, G., and Frawley, W. (Eds.) (1991). Knowledge discovery in databases. Menlo Park, CA: AAAI Press.
[27]
Quinlan, J. R. (1993). C4.5: Programs for machine learning. San Francisco: Morgan Kaufmann.
[28]
Quinlan, J. R. (1996). Improved use of continuous attributes in C4.5. Journal of Artificial Intelligence Research, 4:77-90.
[29]
Schwabacher, M., and Langley, P. (2001). Discovering communicable scientific knowledge from spatio-temporal data. In Proceedings of the International Conference on Machine Learning, pp. 489-496.
[30]
Smaldon, J., and Freitas, A. (2006). A new version of the Ant-Miner algorithm discovering unordered rule sets. In Proceedings of the Conference on Genetic and Evolutionary Computation (GECCO), pp. 43-50.
[31]
Tsumoto, S. (2000). Clinical knowledge discovery in hospital information systems: Two case studies. In Proceedings of the European Conference on Principles and Practice of Knowledge Discovery and Data Mining, pp. 652-656.
[32]
Vapnik, V. N. (2000). The nature of statistical learning theory, 2nd ed. Berlin: Springer.
[33]
Witten, H., and Frank, E. (2011). Data mining: Practical machine learning tools and techniques, 3rd ed. San Francisco: Morgan Kaufmann.

Cited By

View all
  1. Improving the interpretability of classification rules discovered by an ant colony algorithm: Extended results

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Evolutionary Computation
      Evolutionary Computation  Volume 24, Issue 3
      Fall 2016
      185 pages
      ISSN:1063-6560
      EISSN:1530-9304
      Issue’s Table of Contents

      Publisher

      MIT Press

      Cambridge, MA, United States

      Publication History

      Published: 01 September 2016
      Published in EVOL Volume 24, Issue 3

      Author Tags

      1. Ant colony optimization
      2. classification
      3. comprehensibility
      4. data mining
      5. sequential covering
      6. unordered rules

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Notions of explainability and evaluation approaches for explainable artificial intelligenceInformation Fusion10.1016/j.inffus.2021.05.00976:C(89-106)Online publication date: 22-Apr-2022
      • (2021)Network Traffic Statistics Method for Resource-Constrained Industrial Project Group Scheduling under Big DataWireless Communications & Mobile Computing10.1155/2021/55946632021Online publication date: 1-Jan-2021
      • (2021)Classification Algorithm Using Branches ImportanceNeural Processing Letters10.1007/s11063-021-10664-x54:2(891-907)Online publication date: 5-Nov-2021
      • (2019)A cost-sensitive meta-learning classifierFuture Generation Computer Systems10.1016/j.future.2019.05.080100:C(1031-1043)Online publication date: 1-Nov-2019
      • (2017)MYRAProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3067695.3082471(1247-1254)Online publication date: 15-Jul-2017
      • (2017)Extraction and optimization of classification rules for temporal sequencesKnowledge-Based Systems10.1016/j.knosys.2017.02.001122:C(148-158)Online publication date: 15-Apr-2017

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media