Abstract
Bayesian confirmation measures, a special class of interestingness measures, are functions usually adopted in ranking inductive rules generated by data mining methods such as association rule mining, decision trees, rough sets. Till now a plethora of measures have been defined in many different ways. Identifying and effectively distinguishing among them is a difficult task. In this paper we propose a unified visual approach aimed at comparing and classifying a large subset of Bayesian confirmation measures (those satisfying the initial and final probability dependence condition). We first reduce the set of variables in their analytical expression to only two, thus allowing to draw their contour lines on the plane. We observe that two dimensional contour lines plots represent a sort of fingerprints of the confirmation measures and, therefore, this geometric visualization can be used as an effective tool in order to investigate properties and behavior of the measures. We highlight the potential of this approach not only to study known measures but also in order to invent new measures satisfying given required characteristics. We finally define, following the geometry of the plots, a new set of symmetry properties of confirmation measures and describe geometrically four classical symmetries.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
IFPD stands for initial and final probability dependence (Crupi et al. 2010)
ROC stands for receiver operating characteristics (see e.g. Fawcett 2006)
Conjunction effects are “situations in which two hypotheses H1 and H2 are both confirmed by some evidence E, and in which the conjunction \(H1\wedge H2\) is even more highly confirmed” (Atkinson 2012).
Even if in the general definition of confirmation space, i.e. the domain of a Bayesian confirmation measure c(x, y), we stated that it corresponds to the open square (0,1)x(0,1) in many cases it can be extended to the square (0,1]x(0,1) by including also the points of the line x=1, i.e. \(P(H{\vert }E)=1\).
The logarithm has been used in order to meet Bayesian confirmation measures sign condition.
References
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data (SIGMOD ’93). ACM, New York, pp 207–216
Atkinson D (2012) Confirmation and justification. A commentary on Shogenji’s measure. Synthese 184(1):49–61
Błaszczyński J, Greco S, Matarazzo B, Słowiński R, Szeląg M (2012) jMAF-dominance-based rough set data analysis framework, rough sets and intelligent systems—Professor Zdzisław Pawlak in memoriam., Intelligent systems reference librarySpringer, Berlin
Błaszczyński J, Słowiński R, Szeląg M (2011) Sequential covering rule induction algorithm for variable consistency rough set approaches. Inform Sci 181(5):987–1002
Carnap R (1950) Logical foundations of probability. University of Chicago Press, Chicago
Celotto E (2014) Sensitivity and simmetry of Confirmation Measures, Working Paper n. 22/2014, October 2014. Department Management, Università Ca’ Foscari Venezia (link: http://virgo.unive.it/wpideas/storage/2014wp22.pdf), ISSN: 2239-2734
Celotto E, Ellero A, Ferretti P (2015) Conveying tourist ratings into an overall destination evaluation. Proc Soc Behav Sci 188:35–41
Christensen D (1999) Measuring confirmation. J Philos 96(9):437–461
Crupi V, Festa R, Buttasi C (2010) Towards a grammar of Bayesian confirmation. In: Suárez M, Dorato M, Rédei M (eds) Epistemology and methodology of science. Springer, Dordrecht, pp 73–93
Crupi V, Tentori K (2014) State of the field: measuring information and confirmation. Stud Hist Philos Sci 47(2014):81–90
Crupi V, Tentori K, Gonzalez M (2007) On Bayesian measures of evidential support: theoretical and empirical issues. Philos Sci 74(2):229–252
Eells E, Fitelson B (2002) Symmetries and asymmetries in evidential support. Philos Stud 107(2):129–142
Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874
Finch H-A (1999) Confirming power of observations metricized for decisions among hypotheses. Philos Sci 27, pp 293–207 (part I), pp 391–404 (part II)
Fitelson B (2001) Studies in Bayesian confirmation theory. Ph.D. Thesis, University of Wisconsin, Madison
Fitelson B (2007) Likelihoodism, Bayesianism, and relational confirmation. Synthese 156(3):473–489
Flach P-A (2003) The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proceedings of the 20th International conference on machine learning (ICML’03), AAAI Press, pp 194–201
Fürnkranz J (1999) Separate-and-conquer rule learning. Artif Intell Rev 13(1):3–54
Fürnkranz J (2005) From local to global patterns: evaluation issues in rule learning algorithms. In: Proceedings of the 2004 international conference on Local Pattern Detection (LPD’04), Springer, Berlin, pp 20–38
Fürnkranz J, Flach P-A (2003a) An analysis of rule evaluation metrics. In: Proceedings of the 20th international conference on machine learning (ICML’03), AAAI Press, Washington, DC, pp 202–209
Fürnkranz J, Flach P-A (2003b) An analysis of rule learning heuristics. Department of Computer Science, University of Bristol, CSTR-03-002, February 2003
Fürnkranz J, Flach P-A (2005) ROC ‘n’ rule learning—towards a better understanding of covering algorithms. Mach Learn 58(1):39–77
Geng L, Hamilton H-J (2006) Interestingness measures for data mining: a survey. ACM Comput Surv 38(3):1–32
Glass D-H (2013) Confirmation measures of association rule interestingness. Knowl Based Syst 44:65–77
Good I-J (1950) Probability and the weighing of evidence. Hafners, New York
Good I-J (1985) Weight of Evidence: A Brief Survey, Bayesian Statistics 2, In: J. M. Bernardo, M. H. DeGroot, D. V. Lindley and A. F. M. Smith (eds). Proceedings of the valencia international meetings on Bayesian statistics, , Elsevier Science Publishers B.V, Auckland, pp 249–270
Greco S, Matarazzo B, Słowiński R (2008) Parameterized rough set model using rough membership and Bayesian confirmation measures. Int J Approx Reason 49(2):285–300
Greco S, Matarazzo B, Slowinski R, Stefanowski J (2001) An algorithm for induction of decision rules consistent with dominance principle, In: Revised papers from the second international conference on rough sets and current trends in computing (RSCTC ’00), Springer, pp 304–313
Greco S, Pawlak Z, Słowiński R (2004) Can Bayesian confirmation measures be useful for rough set decision rules? Eng Appl Artif Intell 17(4):345–361
Greco S, Słowiński R, Szczęch I (2012) Properties of rule interestingness measures and alternative approaches to normalization of measures. Inform Sci 216:1–16
Kemeny J, Oppenheim P (1952) Degrees of factual support. Philos Sci 19:307–324
Keynes J-M (1921) A treatise on probability. Macmillan, London
Lavrač N, Flach P-A, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming (ILP ’99), Springer, pp 174–185
Lenca P, Meyer P, Vaillant B, Lallich S (2008) On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur J Oper Res 184(2):610–626
Mortimer H (1988) The logic of induction. Prentice Hall, paramus
Nozick R (1981) Philosophical explanations. Clarendon Press, Oxford
Pawlak Z (1991) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, Boston
Popper K-R (1959) The logic of scientific discovery. Hutchinson, London
Quinlan J-R (1986) Induction of decision trees. Mach Learn 1(1):81–106
Rescher N (1958) A theory of evidence. Philos Sci 25:83–94
Rips L-J (2001) Two kinds of reasoning. Psychol Sci 12:129–134
Shogenji T (2012) The degree of epistemic justification and the conjunction fallacy. Synthese 184(1):29–48
Susmaga R, Szczęch I (2013a) The Property of \(\chi ^{2}_{\{01\}}\)-Concordance for Bayesian confirmation measures. In: Proceedings of the 10th international conference modelling decisions for artificial intelligence (MDAI 2013), LNCS, Springer, vol 8234, pp 226–236
Susmaga R, Szczęch I (2013) Visualization of interestingness measures. In: Proceedings of the 6th language & technology conference: human language technologies as a challenge for computer science and linguistics. Fundacja UAM, Poznań, pp 95–99
Tan P-N, Kumar V, Srivastava J (2004) Selecting the right objective measure for association analysis. Inform Syst 29(4):293–313
Tentori K, Crupi V, Bonini N, Osherson D (2007) Comparison of confirmation measures. Cognition 103:107–119
Tew C, Giraud-Carrier C, Tanner K, Burton S (2014) Behavior-based clustering and analysis of interestingness measures for association rule mining. Data Min Knowl Discov 28(4):1004–1045
Todhunter I (1865) A history of mathematical theory of probability from the time of Pascal to that of Laplace. Macmillan, London reprinted, (1949) 1965. Chelsea Publishing Company, New York
Vilalta R, Oblinger D (2000) A quantification of distance-bias between evaluation metrics in classification. In: Proceedings of the 17th international conference on machine learning (ICML-00) Stanford. Morgan Kaufmann, pp 1087–1094
Yao Y-Y, Zhong N (1999) An analysis of quantitative measures associated with rules, In: Proceedings of the third Pacific-Asia conference on methodologies for knowledge discovery and data mining (PAKDD ’99), Springer, pp 479–488
Yule G-U (1900) On the association of attributes in statistics: with illustrations from the material of the Childhood Society, & c. Philos Trans R Soc Lond A 194:257–319
Yule G-U (1912) On the methods of measuring the association between two attributes. J R Stat Soc 75:579–652
Zhou B, Yao Y-Y (2014) Feature selection based on confirmation-theoretic rough sets, In: Proceedings of the 9th international conference on rough sets and current trends in computing (RSCTC 2014), Lecture Notes in Computer Science, vol 8536, pp 181–188
Author information
Authors and Affiliations
Corresponding author
Additional information
Responsible editor: Johannes Fürnkranz.
The present paper is part of the research project Development of a Web marketing tool for assessing and positioning tourist-cultural destinations jointly supported by the Department of Management of the University Ca’ Foscari-Venezia, CISET-International Centre for Studies on Tourism Economics and Fondazione Ca’ Foscari Venezia.
Rights and permissions
About this article
Cite this article
Celotto, E. Visualizing the behavior and some symmetry properties of Bayesian confirmation measures. Data Min Knowl Disc 31, 739–773 (2017). https://doi.org/10.1007/s10618-016-0487-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10618-016-0487-5