An experimental comparison of the effectiveness of the all-uses and all-edges adequacy criteria

PG Frankl, SN Weiss - Proceedings of the symposium on Testing, analysis, and verification, 1991 - dl.acm.org
Abstract
An experimental comparison of the effectiveness of the all-uses and all-edges test data adequacy criteria was performed. A large number of test sets were randomly generated for each of nine subject programs with subtle errors. For each test set, the percentages of (executable) edges and definition-use associations covered were measured, and it was determined whether the test set exposed an error. Hypothesis testing was used to investigate whether all-uses adequate test sets are more likely to expose errors than are all-edges adequate test sets. All-uses was shown to be significantly more effective than all-edges for five of the subjects; moreover, for four of these, all-uses appeared to guarantee detection of the error. Further analysis showed that in four subjects, all-uses adequate test sets appeared to be more effective than all-edges adequate test sets of the same size. Logistic regression showed that in some, but not all, of the subjects, there was a strong positive correlation between the percentage of definition-use associations covered by a test set and its error-exposing ability.
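The comparison described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the record layout, the names `exposure_rate` and `one_sided_z_test`, and the toy records are hypothetical and are not data from the paper. The pooled two-proportion z-test is a generic stand-in and is not necessarily the test the authors used. It also treats the two groups as independent samples, which is a simplification here because the all-uses adequate sets in the toy data are also all-edges adequate.

```python
import math

# Hypothetical toy records, NOT results from the paper. Each record holds the
# percentage of executable edges and definition-use associations covered by one
# test set, and whether that test set exposed the error.
test_sets = [
    {"edges": 100.0, "uses": 100.0, "exposed": True},
    {"edges": 100.0, "uses": 100.0, "exposed": True},
    {"edges": 100.0, "uses": 80.0, "exposed": False},
    {"edges": 100.0, "uses": 75.0, "exposed": True},
    {"edges": 100.0, "uses": 60.0, "exposed": False},
    {"edges": 90.0, "uses": 70.0, "exposed": False},
]


def exposure_rate(records, criterion):
    """Return (exposing, total) over records that reach 100% on `criterion`."""
    adequate = [r for r in records if r[criterion] == 100.0]
    exposing = sum(1 for r in adequate if r["exposed"])
    return exposing, len(adequate)


def one_sided_z_test(x1, n1, x2, n2):
    """Pooled two-proportion z-test for the alternative p1 > p2.

    Returns (z, p_value). Assumes independent samples, which is a
    simplification chosen for this sketch.
    """
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        # Both groups have identical, degenerate proportions: no evidence.
        return 0.0, 0.5
    z = (x1 / n1 - x2 / n2) / se
    # Upper-tail probability of the standard normal distribution.
    p_value = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_value


uses_hits, uses_total = exposure_rate(test_sets, "uses")
edges_hits, edges_total = exposure_rate(test_sets, "edges")
z, p_value = one_sided_z_test(uses_hits, uses_total, edges_hits, edges_total)
```

With real measurements, a small `p_value` would support the hypothesis that all-uses adequate test sets expose the error more often than all-edges adequate ones. The toy sample is far too small to support any conclusion.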