The use of mutation in testing experiments and its sensitivity to external threats

DOI: 10.1145/2001420.2001461
Published: 17 July 2011

Abstract

Mutation analysis has emerged as a standard approach for the empirical assessment of testing techniques: test practitioners judge the cost-effectiveness of testing strategies by the number of mutants the techniques detect. Although fundamental to the rigor of empirical software testing, the use of mutants in the absence of real-world faults has raised the concern of whether mutants and real faults exhibit similar properties. This paper revisits this important concern and reports findings on mutants and on whether these synthetic faults can predict the fault-detection ability of test suites. The results of the controlled experiments conducted in this paper show that mutation, when used in testing experiments, is highly sensitive to external threats caused by several influential factors, including the mutation operators applied, the test suite size, and the programming language. The paper raises awareness about the use of mutation in testing experiments and suggests that any interpretation or generalization of experimental findings based on mutation should be justified with respect to the influential factors involved.
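
For readers unfamiliar with the underlying measure, the following minimal Python sketch shows how a mutation score is computed. The function under test, the hand-written mutants, and the toy test suite are all invented for this illustration; real experiments generate mutants automatically with operator-based tools such as Proteum (for C) or MuJava (for Java).

    # Minimal sketch of mutation analysis. Mutants are hand-written here
    # purely for illustration; tools like Proteum and MuJava derive them
    # automatically by applying mutation operators to the source code.

    def original(a, b):
        # Function under test: absolute difference of two integers.
        return a - b if a > b else b - a

    # Each mutant is the original with one small syntactic change,
    # mimicking common mutation operators.
    def mutant_ror(a, b):    # relational operator replacement: '>' -> '>='
        return a - b if a >= b else b - a

    def mutant_aor(a, b):    # arithmetic operator replacement: '-' -> '+'
        return a + b if a > b else b - a

    def mutant_const(a, b):  # constant perturbation: result off by one
        return (a - b if a > b else b - a) + 1

    MUTANTS = [mutant_ror, mutant_aor, mutant_const]

    # A test suite: (inputs, expected output) pairs.
    TEST_SUITE = [((5, 3), 2), ((3, 5), 2), ((4, 4), 0)]

    def kills(mutant):
        # A mutant is "killed" if at least one test observes a wrong output.
        return any(mutant(*args) != expected for args, expected in TEST_SUITE)

    killed = sum(kills(m) for m in MUTANTS)
    print(f"mutation score: {killed}/{len(MUTANTS)} = {killed / len(MUTANTS):.2f}")

Here the suite kills two of the three mutants, for a score of 0.67. Note that mutant_ror is semantically equivalent to the original, since both branches return 0 when a equals b, so no test suite can ever kill it; such equivalent mutants are one reason reported mutation scores rarely reach 100 percent.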

References

[1] Docjar. http://www.docjar.com.
[2] H. Agrawal, R. DeMillo, B. Hathaway, W. Hsu, E. Krauser, R. Martin, A. Mathur, and E. Spafford. Design of mutant operators for the C programming language. Technical Report SERC-TR-41-P, Department of Computer Science, Purdue University, Lafayette, Indiana, April 2006.
[3] J. H. Andrews, L. C. Briand, and Y. Labiche. Is mutation an appropriate tool for testing experiments? In International Conference on Software Engineering (ICSE), pages 402--411, 2005.
[4] J. H. Andrews, L. C. Briand, Y. Labiche, and A. S. Namin. Using mutation analysis for assessing and comparing testing coverage criteria. IEEE Transactions on Software Engineering, 32(8):608--624, 2006.
[5] J. S. Bradbury, J. R. Cordy, and J. Dingel. An empirical framework for comparing effectiveness of testing and property-based formal analysis. In ACM SIGPLAN/SIGSOFT Workshop on Program Analysis for Software Tools and Engineering (PASTE), pages 2--5, 2005.
[6] L. C. Briand, Y. Labiche, and M. M. Sówka. Automated, contract-based user testing of commercial-off-the-shelf components. In International Conference on Software Engineering (ICSE), pages 92--101, 2006.
[7] M. Delamaro and J. Maldonado. A tool for the assessment of test adequacy for C programs. In Proceedings of the Conference on Performability in Computing Systems (PCS 96), pages 79--95, New Brunswick, NJ, July 1996.
[8] H. Do and G. Rothermel. A controlled experiment assessing test case prioritization techniques via mutation faults. In IEEE International Conference on Software Maintenance (ICSM), pages 411--420, 2005.
[9] H. Do and G. Rothermel. On the use of mutation faults in empirical assessments of test case prioritization techniques. IEEE Transactions on Software Engineering, 32:733--752, 2006.
[10] R. A. Fisher. The Design of Experiments. MacMillan, 9th edition, 1971.
[11] P. G. Frankl and S. N. Weiss. An experimental comparison of the effectiveness of the all-uses and all-edges adequacy criteria. In Symposium on Testing, Analysis, and Verification, pages 154--164, 1991.
[12] G. Fraser and A. Zeller. Mutation-driven generation of unit tests and oracles. In International Symposium on Software Testing and Analysis (ISSTA), pages 147--158, 2010.
[13] J. Guilford. Fundamental Statistics in Psychology and Education. McGraw-Hill, New York, 1956.
[14] Y.-S. Ma, J. Offutt, and Y.-R. Kwon. MuJava: a mutation system for Java. In International Conference on Software Engineering (ICSE), pages 827--830, New York, NY, USA, 2006. ACM.
[15] J. Mayer and C. Schneckenburger. An empirical analysis and comparison of random testing techniques. In International Symposium on Empirical Software Engineering (ISESE), pages 105--114, 2006.
[16] C. Murphy, K. Shen, and G. E. Kaiser. Automatic system testing of programs without test oracles. In International Symposium on Software Testing and Analysis (ISSTA), pages 189--200, 2009.
[17] A. S. Namin and J. H. Andrews. The influence of size and coverage on test suite effectiveness. In International Symposium on Software Testing and Analysis (ISSTA), pages 57--68, 2009.
[18] A. S. Namin, J. H. Andrews, and D. J. Murdoch. Sufficient mutation operators for measuring test effectiveness. In International Conference on Software Engineering (ICSE), pages 351--360, 2008.
[19] A. Offutt, A. Lee, G. Rothermel, R. Untch, and C. Zapf. An experimental determination of sufficient mutation operators. ACM Transactions on Software Engineering and Methodology, 5(2):99--118, 1996.
[20] J. Offutt, Y.-S. Ma, and Y. R. Kwon. The class-level mutants of muJava. In International Workshop on Automation of Software Test (AST), pages 78--84, 2006.
[21] A. Pretschner, T. Mouelhi, and Y. L. Traon. Model-based tests for access control policies. In International Conference on Software Testing (ICST), pages 338--347, 2008.
[22] M. J. Rutherford, A. Carzaniga, and A. L. Wolf. Evaluating test suites and adequacy criteria using simulation-based models of distributed systems. IEEE Transactions on Software Engineering, 34(4):452--470, 2008.
[23] S. Sawilowsky and R. Blair. A more realistic look at the robustness and type II error properties of the t test to departures from population normality. Psychological Bulletin, 111:353--360, 1992.
[24] K. R. Walcott, M. L. Soffa, G. M. Kapfhammer, and R. S. Roos. Time-aware test suite prioritization. In International Symposium on Software Testing and Analysis (ISSTA), pages 1--12, 2006.
[25] W. E. Wong, V. Debroy, and B. Choi. A family of code coverage-based heuristics for effective fault localization. Journal of Systems and Software, 83(2):188--208, 2010.
[26] Q. Xie and A. M. Memon. Using a pilot study to derive a GUI model for automated testing. ACM Transactions on Software Engineering and Methodology, 18(2), 2008.
[27] T. Xie. Augmenting automatically generated unit-test suites with regression oracle checking. In European Conference on Object-Oriented Programming (ECOOP), pages 380--403, 2006.
[28] L. Zhang, S.-S. Hou, C. Guo, T. Xie, and H. Mei. Time-aware test-case prioritization using integer linear programming. In International Symposium on Software Testing and Analysis (ISSTA), pages 213--224, 2009.
[29] L. Zhang, S.-S. Hou, J.-J. Hu, T. Xie, and H. Mei. Is operator-based mutant selection superior to random mutant selection? In International Conference on Software Engineering (ICSE), pages 435--444, 2010.

Reviews

Andrew Brooks

A 2005 study [1] (Andrews et al., reference [3] above) reported the results of an experiment suggesting that software mutants exhibit properties similar to those of real faults; this experiment spurred the growth in popularity of mutation testing. The current work is a general replication of that 2005 study. The Proteum tool, with 108 mutation operators, was applied to the C programs and test cases used in the 2005 study. Figure 1(c) shows that hand-seeded faults were harder to detect than mutants, in agreement with the 2005 study. Figure 1(f), however, shows that detecting mutants tended to be harder than detecting real faults, in disagreement with the 2005 study. Experimental materials for Java programs were specially developed for the general replication. Figures 3(a) and 3(b) show that, for Java, mutants were harder to detect than hand-seeded faults, in disagreement with the now-replicated findings for C programs. The authors rightly conclude that there is a need for caution when interpreting and generalizing findings from mutation testing, and that extensive empirical studies are required to fully unravel the relationships between real faults, hand-seeded faults, and mutants. Table 6 reveals that three of the four Java programs had maximum mutation scores of less than 30 percent; some researchers might consider the test suites rather weak and unrepresentative of mutation testing in practice. Despite this criticism, this paper is strongly recommended to those researching software testing and those applying mutation testing as a technique in software quality assurance.

Online Computing Reviews Service
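
The kind of comparison the review describes can be sketched in a few lines. The following Python fragment is a hypothetical setup, not the authors' actual scripts: the detection sets are randomly generated stand-ins, whereas in the real experiments each set would record which tests from the pool detect a given mutant, hand-seeded fault, or real fault. It estimates mean detection ratios for randomly sampled test suites of fixed size, which is how test suite size enters as an influential factor.

    # Hypothetical sketch of the suite-sampling analysis, not the authors'
    # actual scripts. Detection sets are randomly generated stand-ins; in a
    # real study, each set lists the pool tests that detect a given fault.
    import random

    random.seed(0)
    NUM_TESTS = 100  # size of the test pool

    # 50 mutants and 10 (seeded or real) faults, each detected by some tests.
    mutant_sets = [set(random.sample(range(NUM_TESTS), random.randint(1, 30)))
                   for _ in range(50)]
    fault_sets = [set(random.sample(range(NUM_TESTS), random.randint(1, 15)))
                  for _ in range(10)]

    def detection_ratio(suite, detect_sets):
        # Fraction of mutants/faults detected by at least one test in the suite.
        return sum(bool(suite & s) for s in detect_sets) / len(detect_sets)

    def mean_ratio(detect_sets, suite_size, trials=1000):
        # Average detection ratio over many randomly sampled suites of one size.
        total = 0.0
        for _ in range(trials):
            suite = set(random.sample(range(NUM_TESTS), suite_size))
            total += detection_ratio(suite, detect_sets)
        return total / trials

    for size in (5, 10, 20):
        print(f"suite size {size:2d}: "
              f"mutation score {mean_ratio(mutant_sets, size):.2f} vs. "
              f"fault detection {mean_ratio(fault_sets, size):.2f}")

Under this design, whether mutants come out harder or easier to detect than seeded or real faults depends entirely on the detection sets, which is why results like those in Figures 1 and 3 can diverge across programs, languages, and operator sets.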

Published In

ISSTA '11: Proceedings of the 2011 International Symposium on Software Testing and Analysis
July 2011, 394 pages
ISBN: 9781450305624
DOI: 10.1145/2001420

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

  1. experimental design
  2. hand-seeded faults
  3. mutants
  4. mutation testing
  5. real faults
  6. statistical analysis

Qualifiers

  • Research-article

Conference

ISSTA '11

Acceptance Rates

Overall Acceptance Rate 58 of 213 submissions, 27%

Cited By

  • (2024) Mutation Coverage is not Strongly Correlated with Mutation Coverage. Proceedings of the 5th ACM/IEEE International Conference on Automation of Software Test (AST 2024), pages 1-11. DOI: 10.1145/3644032.3644442
  • (2024) Detecting Faults vs. Revealing Failures: Exploring the Missing Link. 2024 IEEE 24th International Conference on Software Quality, Reliability and Security (QRS), pages 115-126. DOI: 10.1109/QRS62785.2024.00021
  • (2024) Subsumption, correctness and relative correctness: Implications for software testing. Science of Computer Programming, article 103177. DOI: 10.1016/j.scico.2024.103177
  • (2024) A new perspective on the competent programmer hypothesis through the reproduction of real faults with repeated mutations. Software Testing, Verification and Reliability. DOI: 10.1002/stvr.1874
  • (2023) Three Forms of Mutant Subsumption: Basic, Strict and Broad. Software Technologies, pages 122-144. DOI: 10.1007/978-3-031-37231-5_6
  • (2023) Mutation-based data augmentation for software defect prediction. Journal of Software: Evolution and Process. DOI: 10.1002/smr.2634
  • (2022) Prioritizing mutants to guide mutation testing. Proceedings of the 44th International Conference on Software Engineering, pages 1743-1754. DOI: 10.1145/3510003.3510187
  • (2022) Inference and test generation using program invariants in chemical reaction networks. Proceedings of the 44th International Conference on Software Engineering, pages 1193-1205. DOI: 10.1145/3510003.3510176
  • (2022) The ratio of equivalent mutants. Journal of Systems and Software, 181. DOI: 10.1016/j.jss.2021.111039
  • (2021) On Understanding Contextual Changes of Failures. 2021 IEEE 21st International Conference on Software Quality, Reliability and Security (QRS), pages 1036-1047. DOI: 10.1109/QRS54544.2021.00112
